Blacklight Plugin

Sort order for non-Latin text

Details

  • Type: Improvement
  • Status: Closed
  • Priority: Major
  • Resolution: Fixed
  • Affects Version/s: 2.3
  • Fix Version/s: 2.2
  • Component/s: None
  • Description:
    Do search results sorted by title, author, or any other field come back in the appropriate alphanumeric order for non-Latin scripts and for diacritics, interfiled however they should be? There may be language-by-language issues: in Chinese, I believe sort order is determined by the number of strokes in the first pictographic character, not by an alphabet.

    Stanford has a title_sort and an author_sort field, but they don't sort correctly. We have a way to map Latin diacritics to plain characters (borrowed from Bob's code), but I'm not sure it's working, and I'm not sure what to do about non-Latin characters.

    Stanford "recently" converted Symphony to Unicode and partnered with Sirsi to get iLink to display and sort results appropriately, so someone here has a solid set of test cases and the internal expertise to help get BL up to snuff on these.

    Lauren Scott would be the key resource for coordinating this testing and the feedback from the appropriate language experts.
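
For illustration only, mapping Latin diacritics to plain characters for a sort field could look roughly like the sketch below. This is not Bob's code or the Stanford mapping; `fold_sort_key` and the sample titles are hypothetical, and the approach (Unicode decomposition, then dropping combining marks) is an assumption about what such a mapping does. Stripping every combining mark is probably too aggressive for some non-Latin scripts, so it only addresses the Latin half of the question.

```python
import unicodedata


def fold_sort_key(text: str) -> str:
    """Hypothetical helper: build a sort key with Latin diacritics folded away."""
    # Decompose, so that "é" becomes "e" followed by a combining acute accent.
    decomposed = unicodedata.normalize("NFD", text)
    # Drop combining marks (general category Mn) and keep the base letters.
    stripped = "".join(
        ch for ch in decomposed if unicodedata.category(ch) != "Mn"
    )
    # Case-fold so that upper and lower case interfile.
    return stripped.casefold()


# Made-up sample titles, written with escapes to make the code points explicit.
titles = [
    "Zebra",
    "\u00c9clair",  # Éclair
    "apple",
    "Eclipse",
    "\u00e9cole",   # école
]

sorted_titles = sorted(titles, key=fold_sort_key)
for title in sorted_titles:
    print(title)
```

With this key the accented titles interfile with the unaccented ones instead of collecting after "Zebra", which is where a plain code-point sort would put them.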

Activity

Naomi Dushay added a comment - 19/Mar/09 8:02 PM
careful with "composed" vs "decomposed" chars; traditional vs. other Chinese, etc.
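
A small sketch of why composed vs. decomposed matters for sorting, using only Python's standard `unicodedata` module. The sample strings are made up; the point is that the same visible letter can be stored two ways, and the two forms compare and sort differently until they are normalized to one form (NFC is used here as an assumption, NFD would work equally well as long as it is applied consistently).

```python
import unicodedata

composed = "\u00e9"     # é as a single precomposed code point
decomposed = "e\u0301"  # e followed by COMBINING ACUTE ACCENT

# The two strings render the same but are not equal code point by code point.
raw_equal = composed == decomposed
nfc_equal = (
    unicodedata.normalize("NFC", composed)
    == unicodedata.normalize("NFC", decomposed)
)

# Made-up values: the same accented letter stored in both forms, plus "f".
records = ["\u00e9b", "f", "e\u0301a"]

# A raw sort separates the two forms of é, with "f" landing between them.
raw_order = sorted(records)

# Normalizing first puts both é entries next to each other.
nfc_order = sorted(unicodedata.normalize("NFC", r) for r in records)

print(raw_equal, nfc_equal)
print(raw_order)
print(nfc_order)
```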
Naomi Dushay added a comment - 07/May/09 1:37 PM
This is now okay at Stanford, with the exception of the Ae and Oe ligatures, which I will put in a separate issue.
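
One possible reason the ligatures behave differently, offered as an assumption rather than a diagnosis of the Stanford setup: Æ and Œ are letters in their own right in Unicode and have no decomposition, so normalization-based folding leaves them untouched. A sketch of an explicit mapping follows; `LIGATURES` and `expand_ligatures` are hypothetical names, and whether "AE"/"OE" is the right expansion depends on local filing rules.

```python
import unicodedata

# Normalization does not split these characters into two letters.
ae_unchanged = unicodedata.normalize("NFKD", "\u00c6") == "\u00c6"  # Æ
oe_unchanged = unicodedata.normalize("NFKD", "\u0152") == "\u0152"  # Œ

# Hypothetical explicit mapping for building sort keys.
LIGATURES = str.maketrans(
    {
        "\u00c6": "AE",  # Æ
        "\u00e6": "ae",  # æ
        "\u0152": "OE",  # Œ
        "\u0153": "oe",  # œ
    }
)


def expand_ligatures(text: str) -> str:
    """Hypothetical helper: replace Ae/Oe ligatures with two plain letters."""
    return text.translate(LIGATURES)


print(ae_unchanged, oe_unchanged)
print(expand_ligatures("\u00c6sop"), expand_ligatures("\u0153uvre"))
```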

Dates

  • Created: 19/Mar/09 8:01 PM
  • Updated: 07/May/09 1:37 PM
  • Resolved: 07/May/09 1:37 PM