[uClibc 0000719]: Many non-European letters are classified non-alphabetic

bugs at busybox.net
Mon Feb 13 07:09:31 UTC 2006


The following issue has been SUBMITTED. 
====================================================================== 
http://busybox.net/bugs/view.php?id=719 
====================================================================== 
Reported By:                rfelker
Assigned To:                uClibc
====================================================================== 
Project:                    uClibc
Issue ID:                   719
Category:                   Internationalization / Localization
Reproducibility:            always
Severity:                   major
Priority:                   normal
Status:                     assigned
====================================================================== 
Date Submitted:             02-12-2006 23:09 PST
Last Modified:              02-12-2006 23:09 PST
====================================================================== 
Summary:                    Many non-European letters are classified non-alphabetic
Description: 
uClibc inherits this bug from glibc, which derives the alphabetic
property incorrectly. In addition to the letter general categories (L*)
and letter numbers (Nl), Unicode defines an "Other_Alphabetic" property in
http://www.unicode.org/Public/UNIDATA/PropList.txt, made up largely of
combining marks (Mn and Mc) in South Asian scripts which are certainly
letters. Arguably all combining marks should be included in class alpha
(otherwise decomposed alphabetic strings with accents/diacritics will be
nonalphabetic), but the ones in Other_Alphabetic MUST be included.

This bug results in most words in most South Asian scripts being
classified nonalphabetic; thus I consider it major.
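
The difference between the two derivations can be sketched in C. This is
only an illustration, not uClibc's or glibc's actual table code: the
struct, the function names and the five-entry table are hypothetical, and
the table is a hand-picked excerpt rather than generated Unicode data.
alpha_gc_only() looks at the general category alone, while
alpha_derived() also accepts anything flagged Other_Alphabetic.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Only the general categories this sketch needs (not the full set). */
enum gen_cat { GC_LU, GC_LL, GC_LT, GC_LM, GC_LO, GC_NL, GC_MN, GC_MC };

/* Hypothetical table row: a code point and two of its properties. */
struct cp_props {
    uint32_t cp;
    enum gen_cat gc;
    bool other_alphabetic; /* listed as Other_Alphabetic in PropList.txt */
};

/* Hand-picked excerpt for illustration; a real libc would use full,
 * generated tables. */
static const struct cp_props sample_table[] = {
    { 0x0041, GC_LU, false }, /* LATIN CAPITAL LETTER A   */
    { 0x0301, GC_MN, false }, /* COMBINING ACUTE ACCENT   */
    { 0x0915, GC_LO, false }, /* DEVANAGARI LETTER KA     */
    { 0x093E, GC_MC, true  }, /* DEVANAGARI VOWEL SIGN AA */
    { 0x0941, GC_MN, true  }, /* DEVANAGARI VOWEL SIGN U  */
};

/* Linear search is enough for a five-entry sample table. */
static const struct cp_props *lookup(uint32_t cp)
{
    size_t n = sizeof sample_table / sizeof sample_table[0];
    for (size_t i = 0; i < n; i++)
        if (sample_table[i].cp == cp)
            return &sample_table[i];
    return NULL; /* not in the excerpt */
}

/* True for the letter categories (L*) and letter numbers (Nl). */
static bool is_letter_category(enum gen_cat gc)
{
    switch (gc) {
    case GC_LU: case GC_LL: case GC_LT:
    case GC_LM: case GC_LO: case GC_NL:
        return true;
    default:
        return false;
    }
}

/* Derivation that ignores Other_Alphabetic (the behavior reported). */
bool alpha_gc_only(uint32_t cp)
{
    const struct cp_props *p = lookup(cp);
    return p != NULL && is_letter_category(p->gc);
}

/* Derivation that also honors Other_Alphabetic. */
bool alpha_derived(uint32_t cp)
{
    const struct cp_props *p = lookup(cp);
    return p != NULL && (is_letter_category(p->gc) || p->other_alphabetic);
}

/* A word counts as alphabetic only if every code point in it does. */
bool word_is_alpha(const uint32_t *s, size_t len, bool (*pred)(uint32_t))
{
    for (size_t i = 0; i < len; i++)
        if (!pred(s[i]))
            return false;
    return true;
}
```

With this excerpt, the two-code-point syllable U+0915 U+093E passes
word_is_alpha() under alpha_derived() but fails under alpha_gc_only(),
because the vowel sign U+093E is category Mc. U+0301 is rejected by both,
which matches the weaker requirement stated above (only the
Other_Alphabetic marks MUST be included).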
====================================================================== 

Issue History 
Date Modified   Username       Field                    Change               
====================================================================== 
02-12-06 23:09  rfelker        New Issue                                    
02-12-06 23:09  rfelker        Status                   new => assigned     
02-12-06 23:09  rfelker        Assigned To               => uClibc          
======================================================================




More information about the uClibc-cvs mailing list