[uClibc 0000713]: UTF-8 encoder/decoder reject UFFFE,UFFFF

bugs at busybox.net bugs at busybox.net
Fri Feb 10 07:16:52 UTC 2006


The following issue has been SUBMITTED. 
====================================================================== 
http://busybox.net/bugs/view.php?id=713 
====================================================================== 
Reported By:                rfelker
Assigned To:                uClibc
====================================================================== 
Project:                    uClibc
Issue ID:                   713
Category:                   Internationalization / Localization
Reproducibility:            always
Severity:                   minor
Priority:                   normal
Status:                     assigned
====================================================================== 
Date Submitted:             02-09-2006 23:16 PST
Last Modified:              02-09-2006 23:16 PST
====================================================================== 
Summary:                    UTF-8 encoder/decoder reject UFFFE,UFFFF
Description: 
These code points are noncharacters, but they are not invalid UTF-8
sequences. UTF-8 encodes 'Unicode scalar values' which can be any integer
in the range 0-0xd7ff or 0xe000-0x10ffff. See:

http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf
Section 3.9, page 73-74 (PDF page 20-21)

Read the last bullet point under D29:

To ensure that the mapping for a Unicode encoding form is one-to-one, all
Unicode scalar values, including those corresponding to noncharacter code
points and unassigned code points, must be mapped to unique code unit
sequences. Note that this requirement does not extend to high-surrogate
and low-surrogate code points, which are excluded by definition from the
set of Unicode scalar values.

====================================================================== 

Issue History 
Date Modified   Username       Field                    Change               
====================================================================== 
02-09-06 23:16  rfelker        New Issue                                    
02-09-06 23:16  rfelker        Status                   new => assigned     
02-09-06 23:16  rfelker        Assigned To               => uClibc          
======================================================================




More information about the uClibc-cvs mailing list