Cyrillic letters problem

Aurelien Jacobs aurel at gnuage.org
Fri Jan 20 12:35:37 UTC 2006


On Fri, 20 Jan 2006 13:09:17 +0100
walter harms <wharms at bfs.de> wrote:

> maybe i am wrong (read: no UTF experience),
> but text utils should use wchar_t instead of char and therefore
> use the wc functions.

It depends... With UTF-8, if you want to handle a single code point
(a single character), then you need to use wchar_t, or simply int.
When you want to handle a string, you still use a char *, but
there is no longer a 1:1 mapping between a byte and an actual character.
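
To make the byte vs. character distinction concrete, here is a minimal
sketch of walking a UTF-8 char * one code point at a time. The names
utf8_decode() and utf8_count() are made up for this example (they are
not busybox or libc functions), and the decoder is simplified: it
assumes a NUL-terminated string and does not reject overlong encodings
or surrogate values.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Decode the UTF-8 sequence starting at s into *cp.
 * Returns the number of bytes consumed (1 to 4), or 0 if the sequence
 * is invalid or cut short by the terminating NUL.
 * Hypothetical helper: simplified, no overlong/surrogate checks.
 */
static size_t utf8_decode(const char *s, uint32_t *cp)
{
    const unsigned char *p = (const unsigned char *)s;
    size_t len, i;
    uint32_t c;

    if (p[0] < 0x80) {                  /* 0xxxxxxx: plain ASCII */
        *cp = p[0];
        return 1;
    } else if ((p[0] & 0xE0) == 0xC0) { /* 110xxxxx: 2-byte sequence */
        c = p[0] & 0x1F;
        len = 2;
    } else if ((p[0] & 0xF0) == 0xE0) { /* 1110xxxx: 3-byte sequence */
        c = p[0] & 0x0F;
        len = 3;
    } else if ((p[0] & 0xF8) == 0xF0) { /* 11110xxx: 4-byte sequence */
        c = p[0] & 0x07;
        len = 4;
    } else {
        return 0;                       /* stray continuation or bad lead byte */
    }

    for (i = 1; i < len; i++) {
        /* Every following byte must look like 10xxxxxx. A NUL fails
         * this test too, so we never read past the end of the string. */
        if ((p[i] & 0xC0) != 0x80)
            return 0;
        c = (c << 6) | (p[i] & 0x3F);
    }

    *cp = c;
    return len;
}

/*
 * Count the code points in a NUL-terminated UTF-8 string.
 * Returns (size_t)-1 if an invalid sequence is found.
 */
static size_t utf8_count(const char *s)
{
    size_t n = 0;

    while (*s) {
        uint32_t cp;
        size_t len = utf8_decode(s, &cp);

        if (len == 0)
            return (size_t)-1;
        s += len;
        n++;
    }
    return n;
}
```

For the Cyrillic word "привет", stored as the bytes
"\xd0\xbf\xd1\x80\xd0\xb8\xd0\xb2\xd0\xb5\xd1\x82", strlen() reports 12
while utf8_count() reports 6. The decoded value is kept in a uint32_t
here, which plays the role of the "wchar_t, or simply int" above.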

> is there a HOWTO for UTF programming somewhere ?

This might help :
  http://www.cl.cam.ac.uk/~mgk25/unicode.html
And wikipedia also has lots of details and links :
  http://en.wikipedia.org/wiki/Unicode

Aurel


