[Bug 7538] New: uncorrect count symbols in Unicode

bugzilla at busybox.net bugzilla at busybox.net
Fri Oct 17 11:56:39 UTC 2014


https://bugs.busybox.net/show_bug.cgi?id=7538

           Summary: uncorrect count symbols in Unicode
           Product: Busybox
           Version: unspecified
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: minor
          Priority: P5
         Component: Other
        AssignedTo: unassigned at busybox.net
        ReportedBy: mad.deer at gmail.com
                CC: busybox-cvs at busybox.net
   Estimated Hours: 0.0


I have locale LANG=en_US.UTF-8 but some tools in busubox all versions doesn't
work with unicode correct. For example awk : 

$ echo тест | busybox awk '{ print length($0) }' 
8
$ echo test | busybox awk '{ print length($0) }' 
4
$ echo тест | awk '{ print length($0) }' #it's standart awk of distrib 
4 

same situation is in sed: 
$ echo -ne "тест" | sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m 
11
$ echo -ne "тест" | busybox  sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m 
7
$ echo -ne "test" | busybox sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m 
11

-- 
Configure bugmail: https://bugs.busybox.net/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


More information about the busybox-cvs mailing list