[Bug 7538] New: uncorrect count symbols in Unicode

bugzilla at busybox.net bugzilla at busybox.net
Fri Oct 17 11:56:39 UTC 2014


           Summary: uncorrect count symbols in Unicode
           Product: Busybox
           Version: unspecified
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: minor
          Priority: P5
         Component: Other
        AssignedTo: unassigned at busybox.net
        ReportedBy: mad.deer at gmail.com
                CC: busybox-cvs at busybox.net
   Estimated Hours: 0.0

I have locale LANG=en_US.UTF-8 but some tools in busubox all versions doesn't
work with unicode correct. For example awk : 

$ echo тест | busybox awk '{ print length($0) }' 
$ echo test | busybox awk '{ print length($0) }' 
$ echo тест | awk '{ print length($0) }' #it's standart awk of distrib 

same situation is in sed: 
$ echo -ne "тест" | sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m 
$ echo -ne "тест" | busybox  sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m 
$ echo -ne "test" | busybox sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m 

Configure bugmail: https://bugs.busybox.net/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

More information about the busybox-cvs mailing list