[Bug 7538] New: uncorrect count symbols in Unicode
bugzilla at busybox.net
bugzilla at busybox.net
Fri Oct 17 11:56:39 UTC 2014
https://bugs.busybox.net/show_bug.cgi?id=7538
Summary: uncorrect count symbols in Unicode
Product: Busybox
Version: unspecified
Platform: PC
OS/Version: Linux
Status: NEW
Severity: minor
Priority: P5
Component: Other
AssignedTo: unassigned at busybox.net
ReportedBy: mad.deer at gmail.com
CC: busybox-cvs at busybox.net
Estimated Hours: 0.0
I have locale LANG=en_US.UTF-8 but some tools in busubox all versions doesn't
work with unicode correct. For example awk :
$ echo тест | busybox awk '{ print length($0) }'
8
$ echo test | busybox awk '{ print length($0) }'
4
$ echo тест | awk '{ print length($0) }' #it's standart awk of distrib
4
same situation is in sed:
$ echo -ne "тест" | sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m
11
$ echo -ne "тест" | busybox sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m
7
$ echo -ne "test" | busybox sed -e :a -e "s/^.\{0,10\}$/& /;ta" | wc -m
11
--
Configure bugmail: https://bugs.busybox.net/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
More information about the busybox-cvs
mailing list