GNU bug report logs - #47858
Document that FULLWIDTH DIGITs boggle coreutils brain currently

Previous Next

Package: coreutils;

Reported by: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>

Date: Sun, 18 Apr 2021 01:13:04 UTC

Severity: minor

To reply to this bug, email your comments to 47858 AT debbugs.gnu.org.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-coreutils <at> gnu.org:
bug#47858; Package coreutils. (Sun, 18 Apr 2021 01:13:04 GMT) Full text and rfc822 format available.

Acknowledgement sent to 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Sun, 18 Apr 2021 01:13:04 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: 積丹尼 Dan Jacobson <jidanni <at> jidanni.org>
To: bug-coreutils <at> gnu.org
Subject: Document that FULLWIDTH DIGITs boggle coreutils brain currently
Date: Sun, 18 Apr 2021 08:09:24 +0800
(info "(coreutils) Correct/Incorrect ordering and Expected/Unexpected results")
needs to "admit" or "at least mention" that "coreutils just hasn't thought about FULLWIDTH
DIGITs yet." I.e., Europeans are in luck. East Asians are out of luck.

$ seq 9 11|sort
10
11
9
$ seq 9 11|sort -n
9
10
11
$ seq 9 11|sort -V
9
10
11

$ echo -e 9\\n10\\n11|sort
10
11
9
$ echo -e 9\\n10\\n11|sort -V
10
11
9
$ echo -e 9\\n10\\n11|sort -n
10
11
9

$ unicode 9
U+FF19 FULLWIDTH DIGIT NINE
UTF-8: ef bc 99 UTF-16BE: ff19 Decimal: &#65305; Octal: \0177431
9
Category: Nd (Number, Decimal Digit); East Asian width: F (fullwidth)
Unicode block: FF00..FFEF; Halfwidth and Fullwidth Forms
Numeric value: 9
Digit value: 9
Bidi: EN (European Number)

Decomposition: <wide> 0039

Nope, LC_ALL=zh_TW.UTF-8 didn't help.
sort (GNU coreutils) 8.32




This bug report was last modified 3 years and 15 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.