GNU bug report logs - #26319
sort -u deletes characters it shouldn't

Previous Next

Package: coreutils;

Reported by: sort-u <at> re-gister.com

Date: Fri, 31 Mar 2017 04:35:02 UTC

Severity: normal

Tags: notabug

Done: Assaf Gordon <assafgordon <at> gmail.com>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 26319 in the body.
You can then email your comments to 26319 AT debbugs.gnu.org in the normal way.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox


Report forwarded to bug-coreutils <at> gnu.org:
bug#26319; Package coreutils. (Fri, 31 Mar 2017 04:35:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to sort-u <at> re-gister.com:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Fri, 31 Mar 2017 04:35:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: sort-u <at> re-gister.com
To: bug-coreutils <at> gnu.org
Subject: sort -u deletes characters it shouldn't
Date: Thu, 30 Mar 2017 23:27:53 +0200
[Message part 1 (text/plain, inline)]
sort (GNU coreutils) 8.26local of the system: en_US.UTF-8contents of /etc/locale.conf:LANG=en_US.UTF-8LC_TIME=fr_FR.UTF-8LC_NUMERIC=fr_FR.UTF-8========================================$ echo $'䜌\n龹' | sort -uExpected output:龹䜌Actual output:䜌
[Message part 2 (text/html, inline)]

Information forwarded to bug-coreutils <at> gnu.org:
bug#26319; Package coreutils. (Fri, 31 Mar 2017 05:16:02 GMT) Full text and rfc822 format available.

Message #8 received at 26319 <at> debbugs.gnu.org (full text, mbox):

From: Pádraig Brady <P <at> draigBrady.com>
To: sort-u <at> re-gister.com, 26319 <at> debbugs.gnu.org
Subject: Re: bug#26319: sort -u deletes characters it shouldn't
Date: Thu, 30 Mar 2017 22:15:04 -0700
tag 26319 notabug
close 26319
stop

On 30/03/17 14:27, sort-u <at> re-gister.com wrote:
> sort (GNU coreutils) 8.26
> $ echo $'䜌\n龹' | sort -u
> Expected output:龹䜌
> Actual output:䜌

That's because the strcoll() in that locale is comparing them as equal.
This needs to be addressed in the locale data for glibc etc.

thanks,
Pádraig




Added tag(s) notabug. Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Mon, 29 Oct 2018 03:00:03 GMT) Full text and rfc822 format available.

bug closed, send any further explanations to 26319 <at> debbugs.gnu.org and sort-u <at> re-gister.com Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Mon, 29 Oct 2018 03:00:03 GMT) Full text and rfc822 format available.

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Mon, 26 Nov 2018 12:24:05 GMT) Full text and rfc822 format available.

This bug report was last modified 5 years and 153 days ago.

Previous Next


GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.