Received: (at 54124) by debbugs.gnu.org; 24 Feb 2022 03:06:38 +0000
Message-ID: <239351e3-01e9-3a64-1336-b049b7250d4d@HIDDEN>
Date: Wed, 23 Feb 2022 19:06:29 -0800
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101
Thunderbird/91.5.0
Subject: Re: bug#54124: fmt inserts garbage in certain cases?
Content-Language: en-US
To: =?UTF-8?Q?P=c3=a1draig_Brady?= <P@HIDDEN>, JD <john1doe@HIDDEN>,
54124 <at> debbugs.gnu.org
References: <CI3DGCYKZW8W.C8AILPSC6NEH@HIDDEN>
<74f1591a-b7b6-525a-0a15-85d2bf017769@HIDDEN>
<cb3e1d02-0c4f-11c2-21ca-f148bda09cde@HIDDEN>
From: Paul Eggert <eggert@HIDDEN>
Organization: UCLA Computer Science Department
In-Reply-To: <cb3e1d02-0c4f-11c2-21ca-f148bda09cde@HIDDEN>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: quoted-printable
X-Spam-Score: -2.3 (--)
X-Debbugs-Envelope-To: 54124
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>,
<mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>,
<mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -3.3 (---)
On 2/23/22 17:29, Pádraig Brady wrote:
> Given isspace('\n') returns true, then it makes some sense that
> isspace("Next Line")
> would return true,
POSIX says that the application must ensure that the argument to isspace is
either EOF or "a character representable as an unsigned char", and
arguably, since 0x85 is neither of those things, the behavior of
isspace(0x85) is undefined.
However, the C standard does not have this wording, and since POSIX is
supposed to defer to the C standard here, this appears to be a bug in
POSIX (as well as a bug in macOS). It's understandable if the Apple C
library's developers got confused by the POSIX wording.
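Independently of which standard's wording applies, callers that classify bytes held in a plain char commonly convert to unsigned char before calling isspace(), so that the argument stays within the range both texts cover. A minimal sketch of that idiom follows; the helper name byte_is_space is hypothetical and is not taken from coreutils or gnulib.

```c
#include <ctype.h>
#include <stdbool.h>

/* Hypothetical helper (not from coreutils): classify one byte that was
   stored in a plain char.  Converting to unsigned char first keeps the
   argument non-negative even where plain char is signed, so a byte such
   as 0x85 is passed as 133 rather than as a negative value.  */
bool
byte_is_space (char c)
{
  return isspace ((unsigned char) c) != 0;
}
```

The result still depends on the current locale; the conversion only addresses the range of the argument, not what the C library decides for bytes above 0x7F.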
bug-coreutils@HIDDEN:bug#54124; Package coreutils.
Full text available.
Received: (at 54124) by debbugs.gnu.org; 24 Feb 2022 01:30:06 +0000
Message-ID: <cb3e1d02-0c4f-11c2-21ca-f148bda09cde@HIDDEN>
Date: Thu, 24 Feb 2022 01:29:56 +0000
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:97.0) Gecko/20100101
Thunderbird/97.0
Subject: Re: bug#54124: fmt inserts garbage in certain cases?
Content-Language: en-US
From: =?UTF-8?Q?P=c3=a1draig_Brady?= <P@HIDDEN>
To: JD <john1doe@HIDDEN>, 54124 <at> debbugs.gnu.org
References: <CI3DGCYKZW8W.C8AILPSC6NEH@HIDDEN>
<74f1591a-b7b6-525a-0a15-85d2bf017769@HIDDEN>
In-Reply-To: <74f1591a-b7b6-525a-0a15-85d2bf017769@HIDDEN>
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
On 23/02/2022 17:55, Pádraig Brady wrote:
> I think isspace(0x85) returning true on macOS is a bug,
Bug is a bit of a strong word here.
A digression into why 0x85 is being treated specially here.
Note Cyrillic kha "х" is encoded in UTF-8 as:
$ printf '\u0445' | od -tx1
0000000 d1 85
What I think is happening is that \u0085 represents "Next Line" in Unicode.
This is present in Unicode to support mapping to/from the corresponding char in EBCDIC,
which had a distinct char for this in addition to CR and LF.
Given isspace('\n') returns true, then it makes some sense that isspace("Next Line")
would return true, and I guess through implementation details
isspace(int) is operating on UTF-32 on macOS in UTF-8 locales
and so returns true for this value.
BTW 0xA0 is the only other value that isspace() returns true for
(other than the standard c_isspace() values of course).
This is the non-breaking space, so it's best we don't split on it anyway.
I.e. this is another benefit to the change.
I still think using c_isspace() to avoid this issue is best,
and intend to push the change tomorrow.
cheers,
Pádraig
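The byte-level coincidence described in this message can be checked with a small UTF-8 encoder. The following is only an illustrative sketch; the function name utf8_encode is hypothetical and not part of coreutils or gnulib. It shows that U+0445 (Cyrillic kha) encodes as D1 85, so its second byte has the same numeric value as the code point U+0085 (Next Line), whereas U+0085 itself encodes as C2 85.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: encode one Unicode scalar value as UTF-8.
   Writes up to 4 bytes to OUT and returns the number written,
   or 0 if CP is a surrogate or lies beyond U+10FFFF.  */
size_t
utf8_encode (uint32_t cp, unsigned char out[4])
{
  if (cp < 0x80)
    {
      out[0] = (unsigned char) cp;
      return 1;
    }
  if (cp < 0x800)
    {
      out[0] = (unsigned char) (0xC0 | (cp >> 6));
      out[1] = (unsigned char) (0x80 | (cp & 0x3F));
      return 2;
    }
  if (cp < 0x10000)
    {
      if (cp >= 0xD800 && cp <= 0xDFFF)
        return 0;               /* surrogates are not scalar values */
      out[0] = (unsigned char) (0xE0 | (cp >> 12));
      out[1] = (unsigned char) (0x80 | ((cp >> 6) & 0x3F));
      out[2] = (unsigned char) (0x80 | (cp & 0x3F));
      return 3;
    }
  if (cp <= 0x10FFFF)
    {
      out[0] = (unsigned char) (0xF0 | (cp >> 18));
      out[1] = (unsigned char) (0x80 | ((cp >> 12) & 0x3F));
      out[2] = (unsigned char) (0x80 | ((cp >> 6) & 0x3F));
      out[3] = (unsigned char) (0x80 | (cp & 0x3F));
      return 4;
    }
  return 0;
}
```

Calling utf8_encode (0x0445, buf) fills buf with D1 85, matching the od output quoted above. A byte-at-a-time classifier that is handed the trailing 0x85 cannot tell it apart from the code point U+0085.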
Received: (at 54124) by debbugs.gnu.org; 23 Feb 2022 17:56:11 +0000
Content-Type: multipart/mixed; boundary="------------4iW4YB0vtD8l2k0PcckECv41"
Message-ID: <74f1591a-b7b6-525a-0a15-85d2bf017769@HIDDEN>
Date: Wed, 23 Feb 2022 17:55:49 +0000
MIME-Version: 1.0
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:97.0) Gecko/20100101
Thunderbird/97.0
Subject: Re: bug#54124: fmt inserts garbage in certain cases?
Content-Language: en-US
To: JD <john1doe@HIDDEN>, 54124 <at> debbugs.gnu.org
References: <CI3DGCYKZW8W.C8AILPSC6NEH@HIDDEN>
From: =?UTF-8?Q?P=c3=a1draig_Brady?= <P@HIDDEN>
In-Reply-To: <CI3DGCYKZW8W.C8AILPSC6NEH@HIDDEN>
This is a multi-part message in MIME format.
--------------4iW4YB0vtD8l2k0PcckECv41
Content-Type: text/plain; charset=UTF-8; format=flowed
Content-Transfer-Encoding: 8bit
On 23/02/2022 10:58, JD wrote:
> Hi!
>
> I have fmt from coreutils 8.32.1 installed via MacPorts.
>
> If I run the following command: `echo х х х х х х х х х х х х х х х х х х х х х х х х х х | gfmt -sw 10` (which is just echoing 26 Cyrillic 'х' ('kha') letters), I get the following results:
>
> https://i.imgur.com/yRx7uuz.png (iTerm2)
> https://i.imgur.com/7oQ0UPz.png (iTerm2 if passed via `more`)
> https://i.imgur.com/UlLrEMy.png (Alacritty)
>
> And if I delete just two 'х' letters, like this: `echo х х х х х х х х х х х х х х х х х х х х х х х х | gfmt -sw 10`, everything shows just fine: https://i.imgur.com/DwuWxyx.png
>
> Would be grateful for any advice :)
The issue here is that (on macOS 10.15.7 at least),
isspace(0x85) returns true for UTF-8 locales
(but not for "C" or "iso8859-1" locales).
BTW iscntrl() returns true for 0x85 in all non-C locales
on both Linux and macOS.
Now gnulib says wrt isspace() that:
"This function's behaviour depends on the locale, but does not support
the multibyte characters that occur in strings in locales with
@code{MB_CUR_MAX > 1} (this includes all the common UTF-8 locales)."
I think isspace(0x85) returning true on macOS is a bug,
but we should probably avoid isspace() in fmt altogether
given its inconsistency with multibyte locales.
The attached uses c_isspace() instead.
cheers,
Pádraig
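To illustrate why the choice of predicate matters when scanning byte by byte, here is a minimal sketch rather than the actual fmt code. All three function names are hypothetical: ascii_isspace stands in for gnulib's c_isspace() (assumed here to accept only the six ASCII whitespace characters), and reported_macos_isspace merely models the behaviour reported in this thread by additionally accepting 0x85 and 0xA0.

```c
#include <stddef.h>

/* Stand-in for c_isspace(): ASCII space, \t, \n, \v, \f, \r only.  */
int
ascii_isspace (int c)
{
  return c == ' ' || (c >= '\t' && c <= '\r');
}

/* Model of the reported macOS UTF-8 locale behaviour (an assumption
   based on this thread, not a call into any real C library).  */
int
reported_macos_isspace (int c)
{
  return ascii_isspace (c) || c == 0x85 || c == 0xA0;
}

/* Length in bytes of the first word in S, scanning one byte at a time
   until the terminating NUL or a byte that IS_SPACE accepts.  */
size_t
word_length (const unsigned char *s, int (*is_space) (int))
{
  size_t n = 0;
  while (s[n] != '\0' && !is_space (s[n]))
    n++;
  return n;
}
```

For the input bytes D1 85 20 D1 85 (two kha letters separated by a space), word_length returns 2 with ascii_isspace but 1 with reported_macos_isspace, i.e. the word is cut after the lead byte D1 and the character is split in the middle.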
--------------4iW4YB0vtD8l2k0PcckECv41
Content-Type: text/x-patch; charset=UTF-8; name="fmt-utf8-macOS.patch"
Content-Disposition: attachment; filename="fmt-utf8-macOS.patch"
Content-Transfer-Encoding: base64
RnJvbSAxNjZiNjc4M2JjMWE2ZTBjZTIwNjExNGMxZDU5M2MyNTI4ZTNjZmExIE1vbiBTZXAg
MTcgMDA6MDA6MDAgMjAwMQpGcm9tOiA9P1VURi04P3E/UD1DMz1BMWRyYWlnPTIwQnJhZHk/
PSA8UEBkcmFpZ0JyYWR5LmNvbT4KRGF0ZTogV2VkLCAyMyBGZWIgMjAyMiAxNzo1MDo0NiAr
MDAwMApTdWJqZWN0OiBbUEFUQ0hdIGZtdDogZml4IGludmFsaWQgbXVsdGktYnl0ZSBzcGxp
dHRpbmcgb24gbWFjT1MKCk9uIG1hY09TLCBpc3NwYWNlKDB4ODUpIHJldHVybnMgdHJ1ZSwK
d2hpY2ggcmVzdWx0cyBpbiBzcGxpdHRpbmcgd2l0aGluIG11bHRpLWJ5dGUgY2hhcmFjdGVy
cy4KCiogc3JjL2ZtdC5jIChnZXRfbGluZSk6IHMvaXNzcGFjZS9jX2lzc3BhY2UvLgoqIHRl
c3RzL2ZtdC9ub24tc3BhY2Uuc2g6IEFkZCBhIG5ldyB0ZXN0LgoqIHRlc3RzL2xvY2FsLm1r
OiBSZWZlcmVuY2UgbmV3IHRlc3QuCiogTkVXUzogTWVudGlvbiB0aGUgZml4LgpBZGRyZXNz
ZXMgaHR0cHM6Ly9idWdzLmdudS5vcmcvNTQxMjQKLS0tCiBORVdTICAgICAgICAgICAgICAg
ICAgIHwgIDQgKysrKwogc3JjL2ZtdC5jICAgICAgICAgICAgICB8ICAzICsrLQogdGVzdHMv
Zm10L25vbi1zcGFjZS5zaCB8IDQ5ICsrKysrKysrKysrKysrKysrKysrKysrKysrKysrKysr
KysrKysrKysrKwogdGVzdHMvbG9jYWwubWsgICAgICAgICB8ICAzICsrLQogNCBmaWxlcyBj
aGFuZ2VkLCA1NyBpbnNlcnRpb25zKCspLCAyIGRlbGV0aW9ucygtKQogY3JlYXRlIG1vZGUg
MTAwNzU1IHRlc3RzL2ZtdC9ub24tc3BhY2Uuc2gKCmRpZmYgLS1naXQgYS9ORVdTIGIvTkVX
UwppbmRleCBlZjY1YjRhYjguLjM1ZDlhNTBkZCAxMDA2NDQKLS0tIGEvTkVXUworKysgYi9O
RVdTCkBAIC0yMSw2ICsyMSwxMCBAQCBHTlUgY29yZXV0aWxzIE5FV1MgICAgICAgICAgICAg
ICAgICAgICAgICAgICAgICAgICAgICAtKi0gb3V0bGluZSAtKi0KICAgYW5kIEIgaXMgaW4g
c29tZSBvdGhlciBmaWxlIHN5c3RlbS4KICAgW2J1ZyBpbnRyb2R1Y2VkIGluIGNvcmV1dGls
cy05LjBdCiAKKyAgT24gbWFjT1MsIGZtdCBubyBsb25nZXIgY29ycnVwdHMgbXVsdGktYnl0
ZSBjaGFyYWN0ZXJzCisgIGJ5IG1pc2RldGVjdGluZyB0aGVpciBjb21wb25lbnQgYnl0ZXMg
YXMgc3BhY2VzLgorICBbVGhpcyBidWcgd2FzIHByZXNlbnQgaW4gInRoZSBiZWdpbm5pbmci
Ll0KKwogICAnaWQgeHl6JyBub3cgdXNlcyB0aGUgbmFtZSAneHl6JyB0byBkZXRlcm1pbmUg
Z3JvdXBzLCBpbnN0ZWFkIG9mIHh5eidzIHVpZC4KICAgW2J1ZyBpbnRyb2R1Y2VkIGluIGNv
cmV1dGlscy04LjIyXQogCmRpZmYgLS1naXQgYS9zcmMvZm10LmMgYi9zcmMvZm10LmMKaW5k
ZXggMWViNzAxOWIwLi4wNWJhZmFiZDYgMTAwNjQ0Ci0tLSBhL3NyYy9mbXQuYworKysgYi9z
cmMvZm10LmMKQEAgLTI2LDYgKzI2LDcgQEAKICAgIGl0IHRvIGJlIGEgdHlwZSBnZXQgc3lu
dGF4IGVycm9ycyBmb3IgdGhlIHZhcmlhYmxlIGRlY2xhcmF0aW9uIGJlbG93LiAgKi8KICNk
ZWZpbmUgd29yZCB1bnVzZWRfd29yZF90eXBlCiAKKyNpbmNsdWRlICJjLWN0eXBlLmgiCiAj
aW5jbHVkZSAic3lzdGVtLmgiCiAjaW5jbHVkZSAiZXJyb3IuaCIKICNpbmNsdWRlICJkaWUu
aCIKQEAgLTcwMiw3ICs3MDMsNyBAQCBnZXRfbGluZSAoRklMRSAqZiwgaW50IGMpCiAgICAg
ICAgICAgKndwdHIrKyA9IGM7CiAgICAgICAgICAgYyA9IGdldGMgKGYpOwogICAgICAgICB9
Ci0gICAgICB3aGlsZSAoYyAhPSBFT0YgJiYgIWlzc3BhY2UgKGMpKTsKKyAgICAgIHdoaWxl
IChjICE9IEVPRiAmJiAhY19pc3NwYWNlIChjKSk7CiAgICAgICBpbl9jb2x1bW4gKz0gd29y
ZF9saW1pdC0+bGVuZ3RoID0gd3B0ciAtIHdvcmRfbGltaXQtPnRleHQ7CiAgICAgICBjaGVj
a19wdW5jdHVhdGlvbiAod29yZF9saW1pdCk7CiAKZGlmZiAtLWdpdCBhL3Rlc3RzL2ZtdC9u
b24tc3BhY2Uuc2ggYi90ZXN0cy9mbXQvbm9uLXNwYWNlLnNoCm5ldyBmaWxlIG1vZGUgMTAw
NzU1CmluZGV4IDAwMDAwMDAwMC4uYjU5ODM4OTgzCi0tLSAvZGV2L251bGwKKysrIGIvdGVz
dHMvZm10L25vbi1zcGFjZS5zaApAQCAtMCwwICsxLDQ5IEBACisjIS9iaW4vc2gKKyMgVGVz
dCBmbXQgc3BhY2UgaGFuZGxpbmcKKworIyBDb3B5cmlnaHQgKEMpIDIwMjIgRnJlZSBTb2Z0
d2FyZSBGb3VuZGF0aW9uLCBJbmMuCisKKyMgVGhpcyBwcm9ncmFtIGlzIGZyZWUgc29mdHdh
cmU6IHlvdSBjYW4gcmVkaXN0cmlidXRlIGl0IGFuZC9vciBtb2RpZnkKKyMgaXQgdW5kZXIg
dGhlIHRlcm1zIG9mIHRoZSBHTlUgR2VuZXJhbCBQdWJsaWMgTGljZW5zZSBhcyBwdWJsaXNo
ZWQgYnkKKyMgdGhlIEZyZWUgU29mdHdhcmUgRm91bmRhdGlvbiwgZWl0aGVyIHZlcnNpb24g
MyBvZiB0aGUgTGljZW5zZSwgb3IKKyMgKGF0IHlvdXIgb3B0aW9uKSBhbnkgbGF0ZXIgdmVy
c2lvbi4KKworIyBUaGlzIHByb2dyYW0gaXMgZGlzdHJpYnV0ZWQgaW4gdGhlIGhvcGUgdGhh
dCBpdCB3aWxsIGJlIHVzZWZ1bCwKKyMgYnV0IFdJVEhPVVQgQU5ZIFdBUlJBTlRZOyB3aXRo
b3V0IGV2ZW4gdGhlIGltcGxpZWQgd2FycmFudHkgb2YKKyMgTUVSQ0hBTlRBQklMSVRZIG9y
IEZJVE5FU1MgRk9SIEEgUEFSVElDVUxBUiBQVVJQT1NFLiAgU2VlIHRoZQorIyBHTlUgR2Vu
ZXJhbCBQdWJsaWMgTGljZW5zZSBmb3IgbW9yZSBkZXRhaWxzLgorCisjIFlvdSBzaG91bGQg
aGF2ZSByZWNlaXZlZCBhIGNvcHkgb2YgdGhlIEdOVSBHZW5lcmFsIFB1YmxpYyBMaWNlbnNl
CisjIGFsb25nIHdpdGggdGhpcyBwcm9ncmFtLiAgSWYgbm90LCBzZWUgPGh0dHBzOi8vd3d3
LmdudS5vcmcvbGljZW5zZXMvPi4KKworLiAiJHtzcmNkaXI9Ln0vdGVzdHMvaW5pdC5zaCI7
IHBhdGhfcHJlcGVuZF8gLi9zcmMKK3ByaW50X3Zlcl8gZm10IHByaW50ZgorCisjIEJlZm9y
ZSBjb3JldXRpbHMgOS4xIG1hY09TIHRyZWF0ZWQgYnl0ZXMgbGlrZSAweDg1CisjIGFzIHNw
YWNlIGNoYXJhY3RlcnMgaW4gbXVsdGktYnl0ZSBsb2NhbGVzIChpbmNsdWRpbmcgVVRGLTgp
CisKK2NoZWNrX25vbl9zcGFjZSgpIHsKKyAgY2hhcj0iJDEiCisgIHRlc3QgIiQoZW52IHBy
aW50ZiAiPSRjaGFyPSIgfCBmbXQgLXMgLXcxIHwgd2MgLWwpIiA9IDEgfHwgZmFpbD0xCit9
CisKK2V4cG9ydCBMQ19BTEw9ZW5fVVMuaXNvODg1OS0xICAjIG9ubHkgbG93ZXJjYXNlIGZv
cm0gd29ya3Mgb24gbWFjT1MgMTAuMTUuNworaWYgdGVzdCAiJChsb2NhbGUgY2hhcm1hcCAy
Pi9kZXYvbnVsbCB8IHNlZCAncy9pc28vSVNPLS8nKSIgPSBJU08tODg1OS0xOyB0aGVuCisg
IGNoZWNrX25vbl9zcGFjZSAnXHhBMCcKK2ZpCisKK2V4cG9ydCBMQ19BTEw9ZW5fVVMuVVRG
LTgKK2lmIHRlc3QgIiQobG9jYWxlIGNoYXJtYXAgMj4vZGV2L251bGwpIiA9IFVURi04OyB0
aGVuCisgIGNoZWNrX25vbl9zcGFjZSAnXHUwMEEwJyAgIyBObyBicmVhayBzcGFjZQorICBj
aGVja19ub25fc3BhY2UgJ1x1MjAwNycgICMgVE9ETzogc2hvdWxkIHByb2JhYmx5IHNwbGl0
IG9uIGZpZ3VyZSBzcGFjZQorICBjaGVja19ub25fc3BhY2UgJ1x1MjAyRicgICMgTmFycm93
IG5vIGJyZWFrIHNwYWNlCisgIGNoZWNrX25vbl9zcGFjZSAnXHUyMDYwJyAgIyB6ZXJvLXdp
ZHRoIG5vIGJyZWFrIHNwYWNlCisgIGNoZWNrX25vbl9zcGFjZSAnXHUwNDQ1JyAgIyBDeXJp
bGxpYyBraGEgaGFzIDB4ODUsIHdoaWNoIG1hY09TIGlzc3BhY2UoKT10cnVlCitmaQorCitl
eHBvcnQgTENfQUxMPXJ1X1JVLktPSTgtUgoraWYgdGVzdCAiJChsb2NhbGUgY2hhcm1hcCAy
Pi9kZXYvbnVsbCkiID0gS09JOC1SOyB0aGVuCisgIGNoZWNrX25vbl9zcGFjZSAnXHg5QScK
K2ZpCisKK0V4aXQgJGZhaWwKZGlmZiAtLWdpdCBhL3Rlc3RzL2xvY2FsLm1rIGIvdGVzdHMv
bG9jYWwubWsKaW5kZXggZjEzNzZmYjcxLi5mOTdkZGNiOTggMTAwNjQ0Ci0tLSBhL3Rlc3Rz
L2xvY2FsLm1rCisrKyBiL3Rlc3RzL2xvY2FsLm1rCkBAIC0yMzcsOCArMjM3LDkgQEAgYWxs
X3Rlc3RzID0JCQkJCVwKICAgdGVzdHMvY2hncnAvcG9zaXgtSC5zaAkJCVwKICAgdGVzdHMv
Y2hncnAvcmVjdXJzZS5zaAkJCVwKICAgdGVzdHMvZm10L2Jhc2UucGwJCQkJXAotICB0ZXN0
cy9mbXQvbG9uZy1saW5lLnNoCQkJXAogICB0ZXN0cy9mbXQvZ29hbC1vcHRpb24uc2gJCQlc
CisgIHRlc3RzL2ZtdC9sb25nLWxpbmUuc2gJCQlcCisgIHRlc3RzL2ZtdC9ub24tc3BhY2Uu
c2gJCQlcCiAgIHRlc3RzL21pc2MvZWNoby5zaAkJCQlcCiAgIHRlc3RzL21pc2MvZW52LnNo
CQkJCVwKICAgdGVzdHMvbWlzYy9lbnYtc2lnbmFsLWhhbmRsZXIuc2gJCVwKLS0gCjIuMjYu
MgoK
--------------4iW4YB0vtD8l2k0PcckECv41--
Received: (at submit) by debbugs.gnu.org; 23 Feb 2022 11:27:50 +0000
Mime-Version: 1.0
Content-Transfer-Encoding: quoted-printable
Content-Type: text/plain; charset=UTF-8
Date: Wed, 23 Feb 2022 12:58:54 +0200
Message-Id: <CI3DGCYKZW8W.C8AILPSC6NEH@HIDDEN>
To: <bug-coreutils@HIDDEN>
From: "JD" <john1doe@HIDDEN>
Subject: fmt inserts garbage in certain cases?
X-Gpg-Key-Server: http://pgp.mit.edu
X-Gpg-Key-Fingerprint: 3B87 29EA 2136 7F0B 9AB6 8345 40A8 221E 52CB FA20
X-Gpg-Key: http://infornography.kpoxa.org/personal/s/key/GPG.public.key
Hi!
I have fmt from coreutils 8.32.1 installed via MacPorts.
If I run the following command: `echo х х х х х х х х х х х х х х х х х х х х х х х х х х | gfmt -sw 10` (which is just echoing 26 Cyrillic 'х' ('kha') letters), I get the following results:
https://i.imgur.com/yRx7uuz.png (iTerm2)
https://i.imgur.com/7oQ0UPz.png (iTerm2 if passed via `more`)
https://i.imgur.com/UlLrEMy.png (Alacritty)
And if I delete just two 'х' letters, like this: `echo х х х х х х х х х х х х х х х х х х х х х х х х | gfmt -sw 10`, everything shows just fine: https://i.imgur.com/DwuWxyx.png
Would be grateful for any advice :)
-- 
JD