GNU bug report logs - #32308
doc: join: mention whitespace is significant

Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.

Package: coreutils; Severity: wishlist; Reported by: Dima Kogan <dima@HIDDEN>; dated Mon, 30 Jul 2018 03:12:01 UTC; Maintainer for coreutils is bug-coreutils@HIDDEN.
Changed bug title to 'doc: join: mention whitespace is significant' from 'join doesn't ignore trailing whitespace'. Request was from Assaf Gordon <assafgordon@HIDDEN> to control <at> debbugs.gnu.org.
Severity set to 'wishlist' from 'normal'. Request was from Assaf Gordon <assafgordon@HIDDEN> to control <at> debbugs.gnu.org.

Message received at 32308 <at> debbugs.gnu.org:


Subject: Re: bug#32308: join doesn't ignore trailing whitespace
To: Dima Kogan <dima@HIDDEN>, 32308 <at> debbugs.gnu.org
References: <87muu97707.fsf@HIDDEN>
From: Assaf Gordon <assafgordon@HIDDEN>
Message-ID: <8ca61b2d-7a00-0124-83fa-544e269ec3d6@HIDDEN>
Date: Mon, 29 Oct 2018 21:48:54 -0600
User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101
 Thunderbird/60.2.1
MIME-Version: 1.0
In-Reply-To: <87muu97707.fsf@HIDDEN>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Language: en-US
Content-Transfer-Encoding: 7bit

severity 32308 wishlist
retitle 32308 doc: join: mention whitespace is significant
stop

Hello,

On 2018-07-29 9:11 p.m., Dima Kogan wrote:
> It appears that join doesn't ignore trailing whitespace, resulting
> in surprising results. 
[...]
> I guess the manpage isn't explicit about what happens in this case, but
> as a user, I find this to be extremely surprising, and this thus feels
> like a bug.

I've marked this as a wishlist item - to be improved in the future.

-assaf




Information forwarded to bug-coreutils@HIDDEN:
bug#32308; Package coreutils.

Message received at submit <at> debbugs.gnu.org:


User-agent: mu4e 0.9.19; emacs 27.0.50
From: Dima Kogan <dima@HIDDEN>
To: bug-coreutils@HIDDEN
Subject: join doesn't ignore trailing whitespace
Date: Sun, 29 Jul 2018 20:11:04 -0700
Message-ID: <87muu97707.fsf@HIDDEN>
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="=-=-="

--=-=-=
Content-Type: text/plain

Hi. It appears that join doesn't ignore trailing whitespace, resulting
in surprising results. I'm attaching two data files. One is the output
of 'seq 5', and the other is the same thing, but with a trailing space
added to each line. Look:

  dima@scrawny:~$ join -j 1 -e - -o auto /tmp/dat.no.trailing.space /tmp/dat.no.trailing.space 
  1
  2
  3
  4
  5

  dima@scrawny:~$ join -j 1 -e - -o auto /tmp/dat.trailing.space /tmp/dat.trailing.space   
  1 - -
  2 - -
  3 - -
  4 - -
  5 - -

I guess the manpage isn't explicit about what happens in this case, but
as a user, I find this to be extremely surprising, and this thus feels
like a bug. Thanks.

(I'm not subscribed to the list, so please Cc me in replies)


--=-=-=
Content-Type: application/octet-stream
Content-Disposition: attachment; filename=dat.no.trailing.space
Content-Transfer-Encoding: base64

MQoyCjMKNAo1Cg==
--=-=-=
Content-Type: application/octet-stream
Content-Disposition: attachment; filename=dat.trailing.space
Content-Transfer-Encoding: base64

MSAKMiAKMyAKNCAKNSAK
--=-=-=--
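
For readers who want to reproduce the report, the following is a minimal sketch that rebuilds the two attachments from their base64 bodies above and reruns the same join invocation on each. The temporary directory and the file names `plain` and `trailing` are placeholders chosen for this sketch, not the reporter's paths, and it assumes a GNU coreutils `join` (the `-o auto` option used in the report is a GNU extension). In the report's own output the file with trailing spaces yields lines such as `1 - -`, which suggests that the trailing space is read as introducing an extra, empty field that `-e -` then fills in.

```shell
#!/bin/sh
# Rebuild the two attachments from the base64 bodies shown in this report.
# Directory and file names are placeholders, not the reporter's paths.
tmpdir=$(mktemp -d)
trap 'rm -rf "$tmpdir"' EXIT

printf '%s\n' 'MQoyCjMKNAo1Cg==' | base64 -d > "$tmpdir/plain"
printf '%s\n' 'MSAKMiAKMyAKNCAKNSAK' | base64 -d > "$tmpdir/trailing"

# Dump each file byte by byte so the trailing blanks become visible.
od -c "$tmpdir/plain"
od -c "$tmpdir/trailing"

# Same invocation as in the report: each file is joined with itself.
plain_out=$(join -j 1 -e - -o auto "$tmpdir/plain" "$tmpdir/plain")
trailing_out=$(join -j 1 -e - -o auto "$tmpdir/trailing" "$tmpdir/trailing")

printf 'without trailing space:\n%s\n' "$plain_out"
printf 'with trailing space:\n%s\n' "$trailing_out"
```

If the trailing blanks carry no meaning in the data, one possible workaround (not proposed in this thread) is to strip them before joining, for example with `sed 's/[[:blank:]]*$//'`.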




Acknowledgement sent to Dima Kogan <dima@HIDDEN>:
New bug report received and forwarded. Copy sent to bug-coreutils@HIDDEN.
Report forwarded to bug-coreutils@HIDDEN:
bug#32308; Package coreutils.
Last modified: Mon, 25 Nov 2019 12:00:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.