GNU bug report logs - #33239
'guix offload' regularly hangs in 'channel-get-exit-status' call

Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.

Package: guix; Severity: important; Reported by: ludo@HIDDEN (Ludovic Courtès); dated Fri, 2 Nov 2018 10:58:02 UTC; Maintainer for guix is bug-guix@HIDDEN.

Message received at 33239 <at> debbugs.gnu.org:


Received: (at 33239) by debbugs.gnu.org; 23 Nov 2018 17:25:35 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Fri Nov 23 12:25:35 2018
Received: from localhost ([127.0.0.1]:44482 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1gQFCx-0001Pl-8M
	for submit <at> debbugs.gnu.org; Fri, 23 Nov 2018 12:25:35 -0500
Received: from eggs.gnu.org ([208.118.235.92]:35144)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1gQFCv-0001PV-5J
 for 33239 <at> debbugs.gnu.org; Fri, 23 Nov 2018 12:25:33 -0500
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
 (envelope-from <ludo@HIDDEN>) id 1gQFCn-00071g-D9
 for 33239 <at> debbugs.gnu.org; Fri, 23 Nov 2018 12:25:28 -0500
X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org
X-Spam-Level: 
X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=disabled
 version=3.3.2
Received: from fencepost.gnu.org ([2001:4830:134:3::e]:34676)
 by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from <ludo@HIDDEN>)
 id 1gQFCm-0006zr-Vp
 for 33239 <at> debbugs.gnu.org; Fri, 23 Nov 2018 12:25:25 -0500
Received: from [2001:660:6102:320:e120:2c8f:8909:cdfe] (port=51896 helo=ribbon)
 by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256)
 (Exim 4.82) (envelope-from <ludo@HIDDEN>) id 1gQFCm-0003S8-DZ
 for 33239 <at> debbugs.gnu.org; Fri, 23 Nov 2018 12:25:24 -0500
From: ludo@HIDDEN (Ludovic =?utf-8?Q?Court=C3=A8s?=)
To: 33239 <at> debbugs.gnu.org
Subject: Re: bug#33239: 'guix offload' regularly hangs in
 'channel-get-exit-status' call
References: <87k1lvrblp.fsf@HIDDEN>
Date: Fri, 23 Nov 2018 18:25:21 +0100
In-Reply-To: <87k1lvrblp.fsf@HIDDEN> ("Ludovic
 \=\?utf-8\?Q\?Court\=C3\=A8s\=22'\?\=
 \=\?utf-8\?Q\?s\?\= message of "Fri, 02 Nov 2018 11:57:06 +0100")
Message-ID: <87wop33dvi.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic]
X-Received-From: 2001:4830:134:3::e
X-Spam-Score: -5.0 (-----)
X-Debbugs-Envelope-To: 33239
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -6.0 (------)

ludo@HIDDEN (Ludovic Court=C3=A8s) skribis:

> (gdb) bt
> #0  0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeout=
=3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29
> #1  0x00007f2994287577 in ssh_poll_ctx_dopoll () from target:/gnu/store/w=
mpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
> #2  0x00007f29942884d9 in ssh_handle_packets () from target:/gnu/store/wm=
pg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
> #3  0x00007f29942885ad in ssh_handle_packets_termination () from target:/=
gnu/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
> #4  0x00007f2994275080 in ssh_channel_get_exit_status () from target:/gnu=
/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
> #5  0x00007f29946dd11a in guile_ssh_channel_get_exit_status () from targe=
t:/gnu/store/i3nfl17wfx7sryq6w15r9wxl7ilmq4rb-guile-ssh-0.11.3/lib/libguile=
-ssh.so.11
> #6  0x00007f29a1765965 in vm_regular_engine (thread=3D0x1dd58c0, vp=3D0x1=
d4df30, registers=3D0xffffffff, resume=3D-1615646479) at vm-engine.c:786
> #7  0x00007f29a1768fba in scm_call_n (proc=3D#<program 7f29a1be0030>, arg=
v=3Dargv@entry=3D0x7ffc76b1ece8, nargs=3Dnargs@entry=3D1) at vm.c:1257
> #8  0x00007f29a16ecff7 in scm_primitive_eval (
>     exp=3Dexp@entry=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-l=
ine) load/lang) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.=
f9a8fce/bin/.guix-real") (main (command-line)) (quit)))) at eval.c:662
> #9  0x00007f29a16ed053 in scm_eval (
>     exp=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-line) load/la=
ng) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.f9a8fce/bin/=
.guix-real") (main (command-line)) (quit))), module_or_state=3Dmodule_or_st=
ate@entry=3D"#<struct module>" =3D {...}) at eval.c:696
> #10 0x00007f29a1738220 in scm_shell (argc=3D11, argv=3D0x1dd5280) at scri=
pt.c:454
>
> (gdb) frame 0
> #0  0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeout=
=3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29
> 29      in ../sysdeps/unix/sysv/linux/poll.c
> (gdb) p *fds
> $1 =3D {fd =3D 14, events =3D 1, revents =3D 0}
> (gdb) shell ls -l /proc/12605/fd
> total 0
> lr-x------ 1 root root 64 Nov  2 11:20 0 -> 'pipe:[44413497]'
> l-wx------ 1 root root 64 Nov  2 11:33 1 -> 'pipe:[44413496]'
> lr-x------ 1 root root 64 Nov  2 11:33 10 -> 'pipe:[44459532]'
> l-wx------ 1 root root 64 Nov  2 11:33 11 -> 'pipe:[44459532]'
> lr-x------ 1 root root 64 Nov  2 11:33 12 -> 'pipe:[44429590]'
> l-wx------ 1 root root 64 Nov  2 11:33 13 -> 'pipe:[44429590]'
> lrwx------ 1 root root 64 Nov  2 11:33 14 -> 'socket:[44444783]'
> lrwx------ 1 root root 64 Nov  2 11:33 15 -> 'socket:[44444784]'
> l-wx------ 1 root root 64 Nov  2 11:33 16 -> /var/guix/offload/141.80.167=
.140/0

When that happens, the guile process on the remote node that runs the
=E2=80=98redirect=E2=80=99 code of =E2=80=98remote-daemon-channel=E2=80=99 =
is stuck in select(2) with
infinite timeout.

Note on berlin the build nodes are still running Guile 2.2.2, vulnerable
to the =E2=80=98select=E2=80=99 bug <https://bugs.gnu.org/30365>, which we =
=E2=80=98redirect=E2=80=99
supposedly works around.

Ludo=E2=80=99.




Information forwarded to bug-guix@HIDDEN:
bug#33239; Package guix. Full text available.

Message received at 33239 <at> debbugs.gnu.org:


Received: (at 33239) by debbugs.gnu.org; 17 Nov 2018 19:09:10 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sat Nov 17 14:09:10 2018
Received: from localhost ([127.0.0.1]:58143 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1gO5xu-0003FV-KX
	for submit <at> debbugs.gnu.org; Sat, 17 Nov 2018 14:09:10 -0500
Received: from mx1.riseup.net ([198.252.153.129]:43681)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <swedebugia@HIDDEN>) id 1gO5xs-0003FL-EQ
 for 33239 <at> debbugs.gnu.org; Sat, 17 Nov 2018 14:09:09 -0500
Received: from piha.riseup.net (piha-pn.riseup.net [10.0.1.163])
 (using TLSv1 with cipher ECDHE-RSA-AES256-SHA (256/256 bits))
 (Client CN "*.riseup.net",
 Issuer "COMODO RSA Domain Validation Secure Server CA" (verified OK))
 by mx1.riseup.net (Postfix) with ESMTPS id B77781A01E7;
 Sat, 17 Nov 2018 11:09:07 -0800 (PST)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=riseup.net; s=squak;
 t=1542481747; bh=zgdr9GwxcjrIkkEI2ELr3c1AtZM9mhcaRaKbSRMx5I4=;
 h=Subject:To:References:From:Date:In-Reply-To:From;
 b=R7Lc/X5iLrU99iW7PIvXihBEPu/uUnvWJONMCxCQ/EbQfABN9P8Mj0Klg6SR9oaPR
 PgafR8NP5dAK2oksYyKH6zAZnrN8hnzsKzrKomfwzIQmtO1O9tkzInzJdjHuz36Y4U
 V+gcwTgIEIhNpd2eE6EgO1FOvr+aiobIJdgPiU4Q=
X-Riseup-User-ID: 1775A1C8F0E51E820465DEE066876CEFD3C2539CB43785E075B782E9FEF39291
Received: from [127.0.0.1] (localhost [127.0.0.1])
 by piha.riseup.net with ESMTPSA id D4E861E041A;
 Sat, 17 Nov 2018 11:09:06 -0800 (PST)
Subject: Re: bug#33239: 'guix offload' regularly hangs in
 'channel-get-exit-status' call
To: =?UTF-8?Q?Ludovic_Court=c3=a8s?= <ludo@HIDDEN>, 33239 <at> debbugs.gnu.org
References: <87k1lvrblp.fsf@HIDDEN>
From: swedebugia <swedebugia@HIDDEN>
Message-ID: <ffd9f437-61b8-ab73-41de-a57a58abed71@HIDDEN>
Date: Sat, 17 Nov 2018 20:14:52 +0100
MIME-Version: 1.0
In-Reply-To: <87k1lvrblp.fsf@HIDDEN>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Language: sv-FI
Content-Transfer-Encoding: 8bit
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 33239
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

On 2018-11-02 11:57, Ludovic Courtès wrote:

snip

> To be continued…

I found this which might be related and point you to a solution: 
https://github.com/paramiko/paramiko/issues/448

found here:
https://duckduckgo.com/?q=channel-get-exit-status+ssh

-- 
Cheers Swedebugia




Information forwarded to bug-guix@HIDDEN:
bug#33239; Package guix. Full text available.
Severity set to 'important' from 'normal' Request was from ludo@HIDDEN (Ludovic Courtès) to control <at> debbugs.gnu.org. Full text available.

Message received at 33239 <at> debbugs.gnu.org:


Received: (at 33239) by debbugs.gnu.org; 3 Nov 2018 14:10:07 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Sat Nov 03 10:10:07 2018
Received: from localhost ([127.0.0.1]:33057 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1gIwco-0002Vz-Vx
	for submit <at> debbugs.gnu.org; Sat, 03 Nov 2018 10:10:07 -0400
Received: from eggs.gnu.org ([208.118.235.92]:52221)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1gIwcl-0002VL-CL
 for 33239 <at> debbugs.gnu.org; Sat, 03 Nov 2018 10:10:03 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
 (envelope-from <ludo@HIDDEN>) id 1gIwcf-0002KN-IU
 for 33239 <at> debbugs.gnu.org; Sat, 03 Nov 2018 10:09:58 -0400
X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org
X-Spam-Level: 
X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=disabled
 version=3.3.2
Received: from fencepost.gnu.org ([2001:4830:134:3::e]:39555)
 by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from <ludo@HIDDEN>)
 id 1gIwcf-0002Je-CW; Sat, 03 Nov 2018 10:09:57 -0400
Received: from [2a01:e0a:1d:7270:af76:b9b:ca24:c465] (port=57118 helo=ribbon)
 by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256)
 (Exim 4.82) (envelope-from <ludo@HIDDEN>)
 id 1gIwcf-0007RH-4N; Sat, 03 Nov 2018 10:09:57 -0400
From: ludo@HIDDEN (Ludovic =?utf-8?Q?Court=C3=A8s?=)
To: swedebugia <swedebugia@HIDDEN>
Subject: Re: bug#33239: 'guix offload' regularly hangs in
 'channel-get-exit-status' call
References: <87k1lvrblp.fsf@HIDDEN>
 <e7a2ac04-e10f-7d8f-e564-82d94c3cc2df@HIDDEN>
X-URL: http://www.fdn.fr/~lcourtes/
X-Revolutionary-Date: 13 Brumaire an 227 de la =?utf-8?Q?R=C3=A9volution?=
X-PGP-Key-ID: 0x090B11993D9AEBB5
X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc
X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4  0CFB 090B 1199 3D9A EBB5
X-OS: x86_64-pc-linux-gnu
Date: Sat, 03 Nov 2018 15:09:55 +0100
In-Reply-To: <e7a2ac04-e10f-7d8f-e564-82d94c3cc2df@HIDDEN> (swedebugia's
 message of "Fri, 2 Nov 2018 14:46:30 +0100")
Message-ID: <87sh0ip80c.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic]
X-Received-From: 2001:4830:134:3::e
X-Spam-Score: -5.0 (-----)
X-Debbugs-Envelope-To: 33239
Cc: 33239 <at> debbugs.gnu.org
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -6.0 (------)

Hello,

swedebugia <swedebugia@HIDDEN> skribis:

> On 2018-11-02 11:57, Ludovic Court=C3=A8s wrote:
>> (gdb) shell ls -l /proc/12605/fd
> How did you come up with this file descriptor (did not appear above)

It showed up in the pollfd structure passed to =E2=80=98poll=E2=80=99.

> What does this give you in the debugging?

It shows that the file descriptor is indeed open and corresponds to a
socket (which is not much, I admit.)

Ludo=E2=80=99.




Information forwarded to bug-guix@HIDDEN:
bug#33239; Package guix. Full text available.

Message received at 33239 <at> debbugs.gnu.org:


Received: (at 33239) by debbugs.gnu.org; 2 Nov 2018 13:41:02 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Fri Nov 02 09:41:02 2018
Received: from localhost ([127.0.0.1]:59353 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1gIZh7-0000py-Th
	for submit <at> debbugs.gnu.org; Fri, 02 Nov 2018 09:41:02 -0400
Received: from mx1.riseup.net ([198.252.153.129]:37180)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <swedebugia@HIDDEN>) id 1gIZh6-0000pX-Bu
 for 33239 <at> debbugs.gnu.org; Fri, 02 Nov 2018 09:41:00 -0400
Received: from piha.riseup.net (piha-pn.riseup.net [10.0.1.163])
 (using TLSv1 with cipher ECDHE-RSA-AES256-SHA (256/256 bits))
 (Client CN "*.riseup.net",
 Issuer "COMODO RSA Domain Validation Secure Server CA" (verified OK))
 by mx1.riseup.net (Postfix) with ESMTPS id BFA991A0181;
 Fri,  2 Nov 2018 06:40:59 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=riseup.net; s=squak;
 t=1541166059; bh=OjWhxUymoxfw1OOjBpQrt0SKjC/hRt2vH+SNruRbe0o=;
 h=Subject:To:References:From:Date:In-Reply-To:From;
 b=Owj7+6O8uQLjpUM57pH6WeX0Aow6kco73VLR4wQli+s7LoUFe279cTFUQdLDwjZ+n
 AKa3G7/S/GJpRg1aYIYHlBNcGIYHU2xWnrZosd0b6kvNME5qm2rujFOiVilRQaTV5f
 Y4C4MwPwfoF46bjrLelicwiUzDkUg7t5eDgWnw84=
X-Riseup-User-ID: 01484FA11B571B80D2C2A75450BCE10BE15B70ADA41062CFD65A9A7B38636E68
Received: from [127.0.0.1] (localhost [127.0.0.1])
 by piha.riseup.net with ESMTPSA id DC09C64121;
 Fri,  2 Nov 2018 06:40:58 -0700 (PDT)
Subject: Re: bug#33239: 'guix offload' regularly hangs in
 'channel-get-exit-status' call
To: =?UTF-8?Q?Ludovic_Court=c3=a8s?= <ludo@HIDDEN>, 33239 <at> debbugs.gnu.org
References: <87k1lvrblp.fsf@HIDDEN>
From: swedebugia <swedebugia@HIDDEN>
Message-ID: <e7a2ac04-e10f-7d8f-e564-82d94c3cc2df@HIDDEN>
Date: Fri, 2 Nov 2018 14:46:30 +0100
MIME-Version: 1.0
In-Reply-To: <87k1lvrblp.fsf@HIDDEN>
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 8bit
Content-Language: sv-FI
X-Spam-Score: -0.7 (/)
X-Debbugs-Envelope-To: 33239
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -1.7 (-)

Hi :)


On 2018-11-02 11:57, Ludovic Courtès wrote:
> (gdb) shell ls -l /proc/12605/fd
How did you come up with this file descriptor (did not appear above)
What does this give you in the debugging?

-- 
Cheers
Swedebugia





Information forwarded to bug-guix@HIDDEN:
bug#33239; Package guix. Full text available.

Message received at submit <at> debbugs.gnu.org:


Received: (at submit) by debbugs.gnu.org; 2 Nov 2018 10:57:31 +0000
From debbugs-submit-bounces <at> debbugs.gnu.org Fri Nov 02 06:57:31 2018
Received: from localhost ([127.0.0.1]:59268 helo=debbugs.gnu.org)
	by debbugs.gnu.org with esmtp (Exim 4.84_2)
	(envelope-from <debbugs-submit-bounces <at> debbugs.gnu.org>)
	id 1gIX8t-0003EZ-0r
	for submit <at> debbugs.gnu.org; Fri, 02 Nov 2018 06:57:31 -0400
Received: from eggs.gnu.org ([208.118.235.92]:35550)
 by debbugs.gnu.org with esmtp (Exim 4.84_2)
 (envelope-from <ludo@HIDDEN>) id 1gIX8r-0003EN-BL
 for submit <at> debbugs.gnu.org; Fri, 02 Nov 2018 06:57:29 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
 (envelope-from <ludo@HIDDEN>) id 1gIX8j-0002EL-7y
 for submit <at> debbugs.gnu.org; Fri, 02 Nov 2018 06:57:23 -0400
X-Spam-Checker-Version: SpamAssassin 3.3.2 (2011-06-06) on eggs.gnu.org
X-Spam-Level: 
X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00 autolearn=disabled
 version=3.3.2
Received: from lists.gnu.org ([2001:4830:134:3::11]:57635)
 by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_256_CBC_SHA1:32)
 (Exim 4.71) (envelope-from <ludo@HIDDEN>) id 1gIX8e-0002DT-Ky
 for submit <at> debbugs.gnu.org; Fri, 02 Nov 2018 06:57:17 -0400
Received: from eggs.gnu.org ([2001:4830:134:3::10]:45117)
 by lists.gnu.org with esmtp (Exim 4.71)
 (envelope-from <ludo@HIDDEN>) id 1gIX8d-0001IT-GW
 for bug-guix@HIDDEN; Fri, 02 Nov 2018 06:57:16 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
 (envelope-from <ludo@HIDDEN>) id 1gIX8Z-0002CK-Ty
 for bug-guix@HIDDEN; Fri, 02 Nov 2018 06:57:13 -0400
Received: from fencepost.gnu.org ([2001:4830:134:3::e]:35174)
 by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from <ludo@HIDDEN>)
 id 1gIX8X-0002Bc-L4
 for bug-guix@HIDDEN; Fri, 02 Nov 2018 06:57:11 -0400
Received: from [2a01:e0a:1d:7270:af76:b9b:ca24:c465] (port=42504 helo=ribbon)
 by fencepost.gnu.org with esmtpsa (TLS1.2:RSA_AES_256_CBC_SHA1:256)
 (Exim 4.82) (envelope-from <ludo@HIDDEN>) id 1gIX8W-00074R-1L
 for bug-guix@HIDDEN; Fri, 02 Nov 2018 06:57:09 -0400
From: ludo@HIDDEN (Ludovic =?utf-8?Q?Court=C3=A8s?=)
To: bug-guix@HIDDEN
Subject: 'guix offload' regularly hangs in 'channel-get-exit-status' call
X-URL: http://www.fdn.fr/~lcourtes/
X-Revolutionary-Date: 12 Brumaire an 227 de la =?utf-8?Q?R=C3=A9volution?=
X-PGP-Key-ID: 0x090B11993D9AEBB5
X-PGP-Key: http://www.fdn.fr/~lcourtes/ludovic.asc
X-PGP-Fingerprint: 3CE4 6455 8A84 FDC6 9DB4  0CFB 090B 1199 3D9A EBB5
X-OS: x86_64-pc-linux-gnu
Date: Fri, 02 Nov 2018 11:57:06 +0100
Message-ID: <87k1lvrblp.fsf@HIDDEN>
User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux)
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.2.x-3.x [generic]
X-detected-operating-system: by eggs.gnu.org: GNU/Linux 2.6.x
X-Received-From: 2001:4830:134:3::11
X-Spam-Score: -5.0 (-----)
X-Debbugs-Envelope-To: submit
X-BeenThere: debbugs-submit <at> debbugs.gnu.org
X-Mailman-Version: 2.1.18
Precedence: list
List-Id: <debbugs-submit.debbugs.gnu.org>
List-Unsubscribe: <https://debbugs.gnu.org/cgi-bin/mailman/options/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=unsubscribe>
List-Archive: <https://debbugs.gnu.org/cgi-bin/mailman/private/debbugs-submit/>
List-Post: <mailto:debbugs-submit <at> debbugs.gnu.org>
List-Help: <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=help>
List-Subscribe: <https://debbugs.gnu.org/cgi-bin/mailman/listinfo/debbugs-submit>, 
 <mailto:debbugs-submit-request <at> debbugs.gnu.org?subject=subscribe>
Errors-To: debbugs-submit-bounces <at> debbugs.gnu.org
Sender: "Debbugs-submit" <debbugs-submit-bounces <at> debbugs.gnu.org>
X-Spam-Score: -6.0 (------)

Hello,

The =E2=80=98guix offload=E2=80=99 processes on berlin regularly hang while=
 calling
=E2=80=98channel-get-exit-status=E2=80=99:

--8<---------------cut here---------------start------------->8---
(gdb) bt
#0  0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeout=
=3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29
#1  0x00007f2994287577 in ssh_poll_ctx_dopoll () from target:/gnu/store/wmp=
g67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#2  0x00007f29942884d9 in ssh_handle_packets () from target:/gnu/store/wmpg=
67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#3  0x00007f29942885ad in ssh_handle_packets_termination () from target:/gn=
u/store/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#4  0x00007f2994275080 in ssh_channel_get_exit_status () from target:/gnu/s=
tore/wmpg67bn7i7pqc0p4xjp1npnqixk9znd-libssh-0.7.6/lib/libssh.so.4
#5  0x00007f29946dd11a in guile_ssh_channel_get_exit_status () from target:=
/gnu/store/i3nfl17wfx7sryq6w15r9wxl7ilmq4rb-guile-ssh-0.11.3/lib/libguile-s=
sh.so.11
#6  0x00007f29a1765965 in vm_regular_engine (thread=3D0x1dd58c0, vp=3D0x1d4=
df30, registers=3D0xffffffff, resume=3D-1615646479) at vm-engine.c:786
#7  0x00007f29a1768fba in scm_call_n (proc=3D#<program 7f29a1be0030>, argv=
=3Dargv@entry=3D0x7ffc76b1ece8, nargs=3Dnargs@entry=3D1) at vm.c:1257
#8  0x00007f29a16ecff7 in scm_primitive_eval (
    exp=3Dexp@entry=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-lin=
e) load/lang) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.f9=
a8fce/bin/.guix-real") (main (command-line)) (quit)))) at eval.c:662
#9  0x00007f29a16ed053 in scm_eval (
    exp=3D((@ (ice-9 control) %) (begin ((@@ (ice-9 command-line) load/lang=
) "/gnu/store/zz3b7j4iv6v143v7cqyr77k83zc5n3zw-guix-0.15.0-6.f9a8fce/bin/.g=
uix-real") (main (command-line)) (quit))), module_or_state=3Dmodule_or_stat=
e@entry=3D"#<struct module>" =3D {...}) at eval.c:696
#10 0x00007f29a1738220 in scm_shell (argc=3D11, argv=3D0x1dd5280) at script=
.c:454

(gdb) frame 0
#0  0x00007f299fb330f1 in __GI___poll (fds=3D0x1dd58c0, nfds=3D1, timeout=
=3D-1) at ../sysdeps/unix/sysv/linux/poll.c:29
29      in ../sysdeps/unix/sysv/linux/poll.c
(gdb) p *fds
$1 =3D {fd =3D 14, events =3D 1, revents =3D 0}
(gdb) shell ls -l /proc/12605/fd
total 0
lr-x------ 1 root root 64 Nov  2 11:20 0 -> 'pipe:[44413497]'
l-wx------ 1 root root 64 Nov  2 11:33 1 -> 'pipe:[44413496]'
lr-x------ 1 root root 64 Nov  2 11:33 10 -> 'pipe:[44459532]'
l-wx------ 1 root root 64 Nov  2 11:33 11 -> 'pipe:[44459532]'
lr-x------ 1 root root 64 Nov  2 11:33 12 -> 'pipe:[44429590]'
l-wx------ 1 root root 64 Nov  2 11:33 13 -> 'pipe:[44429590]'
lrwx------ 1 root root 64 Nov  2 11:33 14 -> 'socket:[44444783]'
lrwx------ 1 root root 64 Nov  2 11:33 15 -> 'socket:[44444784]'
l-wx------ 1 root root 64 Nov  2 11:33 16 -> /var/guix/offload/141.80.167.1=
40/0
l-wx------ 1 root root 64 Nov  2 11:33 2 -> 'pipe:[44413496]'
lr-x------ 1 root root 64 Nov  2 11:33 3 -> 'pipe:[44459528]'
lr-x------ 1 root root 64 Nov  2 11:33 33 -> /dev/urandom
l-wx------ 1 root root 64 Nov  2 11:33 4 -> 'pipe:[44413498]'
l-wx------ 1 root root 64 Nov  2 11:33 5 -> 'pipe:[44459528]'
lr-x------ 1 root root 64 Nov  2 11:33 6 -> 'pipe:[44459531]'
l-wx------ 1 root root 64 Nov  2 11:33 7 -> 'pipe:[44459531]'
lr-x------ 1 root root 64 Nov  2 11:33 8 -> 'pipe:[44453928]'
l-wx------ 1 root root 64 Nov  2 11:33 9 -> 'pipe:[44453928]'
--8<---------------cut here---------------end--------------->8---

I believe this is because in (guix ssh) we don=E2=80=99t ensure the remote
process is dead by the time we call =E2=80=98channel-get-exit-status=E2=80=
=99, as in
this example:

--8<---------------cut here---------------start------------->8---
scheme@(guix ssh)> (define s (open-ssh-session "localhost" #:user "ludo" #:=
port 22))
scheme@(guix ssh)> (define c (open-remote-pipe* s OPEN_BOTH "sleep 1000"))
scheme@(guix ssh)> (channel-send-eof c)
$4 =3D #<undefined>
scheme@(guix ssh)> (channel-get-exit-status c)
;; hangs
--8<---------------cut here---------------end--------------->8---

Problem is that calling =E2=80=98channel-get-exit-status=E2=80=99 on a clos=
ed port
doesn=E2=80=99t work, so forcing a port close isn=E2=80=99t really an optio=
n:

--8<---------------cut here---------------start------------->8---
scheme@(guix ssh)> (define c (open-remote-pipe* s OPEN_BOTH "sleep 100"))
scheme@(guix ssh)> (close-port c)
$4 =3D #t
scheme@(guix ssh)> (channel-get-exit-status c)
ERROR: In procedure channel-get-exit-status:
In procedure channel-get-exit-status: Wrong type argument in position 1 (ex=
pecting open channel): #<unknown channel (freed) 221d5c0>
--8<---------------cut here---------------end--------------->8---

To be continued=E2=80=A6

Ludo=E2=80=99.




Acknowledgement sent to ludo@HIDDEN (Ludovic Courtès):
New bug report received and forwarded. Copy sent to bug-guix@HIDDEN. Full text available.
Report forwarded to bug-guix@HIDDEN:
bug#33239; Package guix. Full text available.
Please note: This is a static page, with minimal formatting, updated once a day.
Click here to see this page with the latest information and nicer formatting.
Last modified: Fri, 23 Nov 2018 17:30:02 UTC

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997 nCipher Corporation Ltd, 1994-97 Ian Jackson.