GNU bug report logs - #36130
split bug

Reported by: Heather Wick <heather.c.wick <at> gmail.com>

Date: Fri, 7 Jun 2019 18:47:01 UTC

Severity: normal

Tags: notabug

Done: Assaf Gordon <assafgordon <at> gmail.com>

Bug is archived. No further changes may be made.

To add a comment to this bug, you must first unarchive it, by sending
a message to control AT debbugs.gnu.org, with unarchive 36130 in the body.
You can then email your comments to 36130 AT debbugs.gnu.org in the normal way.

Toggle the display of automated, internal messages from the tracker.

View this report as an mbox folder, status mbox, maintainer mbox

Report forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Fri, 07 Jun 2019 18:47:02 GMT) Full text and rfc822 format available.

Acknowledgement sent to Heather Wick <heather.c.wick <at> gmail.com>:
New bug report received and forwarded. Copy sent to bug-coreutils <at> gnu.org. (Fri, 07 Jun 2019 18:47:02 GMT) Full text and rfc822 format available.

Message #5 received at submit <at> debbugs.gnu.org (full text, mbox):

From: Heather Wick <heather.c.wick <at> gmail.com>
To: bug-coreutils <at> gnu.org
Subject: split bug
Date: Fri, 7 Jun 2019 14:23:15 -0400

[Message part 1 (text/plain, inline)]

Hello,
I am using split to split up some large, paired fastq files (nearly 4
billion lines each). I am using the -l flag to split into files of 10
million reads (40 million lines) each and though the fastq files have
matched and sorted reads, split is creating different numbers of split
files for the two paired fastq files, and the pairing becomes off at some
point. The jobs finished without exceeding memory and with an exit status
0, and I noticed the help file said to email this address if there were
bugs, so I thought I would mention it.
This is the line I am using to call split on my zipped fastq files:
zcat MH1_R1.fastq.gz | split - -l 40000000 DHT_R1_
zcat MH1_R2.fastq.gz | split - -l 40000000 DHT_R2_
This creates 96 chunks for the R1 and 95 chunks for R2, even though the
orignal fastq files have the same number of reads.
Do you have any suggestions for how to proceed? Perhaps zcatting and piping
the files is not the best way to call split?
Thanks,
~ Heather

-- 
Heather Wick
PhD Candidate, Human Genetics
Labs of Sarah Wheelan and Vasan Yegnasubramanian
Institute of Genetic Medicine
Johns Hopkins University School of Medicine
hwick1 <at> jhmi.edu

[Message part 2 (text/html, inline)]

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Sat, 08 Jun 2019 01:30:02 GMT) Full text and rfc822 format available.

Message #8 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Assaf Gordon <assafgordon <at> gmail.com>
To: Heather Wick <heather.c.wick <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Fri, 7 Jun 2019 19:29:42 -0600

Hello,

On Fri, Jun 07, 2019 at 02:23:15PM -0400, Heather Wick wrote:
> I am using split to split up some large, paired fastq files [...]:
>
>   zcat MH1_R1.fastq.gz | split - -l 40000000 DHT_R1_
>   zcat MH1_R2.fastq.gz | split - -l 40000000 DHT_R2_
>
> This creates 96 chunks for the R1 and 95 chunks for R2, even though the
> orignal fastq files have the same number of reads.
>
> Do you have any suggestions for how to proceed? Perhaps zcatting and piping
> the files is not the best way to call split?

To help diagnose to issue better, please run the following commands
and tell us what are the results:

1. number of lines in each file:

   zcat MH1_R1.fastq.gz | wc -l
   zcat MH1_R2.fastq.gz | wc -l

2. The first two sequence IDs:

   zcat MH1_R1.fastq.gz | head -n8 | grep ^@
   zcat MH1_R2.fastq.gz | head -n8 | grep ^@

3. Last two sequence IDs:

   zcat MH1_R1.fastq.gz | tail -n8 | grep ^@
   zcat MH1_R2.fastq.gz | tail -n8 | grep ^@

These will just verify the FASTQ files are indeed paired with no
surprises. The files should have the same number of lines,
and matching sequence IDs in the first and last lines.

regards,
 - assaf

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Sat, 08 Jun 2019 02:00:02 GMT) Full text and rfc822 format available.

Message #11 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Heather Wick <heather.c.wick <at> gmail.com>
To: Assaf Gordon <assafgordon <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Fri, 7 Jun 2019 21:48:44 -0400

[Message part 1 (text/plain, inline)]

Hi,
Yes, sorry, I should have specified that I already checked that the
original fastq files are indeed paired and sorted with the same number of
lines and same starting/ending IDs, narrowing down the issue to a problem
with split.
~ Heather


(base) [hwick <at> zappalogin ~]$ zcat  MH2_R2.fastq.gz | wc -l

3778103832

(base) [hwick <at> zappalogin ~]$ zcat  MH2_R1.fastq.gz | wc -l

3778103832


(base) [hwick <at> zappalogin test_2019]$ zcat MH2_R1.fastq.gz | head -n8 | grep
^@

@A00197:48:HF2GWDMXX:1:1101:1741:1000 1:N:0:GATCAG+TCTTTCCC

@A00197:48:HF2GWDMXX:1:1101:2754:1000 1:N:0:GATCAG+TCTTTCCC

(base) [hwick <at> zappalogin test_2019]$ zcat MH2_R2.fastq.gz | head -n8 | grep
^@

@A00197:48:HF2GWDMXX:1:1101:1741:1000 2:N:0:GATCAG+TCTTTCCC

@A00197:48:HF2GWDMXX:1:1101:2754:1000 2:N:0:GATCAG+TCTTTCCC


(base) [hwick <at> zappalogin test_2019]$ zcat MH2_R1.fastq.gz | tail -n8 | grep
^@

@E00489:288:HMFWCCCXY:2:2224:29305:73106 1:N:0:GATCAG

@E00489:288:HMFWCCCXY:2:2224:29325:73106 1:N:0:GATCAG

(base) [hwick <at> zappalogin test_2019]$ zcat MH2_R2.fastq.gz | tail -n8 | grep
^@

@E00489:288:HMFWCCCXY:2:2224:29305:73106 2:N:0:GATCAG

@E00489:288:HMFWCCCXY:2:2224:29325:73106 2:N:0:GATCAG




On Fri, Jun 7, 2019 at 9:29 PM Assaf Gordon <assafgordon <at> gmail.com> wrote:

> Hello,
>
> On Fri, Jun 07, 2019 at 02:23:15PM -0400, Heather Wick wrote:
> > I am using split to split up some large, paired fastq files [...]:
> >
> >   zcat MH1_R1.fastq.gz | split - -l 40000000 DHT_R1_
> >   zcat MH1_R2.fastq.gz | split - -l 40000000 DHT_R2_
> >
> > This creates 96 chunks for the R1 and 95 chunks for R2, even though the
> > orignal fastq files have the same number of reads.
> >
> > Do you have any suggestions for how to proceed? Perhaps zcatting and
> piping
> > the files is not the best way to call split?
>
> To help diagnose to issue better, please run the following commands
> and tell us what are the results:
>
> 1. number of lines in each file:
>
>    zcat MH1_R1.fastq.gz | wc -l
>    zcat MH1_R2.fastq.gz | wc -l
>
> 2. The first two sequence IDs:
>
>    zcat MH1_R1.fastq.gz | head -n8 | grep ^@
>    zcat MH1_R2.fastq.gz | head -n8 | grep ^@
>
> 3. Last two sequence IDs:
>
>    zcat MH1_R1.fastq.gz | tail -n8 | grep ^@
>    zcat MH1_R2.fastq.gz | tail -n8 | grep ^@
>
> These will just verify the FASTQ files are indeed paired with no
> surprises. The files should have the same number of lines,
> and matching sequence IDs in the first and last lines.
>
> regards,
>  - assaf
>
>

-- 
Heather Wick
PhD Candidate, Human Genetics
Labs of Sarah Wheelan and Vasan Yegnasubramanian
Institute of Genetic Medicine
Johns Hopkins University School of Medicine
hwick1 <at> jhmi.edu

[Message part 2 (text/html, inline)]

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Sat, 08 Jun 2019 03:40:03 GMT) Full text and rfc822 format available.

Message #14 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Assaf Gordon <assafgordon <at> gmail.com>
To: Heather Wick <heather.c.wick <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Fri, 7 Jun 2019 21:39:18 -0600

Hello,

On Fri, Jun 07, 2019 at 09:48:44PM -0400, Heather Wick wrote:
> Yes, sorry, I should have specified that I already checked that the
> original fastq files are indeed paired and sorted with the same number of
> lines and same starting/ending IDs, narrowing down the issue to a problem
> with split.

It could be a problem with "split", but we'll need to dig a bit deeper
to be able to pinpoint the exact issue.

Could you please try the following commands and post the results?

    zcat MH1_R1.fastq.gz \
       | split --verbose -l 40000000 - DHT_R1_ > DHT_R1.log ; echo DHT_R1 exit code: $?
    zcat MH1_R2.fastq.gz \
       | split --verbose -l 40000000 - DHT_R2_ > DHT_R2.log ; echo DHT_R2 exit code: $?
    wc -l DHT_R1.log DHT_R2.log

Two more questions:
1. can you post the result of "split --version" ?
2. You mentioned "jobs" - if you are running these as submitted jobs on
a cluster (e.g. with "qsub"), can you double-check the STDERR log files
to ensure no errors where encountered ?

If we still can't pinpoint the issue, the next steps would be to check
the DHT_R{1,2}.log files, and then try to compare the content of the
splitted files.

I assume the input files are indeed correctly paired, but just to check,
if you could try the following command, it should not print anything
to the screen (indicating all sequence IDs are paired):

    paste <(zcat MH1_R2.fastq) <(zcat MH1_R2.fastq.gz) \
       | awk 'NR%4!=1 { next } $1!=$3 { print "Error in line " NR ":" $1 " vs " $3 }'

regards,
 - assaf

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Mon, 10 Jun 2019 18:30:02 GMT) Full text and rfc822 format available.

Message #17 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Heather Wick <heather.c.wick <at> gmail.com>
To: Assaf Gordon <assafgordon <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Mon, 10 Jun 2019 14:28:48 -0400

[Message part 1 (text/plain, inline)]

Thank you so much for your response. Here are the results of the tests you
sent:
Verbose: This seems to have made the same number of files this time; not
sure why the other 3-4 times I ran it it did not. They appear to be the
same size, with paired last reads

(base) [hwick <at> zappalogin interactive_with_verbose]$ cat
make_chunks_1_1mill_verbose

DHT_R1 exit code: 0

DHT_R2 exit code: 0

  96 DHT_R1.log

  96 DHT_R2.log

 192 total
Version:

(base) [hwick <at> zappalogin test_2019]$ split --version

split (GNU coreutils) 8.4

Copyright (C) 2010 Free Software Foundation, Inc.

License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html
>.

This is free software: you are free to change and redistribute it.

There is NO WARRANTY, to the extent permitted by law.

Written by Torbjörn Granlund and Richard M. Stallman.

STDERR:
The only thing in the stderr file is an odd duck of:

-sh: module: line 1: syntax error: unexpected end of file

-sh: error importing function definition for `BASH_FUNC_module'

Python 3.6.8 :: Anaconda, Inc.

/bin/sh: module: line 1: syntax error: unexpected end of file

/bin/sh: error importing function definition for `BASH_FUNC_module'

but this prints for every job I run with this particular flavor of conda/bash
and doesn't seem to affect anything else (as far as I know)
All jobs finished well below allotted memory and with exit status 0, even
when split didn't make the right number of output files.

Do you know any reason why the behavior would be inconsistent?

Pairing check: unfortunately my server's version of bash doesn't support
paste in this way, I've run into this issue before but I forget what the
workaround is. I can't run this command interactively because my server
times out (these files are > 3 billion lines each, so it takes a long time
to zcat them)

/cm/local/apps/sge/var/spool/zappa-06/job_scripts/358558: line 10: syntax
error near unexpected token `('

/cm/local/apps/sge/var/spool/zappa-06/job_scripts/358558: line 10: `paste
<(zcat MH1_R2.fastq) <(zcat MH1_R2.fastq.gz) \'

On Fri, Jun 7, 2019 at 11:39 PM Assaf Gordon <assafgordon <at> gmail.com> wrote:

> Hello,
>
> On Fri, Jun 07, 2019 at 09:48:44PM -0400, Heather Wick wrote:
> > Yes, sorry, I should have specified that I already checked that the
> > original fastq files are indeed paired and sorted with the same number of
> > lines and same starting/ending IDs, narrowing down the issue to a problem
> > with split.
>
> It could be a problem with "split", but we'll need to dig a bit deeper
> to be able to pinpoint the exact issue.
>
> Could you please try the following commands and post the results?
>
>     zcat MH1_R1.fastq.gz \
>        | split --verbose -l 40000000 - DHT_R1_ > DHT_R1.log ; echo DHT_R1
> exit code: $?
>     zcat MH1_R2.fastq.gz \
>        | split --verbose -l 40000000 - DHT_R2_ > DHT_R2.log ; echo DHT_R2
> exit code: $?
>     wc -l DHT_R1.log DHT_R2.log
>
> Two more questions:
> 1. can you post the result of "split --version" ?
> 2. You mentioned "jobs" - if you are running these as submitted jobs on
> a cluster (e.g. with "qsub"), can you double-check the STDERR log files
> to ensure no errors where encountered ?
>
> If we still can't pinpoint the issue, the next steps would be to check
> the DHT_R{1,2}.log files, and then try to compare the content of the
> splitted files.
>
> I assume the input files are indeed correctly paired, but just to check,
> if you could try the following command, it should not print anything
> to the screen (indicating all sequence IDs are paired):
>
>     paste <(zcat MH1_R2.fastq) <(zcat MH1_R2.fastq.gz) \
>        | awk 'NR%4!=1 { next } $1!=$3 { print "Error in line " NR ":" $1 "
> vs " $3 }'
>
> regards,
>  - assaf
>
>
>

-- 
Heather Wick
PhD Candidate, Human Genetics
Labs of Sarah Wheelan and Vasan Yegnasubramanian
Institute of Genetic Medicine
Johns Hopkins University School of Medicine
hwick1 <at> jhmi.edu

[Message part 2 (text/html, inline)]

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Mon, 10 Jun 2019 21:39:01 GMT) Full text and rfc822 format available.

Message #20 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Pádraig Brady <P <at> draigBrady.com>
To: Heather Wick <heather.c.wick <at> gmail.com>,
 Assaf Gordon <assafgordon <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Mon, 10 Jun 2019 22:38:44 +0100

On 10/06/19 19:28, Heather Wick wrote:
> Thank you so much for your response. Here are the results of the tests you
> sent:
> Verbose: This seems to have made the same number of files this time; not
> sure why the other 3-4 times I ran it it did not. They appear to be the
> same size, with paired last reads
> 
> (base) [hwick <at> zappalogin interactive_with_verbose]$ cat
> make_chunks_1_1mill_verbose
> 
> DHT_R1 exit code: 0
> 
> DHT_R2 exit code: 0
> 
>   96 DHT_R1.log
> 
>   96 DHT_R2.log
> 
>  192 total
> Version:
> 
> (base) [hwick <at> zappalogin test_2019]$ split --version
> 
> split (GNU coreutils) 8.4

That is nearly 10 years old now, though in saying that I'm not
sure if there were any bugs fixed that would explain what you're seeing.
One possibility is:
https://git.savannah.gnu.org/gitweb/?p=coreutils.git;a=commitdiff;h=758916b
which would manifest as silently ignoring errors when reading input.

cheers,
Pádraig

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Mon, 10 Jun 2019 22:51:01 GMT) Full text and rfc822 format available.

Message #23 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Assaf Gordon <assafgordon <at> gmail.com>
To: Heather Wick <heather.c.wick <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Mon, 10 Jun 2019 16:50:20 -0600

Hello,

On 2019-06-10 12:28 p.m., Heather Wick wrote:
> Thank you so much for your response. Here are the results of the tests 
> you sent:

> Verbose: This seems to have made the same number of files this time; not 
> sure why the other 3-4 times I ran it it did not. They appear to be the 
> same size, with paired last reads
[...]

Glad to hear it worked.

Could it be that in previous times the queued job ran out of disk space?

That would be my first guess, as such things are common in shared 
grid/cluster environments, particularly if your job runs in a temporary
and limited storage location (e.g. "/tmp/job-NNNN").

I would suspect that the exit-code you are seeing is the exit code
of the entire job (that is - of the shell script that is being qsub'd),
and not necessarily that of 'split' (then again, this might not be 
correct if you explicitly checked the exit code of 'split').

Given that your grid environment already has configuration issues
(the bash and "module" related errors), I would not be surprised if
the exit code is not reliable.

I would strongly encourage to always look into the STDERR file
of the job to verify no other errors occurred.

Or, perhaps write shell scripts more defensively, like so:

  [...]
  zcat MH1_R1.fastq.gz | split -l 40000000 - DHT_R1_ \
        && echo split MH1_R1 OK \
        || echo split MH1_R1 FAILED
  [...]

Then checking the STDOUT for positive confirmation each program succeeded.
Or perhaps:

  # define a shell function "die" to print an error and terminate
  die()
  {
    base=$(basename "$0")
    echo "$base: error: $*" >&2
    exit 1
  }

  zcat MH1_R1.fastq.gz | split -l 40000000 - DHT_R1_ \
        || die "split MH1_R1 failed"

And then run at least one job that will fail on purpose,
and ensure you see the error message in the STDERR log,
and you get a non-zero exit code (and then ensure you use 'die'
on every command).

It is sometimes recommended to use "set -e" for "easy"
error handling in shell scripts- but I would recommend against it.
Many reasons detailed here: https://mywiki.wooledge.org/BashFAQ/105

It might be more frustrating to add such extra checks on every
program, but from my humble experience, grid environments bring
on so many more intermittent and transient problems that it is
definitely worth it.

> 
> STDERR:
> The only thing in the stderr file is an odd duck of:
> 
> -sh: module: line 1: syntax error: unexpected end of file
> 
> -sh: error importing function definition for `BASH_FUNC_module'
> 
> Python 3.6.8 :: Anaconda, Inc.
> 
> /bin/sh: module: line 1: syntax error: unexpected end of file
> 
> /bin/sh: error importing function definition for `BASH_FUNC_module'
> 
> but this prints for every job I run with this particular flavor of 
> conda/bash and doesn't seem to affect anything else (as far as I know)

These errors are specific to your grid/cluster environment,
and the best place to ask is the I.T or bioinformatics department in
your institute (whomever is in charge of the cluster).

Broadly speaking, "module" is mechanism that ease the use of
various software packages. It is usally setup by your IT administrators.
A typical use-case is to have different version of programs in non-
standard locations, e.g.
   samtools version 1.6 in /opt/it/programs/samtools-1.6
 and
   samtools version 1.9 in /opt/bioinfo/tools/new/samtools/

and then cluster users (e.g. you) just need to add:
   "module load samtools-1.8"
and have the command "samtools" just work without knowing
the gritty details of where the program is.

It seems that in your case, something relating to the "module"
setup is broken.

More information here: 
https://en.wikipedia.org/wiki/Environment_Modules_(software)

> All jobs finished well below allotted memory and with exit status 0, 
> even when split didn't make the right number of output files.
>
> Do you know any reason why the behavior would be inconsistent?

The "alloted memory" is a non-issue for this "split" command,
it will always use very little amount of memory regardless of how big
the input files are.

As for "exit status 0" - I can't be sure, but I suspect the exit status
you see is the one of the entire job (i.e. the shell script),
and perhaps it does not represent the exit code of the "split" program.

If you have the STDERR files of the jobs which failed, it's worth
checking them for any additional error messages.

> 
> Pairing check: unfortunately my server's version of bash doesn't support 
> paste in this way, I've run into this issue before but I forget what the 
> workaround is. I can't run this command interactively because my server 
> times out (these files are > 3 billion lines each, so it takes a long 
> time to zcat them)

Ah yes, the construct:

   program <(other program)

is a "bash" feature that is not available in simple shell scripts
(interactive use vs non-interactive and other things).

One work-around is to run (from inside your script):

  bash -c "paste <(zcat MH1_R2.fastq) <(zcat MH1_R2.fastq.gz)" \
       | awk 'NR%4!=1 { next } $1!=$3 { print "Error in line " NR ":" $1

----

To conclude:

If I understand correctly, the latest attempt worked correctly
and there are no problems in "split".

If this is the case, we can mark this thread as "done".

regards,
 - assaf

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Wed, 26 Jun 2019 14:57:02 GMT) Full text and rfc822 format available.

Message #26 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Assaf Gordon <assafgordon <at> gmail.com>
To: Heather Wick <heather.c.wick <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Wed, 26 Jun 2019 08:56:49 -0600

tag 36130 notabug
close 36130
stop

Hello,

On Mon, Jun 10, 2019 at 04:50:20PM -0600, Assaf Gordon wrote:
> On 2019-06-10 12:28 p.m., Heather Wick wrote:
> > Verbose: This seems to have made the same number of files this time; not
> > sure why the other 3-4 times I ran it it did not. They appear to be the
> > same size, with paired last reads
> [...]
> 
> Glad to hear it worked.
> 
> Could it be that in previous times the queued job ran out of disk space?
> 
> That would be my first guess, as such things are common in shared
> grid/cluster environments, particularly if your job runs in a temporary
> and limited storage location (e.g. "/tmp/job-NNNN").


With no further comments, I'm closing this ticket.
If more issues arise (or this was not adequate solution) we can always
re-open this ticket.

regards,
 -assaf

Added tag(s) notabug. Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Wed, 26 Jun 2019 14:57:02 GMT) Full text and rfc822 format available.

bug closed, send any further explanations to 36130 <at> debbugs.gnu.org and Heather Wick <heather.c.wick <at> gmail.com> Request was from Assaf Gordon <assafgordon <at> gmail.com> to control <at> debbugs.gnu.org. (Wed, 26 Jun 2019 14:57:02 GMT) Full text and rfc822 format available.

Information forwarded to bug-coreutils <at> gnu.org:
bug#36130; Package coreutils. (Wed, 26 Jun 2019 16:09:02 GMT) Full text and rfc822 format available.

Message #33 received at 36130 <at> debbugs.gnu.org (full text, mbox):

From: Heather Wick <heather.c.wick <at> gmail.com>
To: Assaf Gordon <assafgordon <at> gmail.com>
Cc: 36130 <at> debbugs.gnu.org
Subject: Re: bug#36130: split bug
Date: Wed, 26 Jun 2019 12:08:14 -0400

[Message part 1 (text/plain, inline)]

Thank you for all your help. I will let you know if I run into any more
issues. For whatever reason, putting the program on verbose has let it run
with no issues that I can determine related to my initial problems.
Thanks,
~ Heather

On Wed, Jun 26, 2019 at 10:56 AM Assaf Gordon <assafgordon <at> gmail.com> wrote:

> tag 36130 notabug
> close 36130
> stop
>
> Hello,
>
> On Mon, Jun 10, 2019 at 04:50:20PM -0600, Assaf Gordon wrote:
> > On 2019-06-10 12:28 p.m., Heather Wick wrote:
> > > Verbose: This seems to have made the same number of files this time;
> not
> > > sure why the other 3-4 times I ran it it did not. They appear to be the
> > > same size, with paired last reads
> > [...]
> >
> > Glad to hear it worked.
> >
> > Could it be that in previous times the queued job ran out of disk space?
> >
> > That would be my first guess, as such things are common in shared
> > grid/cluster environments, particularly if your job runs in a temporary
> > and limited storage location (e.g. "/tmp/job-NNNN").
>
>
> With no further comments, I'm closing this ticket.
> If more issues arise (or this was not adequate solution) we can always
> re-open this ticket.
>
> regards,
>  -assaf
>


-- 
Heather Wick
PhD Candidate, Human Genetics
Labs of Sarah Wheelan and Vasan Yegnasubramanian
Institute of Genetic Medicine
Johns Hopkins University School of Medicine
hwick1 <at> jhmi.edu

[Message part 2 (text/html, inline)]

bug archived. Request was from Debbugs Internal Request <help-debbugs <at> gnu.org> to internal_control <at> debbugs.gnu.org. (Thu, 25 Jul 2019 11:24:08 GMT) Full text and rfc822 format available.

This bug report was last modified 6 years and 118 days ago.

Previous Next

GNU bug tracking system
Copyright (C) 1999 Darren O. Benham, 1997,2003 nCipher Corporation Ltd, 1994-97 Ian Jackson.

GNU bug report logs - #36130 split bug

GNU bug report logs - #36130
split bug