Hello Everyone, Good Morning/Evening
I was running Nextstrain on local data, I got this error message:
[Wed Jan 26 12:59:21 2022]
Error in rule align:
jobid: 11
output: results/aligned_RUNTEST.fasta.xz, results/insertions_RUNTEST.tsv, results/translations/seqs_RUNTEST.gene.ORF1a.fasta.xz, results/translations/seqs_RUNTEST.gene.ORF1b.fasta.xz, results/translations/seqs_RUNTEST.gene.S.fasta.xz, results/translations/seqs_RUNTEST.gene.ORF3a.fasta.xz, results/translations/seqs_RUNTEST.gene.E.fasta.xz, results/translations/seqs_RUNTEST.gene.M.fasta.xz, results/translations/seqs_RUNTEST.gene.ORF6.fasta.xz, results/translations/seqs_RUNTEST.gene.ORF7a.fasta.xz, results/translations/seqs_RUNTEST.gene.ORF7b.fasta.xz, results/translations/seqs_RUNTEST.gene.ORF8.fasta.xz, results/translations/seqs_RUNTEST.gene.N.fasta.xz, results/translations/seqs_RUNTEST.gene.ORF9b.fasta.xz
log: logs/align_RUNTEST.txt (check log file(s) for error message)
shell:
python3 scripts/sanitize_sequences.py --sequences data/RUNTEST.sequences.fasta --strip-prefixes hCoV-19/ SARS-CoV-2/ --output /dev/stdout 2> logs/sanitize_sequences_RUNTEST.txt | nextalign --jobs=1 --reference defaults/reference_seq.fasta --genemap defaults/annotation.gff --genes ORF1a,ORF1b,S,ORF3a,E,M,ORF6,ORF7a,ORF7b,ORF8,N,ORF9b --sequences /dev/stdin --output-dir results/translations --output-basename seqs_RUNTEST --output-fasta results/aligned_RUNTEST.fasta --output-insertions results/insertions_RUNTEST.tsv > logs/align_RUNTEST.txt 2>&1;
xz -2 results/aligned_RUNTEST.fasta;
xz -2 results/translations/seqs_RUNTEST*.fasta
(one of the commands exited with non-zero exit code; note that snakemake uses bash strict mode!)
Logfile logs/align_RUNTEST.txt:
[ERROR] Nextalign: Error: when running the internal parallel pipeline: When parsing input sequences: Input stream ("/dev/stdin") is empty or corrupted. Aborting.
Removing output files of failed job align since they might be corrupted:
results/insertions_RUNTEST.tsv
Shutting down, this might take some time.
Exiting because a job execution failed. Look above for error message
Complete log: /mnt/e/Bioinformatics/nextstrain/nextstrain/ncov/.snakemake/log/2022-01-26T125920.612835.snakemake.log
This is the content of the log file:
RUNTEST.txt
[ERROR] Nextalign: Error: when running the internal parallel pipeline: When parsing input sequences: Input stream ("/dev/stdin") is empty or corrupted. Aborting.
sanitize_sequences_RUNTEST.txt
Traceback (most recent call last):
File "/mnt/e/Bioinformatics/nextstrain/nextstrain/ncov/scripts/sanitize_sequences.py", line 2, in <module>
from augur.io import open_file, read_sequences, write_sequences
ModuleNotFoundError: No module named 'augur'
I tried the solutions mentioned on this link `/dev/stdout` not handled properly for the snakemake workflow · Issue #754 · nextstrain/ncov · GitHub, but did not work
I appreciate the help,
Thank you