shenwei356 / seqkit

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

Home Page:https://bioinf.shenwei.me/seqkit

Repository from Github https://github.comshenwei356/seqkitRepository from Github https://github.comshenwei356/seqkit

Improvement for concat

kabilov opened this issue · comments

Hi Wei,

I have PE reading (R1 and R2). R2 is modified with the "seqkit concat" command. As a result, the order of the names in R1 and R2 is different.
It would be useful to have a flag that would preserve the order of names after concatenations even if this would result in a performance penalty or increased memory usage.

Best wishes,
Marsel

seqkit concat is for concatenating sequences with the same IDs from multiple files, which would consume a lot of RAM. I'm not sure of the purpose of using it for paired-end data.

Anyway, you can use seqkit pair to match up paired-end reads from two fastq files.