utils
auroramaurizio committed Sep 1, 2024
1 parent f07fefe commit 68fe50f
Showing 1 changed file with 11 additions and 2 deletions.
13 changes: 11 additions & 2 deletions utils/split_reads/split_reads.md
@@ -7,8 +7,11 @@ https://bioinf.shenwei.me/seqkit/usage/
### Input files:

CELL3326_S16_L001_R1_001.fastq.gz

CELL3326_S16_L001_R2_001.fastq.gz

CELL3326_S16_L002_R1_001.fastq.gz

CELL3326_S16_L002_R2_001.fastq.gz

### Command:
@@ -20,9 +23,15 @@ seqkit split2 -1 CELL3326_S16_L002_R1_001.fastq.gz -2 CELL3326_S16_L002_R2_001.f
### Comment:

seqkit splits the reads of the original paired-end (PE) files into N chunks (in this case 2 chunks were enough) so that:

the first read of the original R1 file goes to R1_part_001 file,

the second read of the original R1 file goes to R1_part_002 file,

the third read of the original R1 file goes to R1_part_001 file,

the fourth read of the original R1 file goes to R1_part_002 file, and so on.

The same goes for R2 reads.

R1 and R2 reads are matched in R1_part_001 and R2_part_001 files, as well as in R1_part_002 and R2_part_002.
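
The round-robin assignment described in this comment can be illustrated with a small toy model in Python. This is only a sketch of the behaviour as described here, not seqkit's implementation; the function name `split_round_robin` and the read names are hypothetical, and real FASTQ records would also carry sequence and quality lines.

```python
def split_round_robin(reads, n_parts):
    """Distribute reads over n_parts chunks in round-robin order."""
    parts = [[] for _ in range(n_parts)]
    for i, read in enumerate(reads):
        # Read 1 -> part 1, read 2 -> part 2, read 3 -> part 1, ...
        parts[i % n_parts].append(read)
    return parts


# Hypothetical read names standing in for the records of the R1 and R2 files.
r1 = [f"read{i}/1" for i in range(1, 7)]
r2 = [f"read{i}/2" for i in range(1, 7)]

r1_parts = split_round_robin(r1, 2)
r2_parts = split_round_robin(r2, 2)

# Chunk k of R1 and chunk k of R2 hold the same reads in the same order,
# because both files are split with the same rule.
for k, (p1, p2) in enumerate(zip(r1_parts, r2_parts), start=1):
    print(f"part_{k:03d}", p1, p2)
```

Because the same rule is applied to both files, mates stay together only if R1 and R2 list their reads in the same order, which is the normal layout for paired-end FASTQ files.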
