Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alignment to palmDB viral index is too slow compared to host index for scRNAseq 10xV3 reads #3

Open
gauravgadhvi opened this issue May 20, 2024 · 0 comments

Comments

@gauravgadhvi
Copy link

Hi,

Thank you for the fantastic method for viral quantification and sharing various analyses notebooks.

I am trying to apply your method to a large set of human 10x scRNAseq datasets of ~ 50 Billion reads in order to detect and quantify viral sequences from palmDB. I am using the option 7 i.e. Capturing host reads before alignment to palmDB. During the alignment step, it took around 50-60 hours for the host reads to align and is taking way longer for the viral reads to align to PalmDB.

Currently, it is running for over 4 days and has processed close to 25% of total 50B reads. Is there a way to speed up the alignment for viral reads or is it expected to take this long for alignment to palmDB given the large amount of 10xV3 sequencing reads?
Here is the command I used for aligning the reads to palmDB :

kallisto bus -n --aa -i ./humanCDNA_masked__virus_index.idx -o ./virus -t 8 -B ./HIVdonors_allSamples_batchFile.txt --batch-barcodes --rf-stranded -x 10xv3 --verbose

If there is any way to make it faster that would be really helpful to speed up this viral alignment, thank you so much for your help!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant