Skip to content

Commit

Permalink
Create README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
phoenixAja authored Apr 11, 2024
1 parent a729caf commit 6174c74
Showing 1 changed file with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions workflows/index-generation/helpful_for_analysis/README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,6 @@
### jupyter notebooks used for helpful common analysis steps including:
* querying NCBI for an accession and lineage (used to investigate reads in the "all taxa with neither family nor genus classification" report for mNGS)
* querying marisa trie files - notebook to easily query all marisa trie files generated from index generation workflow above.
* compare non host alignment times between two projects - this was used to benchmark how long it took to do non host alignment on old and new indexes.
* generate taxon lineage changelog for a sample report - get a readout on which reads from a sample report have a taxon / lineage change between new and old index runs. Used for comp bio validation purposes mainly.
* checking sequence retention for different index compression runs - this notebook was handy for running multiple compression runs and summarizing which reads were dropped, helpful for early analysis and benchmarking of compression workflow.

0 comments on commit 6174c74

Please sign in to comment.