mollysacks/prf-search

PRF Search

By Molly Sacks

This codebase searches a bacterial genome for potential -1 programmed ribosomal frameshifting (PRF) events. Currently, it can be run locally on small queries (fewer than 50 cDNAs). Cloud support is being added soon!

To run this code, you will need working installations of:

  1. hmmer
  2. RScape
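  3. python3 and the packages argparse, os, datetime, pandas, shutil, json, RNA (the Python wrapper for ViennaRNA), localcider, and Bio.Seq (part of Biopython). I installed these with Bioconda; argparse, os, datetime, shutil, and json ship with Python itself.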

Any recent stable release of each should work.
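Before a first run, it can help to confirm that the requirements above are visible from your environment. The following is a minimal sketch, not part of this repository. The executable name `nhmmer` and the top-level module names are assumptions (the list above names the tools but not the exact binaries or import names), so adjust them to your installation. R-scape is left out of the default executable list because its path is passed explicitly on the command line.

```python
import importlib.util
import shutil

# Assumed names: edit these to match the binaries and modules you installed.
REQUIRED_EXECUTABLES = ["nhmmer"]  # assumption: one of the HMMER programs
REQUIRED_MODULES = ["pandas", "RNA", "localcider", "Bio"]


def find_missing(executables, modules):
    """Return (missing_executables, missing_modules) for this environment."""
    # shutil.which returns None when the program is not on PATH.
    missing_exes = [exe for exe in executables if shutil.which(exe) is None]
    # find_spec returns None when a top-level module cannot be located.
    missing_mods = [m for m in modules if importlib.util.find_spec(m) is None]
    return missing_exes, missing_mods


if __name__ == "__main__":
    exes, mods = find_missing(REQUIRED_EXECUTABLES, REQUIRED_MODULES)
    print("Missing executables:", exes or "none")
    print("Missing Python modules:", mods or "none")
```

An empty result only shows that the names resolve; it does not check versions.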

E_coli_small_2021_12_08-21_53_38.report.tsv contains the output from searching through small_ecoli.fa, aligned against the database bacteria.1236.1.genomic.fna. Intermediate files (including hmmer and RScape/CaCoFold output) are in the directory E_coli_small_2021_12_08-21_53_38. Warning: bacteria.1236.1.genomic.fna is quite large!
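To take a quick look at a report file, a tab-separated reader from the standard library is enough. This is a minimal sketch that assumes the first row of the report is a header; the column names are not documented here, so the code simply lists whatever it finds rather than assuming any.

```python
import csv


def load_report(path):
    """Read a tab-separated report; return (column_names, rows_as_dicts)."""
    with open(path, newline="") as handle:
        # Assumption: the first line of the file holds the column names.
        reader = csv.DictReader(handle, delimiter="\t")
        rows = list(reader)
        columns = list(reader.fieldnames or [])
    return columns, rows


if __name__ == "__main__":
    import sys

    if len(sys.argv) > 1:
        columns, rows = load_report(sys.argv[1])
        print("Columns:", columns)
        print("Rows:", len(rows))
```

Run it with the report path as its only argument, for example the E_coli_small report named above.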

I generated this output with the following command, leaving every other parameter at its default:

bash search.sh -o E_coli_small_ -q /path/to/small_ecoli.fa -d /path/to/bacteria.1236.1.genomic.fna -r /path/to/rscape_v1.6.1/bin/R-scape -p /path/to/prf-search
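If you would rather launch the search from Python, the same command can be assembled as an argument list. This is a sketch only: the flag meanings in the comments (-o output prefix, -q query FASTA, -d database, -r R-scape binary, -p repository path) are inferred from the example above rather than from documented help text, and `build_search_command` and `run_search` are hypothetical helper names, not part of this repository.

```python
import shlex
import subprocess


def build_search_command(output_prefix, query, database, rscape, repo,
                         script="search.sh"):
    """Assemble the search.sh invocation as a list of arguments."""
    return [
        "bash", script,
        "-o", output_prefix,  # inferred: prefix for the timestamped output
        "-q", query,          # inferred: FASTA file of query sequences
        "-d", database,       # inferred: genomic database to search
        "-r", rscape,         # inferred: path to the R-scape binary
        "-p", repo,           # inferred: path to this repository
    ]


def run_search(command):
    """Run the command and raise CalledProcessError if it fails."""
    return subprocess.run(command, check=True)


if __name__ == "__main__":
    command = build_search_command(
        "E_coli_small_",
        "/path/to/small_ecoli.fa",
        "/path/to/bacteria.1236.1.genomic.fna",
        "/path/to/rscape_v1.6.1/bin/R-scape",
        "/path/to/prf-search",
    )
    # Print the command for inspection; call run_search(command) to execute.
    print(shlex.join(command))
```

Passing a list rather than a single string avoids shell quoting problems when a path contains spaces.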

About

Senior Thesis Project!
