The code is largely organized by programming language. The main workflows are always written as bash scripts, which may call additional bash, SLURM sbatch (for use on a cluster), R, or perl scripts.
Code is in (bash/prepSeq.bash); it downloads and processes SRA sequences so that they conform to the formats (directory structures, file names) expected by downstream steps.
Code is in (bash/gvcfs.bash), which calls (bash/fq2gvcf.bash).
Code is in (bash/combineGenotype.bash) and (bash/genotype.bash), including variant quality recalibration.
Code is in (bash/QC.bash).
Code is in (bash/acc_ind550_imputation.bash).
Code is in the directory (server/).
Code is in the directory (figureCode/); the data needed to generate the figures are in (figureData/).
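
The pattern described above, a main bash script that calls a helper script either directly or through SLURM, can be sketched roughly as follows. This is a minimal dry-run illustration and not the repository's actual code: the function names `pick_launcher` and `plan_calls`, the sample IDs, the per-sample argument, and the fallback to plain bash when `sbatch` is not installed are all assumptions. Only the script path (bash/fq2gvcf.bash) comes from the list above.

```shell
#!/usr/bin/env bash
# Sketch only: sample IDs, the per-sample argument, and the
# sbatch/bash fallback are assumptions, not the repository's logic.

# Choose a launcher: sbatch when SLURM is available, plain bash otherwise.
pick_launcher() {
  if command -v sbatch >/dev/null 2>&1; then
    echo "sbatch"
  else
    echo "bash"
  fi
}

# Print (without executing) one helper-script call per sample ID.
plan_calls() {
  local launcher sample
  launcher="$(pick_launcher)"
  for sample in "$@"; do
    echo "${launcher} bash/fq2gvcf.bash ${sample}"
  done
}

# Hypothetical sample IDs, for illustration only.
plan_calls sampleA sampleB
```

Because the sketch only prints the commands, it can be run on a laptop or a cluster login node to inspect what would be launched before anything is submitted.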