Skip to content

Latest commit

 

History

History
32 lines (27 loc) · 1.32 KB

README.md

File metadata and controls

32 lines (27 loc) · 1.32 KB

muPBWT-1KGP-workflow

Snakemake workflow for testing μ-PBWT against Durbin's PBWT and Syllable-PBWT on 1000 Genome Project (1KGP) phase 3 data. 1KGP data are available at this link.

Prerequisite

Snakemake need to be already installed, for example via conda:

conda create -c conda-forge -c bioconda -n snakemake snakemake

Run pipeline

cd muPBWT-1KGP-workflow
snakemake --cores <num_cores> --use-conda --resources load=100

The option --resources load=100 option will avoid using too much RAM for Durbin Algorithm 5 (about 500gb is still needed), running only one job at a time for the rule runPbwtIndexed(thanks Jan Schreiber).

Results

The pipeline will generatecd some results:

  • in results/data some useful CSV files
  • in results/plots some plots in PDF format
  • in results/tables some tables in LaTeX syntax

Pipeline overview

Pipeline rules