Positive selection (via scikit-allel) and Manhattan plot with Python

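A Python pipeline that computes the Population Branch Statistic (PBS) and Tajima's D, then plots and tabulates the results.

It contains two notebooks: one for the analysis (PBSTajD_acrossChrom.ipynb) and one for the plots and tables (Plots&TablesPBSTajDv2.ipynb).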

The input files are the phased VCF genomes of the 1000 Genomes Project (1KGP), which can be downloaded from the 1KGP FTP server (release 20130502). I downloaded them into a folder called data using the following script:

mkdir data

cd data

for c in {1..22}
do wget ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/ALL.chr${c}.phase3_shapeit2_mvncall_integrated_v5a.20130502.genotypes.vcf.gz
done
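Before starting the analysis it can help to confirm that all 22 autosome files actually arrived. The following is a minimal sketch, not part of the repository: the helper names `expected_vcfs` and `missing_vcfs` are hypothetical, and the file-name pattern is simply copied from the download loop above.

```python
from pathlib import Path

# File-name pattern copied from the download loop; {chrom} is the autosome number.
VCF_PATTERN = (
    "ALL.chr{chrom}.phase3_shapeit2_mvncall_integrated_v5a"
    ".20130502.genotypes.vcf.gz"
)


def expected_vcfs(data_dir="data", chroms=range(1, 23)):
    """Return the path each chromosome's VCF should have inside data_dir."""
    return [Path(data_dir) / VCF_PATTERN.format(chrom=c) for c in chroms]


def missing_vcfs(data_dir="data", chroms=range(1, 23)):
    """Return the expected VCF paths that are absent or empty."""
    return [
        p for p in expected_vcfs(data_dir, chroms)
        if not p.is_file() or p.stat().st_size == 0
    ]


if __name__ == "__main__":
    missing = missing_vcfs()
    if missing:
        print("Missing or empty files:")
        for path in missing:
            print("  ", path)
    else:
        print("All 22 autosome VCFs are present.")
```

This only checks for presence and non-zero size; it does not verify that a download completed without truncation.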

The example uses PBS to look for signals of positive selection in Southern European populations (TSI and IBS) relative to CHB, with YRI as the outgroup. A delta Tajima's D is also computed for the TSI population only, again using YRI as the outgroup.
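To make the two statistics concrete, here is a small pure-Python sketch of the textbook formulas. It is an illustration under stated assumptions rather than the notebooks' code: the function names are hypothetical, the inputs are already-computed pairwise Fst values and per-window summary counts, and delta Tajima's D is assumed to mean the target population's D minus the outgroup's D. scikit-allel offers windowed implementations such as `allel.pbs` and `allel.windowed_tajima_d`; whether the notebooks call exactly these is not stated here.

```python
import math


def fst_to_branch_length(fst):
    """Convert pairwise Fst into a divergence time T = -log(1 - Fst)."""
    # Clamp so negative Fst estimates and Fst == 1 do not break the log.
    fst = min(max(fst, 0.0), 1.0 - 1e-12)
    return -math.log(1.0 - fst)


def pbs(fst_target_sister, fst_target_outgroup, fst_sister_outgroup):
    """Population Branch Statistic for the target population.

    With the populations above: target = TSI+IBS, sister = CHB, outgroup = YRI.
    """
    t_ts = fst_to_branch_length(fst_target_sister)
    t_to = fst_to_branch_length(fst_target_outgroup)
    t_so = fst_to_branch_length(fst_sister_outgroup)
    return (t_ts + t_to - t_so) / 2.0


def tajima_d(n, seg_sites, pi):
    """Tajima's D from n sampled chromosomes, the number of segregating
    sites and the mean number of pairwise differences (pi)."""
    if n < 2 or seg_sites == 0:
        return float("nan")
    a1 = sum(1.0 / i for i in range(1, n))
    a2 = sum(1.0 / i ** 2 for i in range(1, n))
    b1 = (n + 1) / (3.0 * (n - 1))
    b2 = 2.0 * (n * n + n + 3) / (9.0 * n * (n - 1))
    c1 = b1 - 1.0 / a1
    c2 = b2 - (n + 2) / (a1 * n) + a2 / a1 ** 2
    e1 = c1 / a1
    e2 = c2 / (a1 ** 2 + a2)
    variance = e1 * seg_sites + e2 * seg_sites * (seg_sites - 1)
    if variance <= 0:
        return float("nan")
    return (pi - seg_sites / a1) / math.sqrt(variance)


def delta_tajima_d(d_target, d_outgroup):
    """ASSUMPTION: delta D is defined as D(target) - D(outgroup)."""
    return d_target - d_outgroup
```

A window with a high PBS and a strongly negative delta Tajima's D would be a candidate region under this reading, but the thresholds used in the notebooks are not given here.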

For faster results, export PBSTajD_acrossChrom.ipynb as a Python script and run it on a cluster inside a conda environment. The analysis can also be applied to a single chromosome or a few chromosomes with small adjustments to the scripts.
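One possible way to make the exported script work on a subset of chromosomes is to read the chromosome list from the command line. The sketch below is hypothetical: the `--chroms` option and the `parse_chromosomes` helper do not exist in the repository and are only meant to show the kind of small adjustment involved.

```python
import argparse


def parse_chromosomes(argv=None):
    """Return the sorted, de-duplicated list of autosomes to analyse."""
    parser = argparse.ArgumentParser(
        description="Hypothetical wrapper: choose which autosomes to analyse."
    )
    parser.add_argument(
        "--chroms",
        type=int,
        nargs="+",
        choices=range(1, 23),
        default=list(range(1, 23)),  # all autosomes when the flag is omitted
        metavar="N",
        help="autosome numbers between 1 and 22",
    )
    args = parser.parse_args(argv)
    return sorted(set(args.chroms))


if __name__ == "__main__":
    for chrom in parse_chromosomes():
        # Placeholder: the per-chromosome analysis from the notebook goes here.
        print(f"would analyse chromosome {chrom}")
```

On a cluster, each job could then be given a different `--chroms` value so that chromosomes are processed in parallel.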

Acknowledgement for the Manhattan plot code can be found here.
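For readers unfamiliar with how a Manhattan plot is laid out, the core step is turning per-chromosome positions into one genome-wide x-axis. The following is a minimal sketch and not the acknowledged code: the function names are hypothetical, the largest observed position is used as a stand-in for each chromosome's length, and matplotlib is assumed to be installed for the plotting part.

```python
from collections import defaultdict


def cumulative_positions(records):
    """Map (chrom, pos, value) records onto a single genome-wide axis.

    Returns (xs, ys, chroms, ticks); ticks maps each chromosome to the
    x coordinate of its midpoint, for labelling the axis.
    """
    records = sorted(records)
    # ASSUMPTION: largest observed position approximates chromosome length.
    length = defaultdict(int)
    for chrom, pos, _ in records:
        length[chrom] = max(length[chrom], pos)
    offset, ticks, running = {}, {}, 0
    for chrom in sorted(length):
        offset[chrom] = running
        ticks[chrom] = running + length[chrom] / 2
        running += length[chrom]
    xs = [offset[c] + p for c, p, _ in records]
    ys = [v for _, _, v in records]
    chroms = [c for c, _, _ in records]
    return xs, ys, chroms, ticks


def manhattan_plot(records, out_path="manhattan.png", ylabel="PBS"):
    """Scatter the values along the genome, alternating colours by chromosome."""
    import matplotlib
    matplotlib.use("Agg")  # render to file without needing a display
    import matplotlib.pyplot as plt

    xs, ys, chroms, ticks = cumulative_positions(records)
    colors = ["tab:blue" if c % 2 else "tab:gray" for c in chroms]
    fig, ax = plt.subplots(figsize=(12, 4))
    ax.scatter(xs, ys, s=4, c=colors)
    ax.set_xticks(list(ticks.values()))
    ax.set_xticklabels([str(c) for c in ticks])
    ax.set_xlabel("Chromosome")
    ax.set_ylabel(ylabel)
    fig.savefig(out_path, dpi=150, bbox_inches="tight")
    plt.close(fig)
```

Here `records` would be the per-window results of the analysis notebook, for example tuples of chromosome number, window midpoint and PBS value.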

raveancic/selection_scikit-allel_py
