Construct-ancestral-genome is a protocol for processing present-day genomic data, inferring local ancestry, and assembling ancestral haplotypes.
https://github.com/Shuhua-Group/Construct-ancestral-genome
https://github.com/zhangxx123456/Construct-ancestral-genome
• Python (v3.6), see http://www.python.org
• Plink (v1.9), see https://www.cog-genomics.org/plink/
• Plink (v2.0), see https://www.cog-genomics.org/plink/
• Perl (v5), see http://www.perl.org
• ChromoPainter (v2), see https://github.com/sahwa/ChromoPainterV2
• BCFtools (v1.14), see https://www.htslib.org/download/
We create a Docker image (https://registry.hub.docker.com/r/wangbaonan/shapeit4_perl_image) that provides an environment with all software installed.
scripts/
directory contains the core scripts used in this protocal.
Examples/
directory contains the test data and the execution scripts.
Examples.done/
directory contains all the intermediate files and results generated after running the above pipeline, for user reference.
• For detailed execution, please refer to the 000.run.pipeline.sh
file.
• Before you run it, users need to modify the software paths in the 000.run.pipeline.sh
file (e.g., python=/your/path
).
• Then the pipeline can be quickly executed with the following command.
sh 000.run.pipeline.sh
• To help users run the pipeline more efficiently, we remove the data preprocessing section and the subsequent analysis for constructed ancestral samples. If users require these steps, please refer to our paper.
If you use the Construct-ancestral-genome in your research work, please cite the following papers:
• Reconstructing the ancestral gene pool to uncover the origins and genetic links of Hmong-Mien speakers (BMC Biology, 2024)
By: Xiaoxi Zhang, 2024
Contact: [email protected]