From 4bf218ff9c05963161aca353cff84040889de668 Mon Sep 17 00:00:00 2001
From: David Ochoa
+
+
-
+
- +
Open Targets Gentropy is a Python package to facilitate the interpretation and analysis of GWAS and functional genomic studies for target identification. The package contains a toolkit for the harmonisation, statistical analysis and prioritisation of genetic signals to assist drug discovery. diff --git a/docs/development/airflow.md b/docs/development/airflow.md index b73ad614e..ff5f7906c 100644 --- a/docs/development/airflow.md +++ b/docs/development/airflow.md @@ -8,12 +8,14 @@ This section describes how to set up a local Airflow server which will orchestra - [Google Cloud SDK](https://cloud.google.com/sdk/docs/install) !!! warning macOS Docker memory allocation -On macOS, the default amount of memory available for Docker might not be enough to get Airflow up and running. Allocate at least 4GB of memory for the Docker Engine (ideally 8GB). [More info](https://airflow.apache.org/docs/apache-airflow/stable/howto/docker-compose/index.html#) + + On macOS, the default amount of memory available for Docker might not be enough to get Airflow up and running. Allocate at least 4GB of memory for the Docker Engine (ideally 8GB). [More info](https://airflow.apache.org/docs/apache-airflow/stable/howto/docker-compose/index.html#) ## Configure Airflow access to Google Cloud Platform !!! warning Specifying Google Cloud parameters -Run the next two command with the appropriate Google Cloud project ID and service account name to ensure the correct Google default application credentials are set up. + + Run the next two command with the appropriate Google Cloud project ID and service account name to ensure the correct Google default application credentials are set up. Authenticate to Google Cloud: @@ -38,7 +40,8 @@ cd src/airflow ### Build Docker image !!! note Custom Docker image for Airflow -The custom Dockerfile built by the command below extends the official [Airflow Docker Compose YAML](https://airflow.apache.org/docs/apache-airflow/stable/docker-compose.yaml). We add support for Google Cloud SDK, Google Dataproc operators, and access to GCP credentials. + + The custom Dockerfile built by the command below extends the official [Airflow Docker Compose YAML](https://airflow.apache.org/docs/apache-airflow/stable/docker-compose.yaml). We add support for Google Cloud SDK, Google Dataproc operators, and access to GCP credentials. ```bash docker build . --tag extending_airflow:latest @@ -47,7 +50,8 @@ docker build . --tag extending_airflow:latest ### Set Airflow user ID !!! note Setting Airflow user ID -These commands allow Airflow running inside Docker to access the credentials file which was generated earlier. + + These commands allow Airflow running inside Docker to access the credentials file which was generated earlier. ```bash # If any user ID is already specified in .env, remove it. diff --git a/docs/development/contributing.md b/docs/development/contributing.md index 1b4e5451f..a12ac4951 100644 --- a/docs/development/contributing.md +++ b/docs/development/contributing.md @@ -8,7 +8,7 @@ title: Contributing guidelines The steps in this section only ever need to be done once on any particular system. -Google Cloud configuration: +For Google Cloud configuration: 1. Install Google Cloud SDK: https://cloud.google.com/sdk/docs/install. diff --git a/docs/howto/run_step_using_config.md b/docs/howto/run_step_using_config.md index 926cdbf45..bbce9a195 100644 --- a/docs/howto/run_step_using_config.md +++ b/docs/howto/run_step_using_config.md @@ -7,7 +7,8 @@ Title: Run step using config It's possible to parametrise the functionality of a step using a YAML configuration file. This is useful when you want to run a step multiple times with different parameters or simply to avoid having to specify the same parameters every time you run a step. !!! info Configuration files using Hydra -The package uses [Hydra](https://hydra.cc) to handle configuration files. For more information, please visit the [Hydra documentation](https://hydra.cc/docs/intro/). + + The package uses [Hydra](https://hydra.cc) to handle configuration files. For more information, please visit the [Hydra documentation](https://hydra.cc/docs/intro/). To run a step using a configuration file, you need to create a configuration file in YAML format. diff --git a/docs/index.md b/docs/index.md index fb84698fb..31762c17e 100644 --- a/docs/index.md +++ b/docs/index.md @@ -1,13 +1,13 @@ --- -title: Open Targets Genetics +title: Open Targets Gentropy hide: - navigation - toc --- -- -
+ + + ---- + [![Ruff](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/ruff/main/assets/badge/v2.json)](https://github.com/astral-sh/ruff) [![image](https://github.com/opentargets/gentropy/actions/workflows/release.yaml/badge.svg)](https://opentargets.github.io/gentropy/) @@ -25,7 +25,7 @@ hide: --- -Open Targets Gentropy is a Python package to facilitate the interpretation and analysis of GWAS and functional genomic studies for target identification. The package contains a toolkit for the harmonisation, statistical analysis and prioritisation of genetic signals to assist drug discovery. +Open Targets Gentropy is a Python package to facilitate the interpretation and analysis of GWAS and functional genomic studies for target identification. This package contains a toolkit for the harmonisation, statistical analysis and prioritisation of genetic signals to assist drug discovery. ## About Open Targets diff --git a/docs/installation.md b/docs/installation.md index 8e9c130bc..07c5493f0 100644 --- a/docs/installation.md +++ b/docs/installation.md @@ -7,8 +7,26 @@ hide: # Installation -To install Open Targets Gentropy, we recommend using PyPI. You can install it using the following pip command: +!!! note Python compatibility -``` + In the early stages of development, we are using Python version 3.10. We recommend using [pyenv](https://github.com/pyenv/pyenv) or similar tools to manage your local Python version. We intend to support more Python versions in the future. + +## Pypi + +We recommend installing Open Targets Gentropy using Pypi: + +```bash pip install gentropy ``` + +## Source + +Alternatively, you can install Open Targets Gentropy from source. Check the [contributing](development/contributing.md) section for more information. + +## Uninstall + +```bash +pip uninstall gentropy -y +``` + +For any issues with the installation, check the [troubleshooting section](development/troubleshooting.md). diff --git a/docs/python_api/_python_api.md b/docs/python_api/_python_api.md index 32ee92820..02c1941af 100644 --- a/docs/python_api/_python_api.md +++ b/docs/python_api/_python_api.md @@ -1,5 +1,12 @@ --- -title: Open Targets Gentropy +title: Python API --- Open Targets Gentropy is a Python package to facilitate the interpretation and analysis of GWAS and functional genomic studies for target identification. The package contains a toolkit for the harmonisation, statistical analysis and prioritisation of genetic signals to assist drug discovery. + +The Overall architecture of the package distinguishes between: + +- [**Data Sources**](datasources/_datasources.md): data sources harmonisation tools +- [**Datasets**](datasets/_datasets.md): data model +- [**Methods**](methods/_methods.md): statistical analysis tools +- [**Steps**](steps/_steps.md): pipeline steps diff --git a/docs/python_api/dataset/_dataset.md b/docs/python_api/dataset/_dataset.md deleted file mode 100644 index 4f2fdb6f6..000000000 --- a/docs/python_api/dataset/_dataset.md +++ /dev/null @@ -1,5 +0,0 @@ ---- -title: Dataset ---- - -::: gentropy.dataset.dataset.Dataset diff --git a/docs/python_api/datasets/_datasets.md b/docs/python_api/datasets/_datasets.md new file mode 100644 index 000000000..058ab5fa2 --- /dev/null +++ b/docs/python_api/datasets/_datasets.md @@ -0,0 +1,7 @@ +--- +title: Datasets +--- + +The Dataset classes define the data model behind Open Targets Gentropy. Every class inherits from the `Dataset` class and contains a dataframe with a predefined schema that can be found in the respective classes. + +::: gentropy.dataset.dataset.Dataset diff --git a/docs/python_api/dataset/colocalisation.md b/docs/python_api/datasets/colocalisation.md similarity index 100% rename from docs/python_api/dataset/colocalisation.md rename to docs/python_api/datasets/colocalisation.md diff --git a/docs/python_api/dataset/gene_index.md b/docs/python_api/datasets/gene_index.md similarity index 100% rename from docs/python_api/dataset/gene_index.md rename to docs/python_api/datasets/gene_index.md diff --git a/docs/python_api/dataset/intervals.md b/docs/python_api/datasets/intervals.md similarity index 100% rename from docs/python_api/dataset/intervals.md rename to docs/python_api/datasets/intervals.md diff --git a/docs/python_api/dataset/l2g_feature.md b/docs/python_api/datasets/l2g_feature.md similarity index 100% rename from docs/python_api/dataset/l2g_feature.md rename to docs/python_api/datasets/l2g_feature.md diff --git a/docs/python_api/dataset/l2g_feature_matrix.md b/docs/python_api/datasets/l2g_feature_matrix.md similarity index 100% rename from docs/python_api/dataset/l2g_feature_matrix.md rename to docs/python_api/datasets/l2g_feature_matrix.md diff --git a/docs/python_api/dataset/l2g_gold_standard.md b/docs/python_api/datasets/l2g_gold_standard.md similarity index 100% rename from docs/python_api/dataset/l2g_gold_standard.md rename to docs/python_api/datasets/l2g_gold_standard.md diff --git a/docs/python_api/dataset/l2g_prediction.md b/docs/python_api/datasets/l2g_prediction.md similarity index 100% rename from docs/python_api/dataset/l2g_prediction.md rename to docs/python_api/datasets/l2g_prediction.md diff --git a/docs/python_api/dataset/ld_index.md b/docs/python_api/datasets/ld_index.md similarity index 100% rename from docs/python_api/dataset/ld_index.md rename to docs/python_api/datasets/ld_index.md diff --git a/docs/python_api/dataset/study_index.md b/docs/python_api/datasets/study_index.md similarity index 100% rename from docs/python_api/dataset/study_index.md rename to docs/python_api/datasets/study_index.md diff --git a/docs/python_api/dataset/study_locus.md b/docs/python_api/datasets/study_locus.md similarity index 100% rename from docs/python_api/dataset/study_locus.md rename to docs/python_api/datasets/study_locus.md diff --git a/docs/python_api/dataset/study_locus_overlap.md b/docs/python_api/datasets/study_locus_overlap.md similarity index 100% rename from docs/python_api/dataset/study_locus_overlap.md rename to docs/python_api/datasets/study_locus_overlap.md diff --git a/docs/python_api/dataset/summary_statistics.md b/docs/python_api/datasets/summary_statistics.md similarity index 100% rename from docs/python_api/dataset/summary_statistics.md rename to docs/python_api/datasets/summary_statistics.md diff --git a/docs/python_api/dataset/variant_annotation.md b/docs/python_api/datasets/variant_annotation.md similarity index 100% rename from docs/python_api/dataset/variant_annotation.md rename to docs/python_api/datasets/variant_annotation.md diff --git a/docs/python_api/dataset/variant_index.md b/docs/python_api/datasets/variant_index.md similarity index 100% rename from docs/python_api/dataset/variant_index.md rename to docs/python_api/datasets/variant_index.md diff --git a/docs/python_api/dataset/variant_to_gene.md b/docs/python_api/datasets/variant_to_gene.md similarity index 100% rename from docs/python_api/dataset/variant_to_gene.md rename to docs/python_api/datasets/variant_to_gene.md diff --git a/docs/python_api/datasource/_datasource.md b/docs/python_api/datasource/_datasource.md deleted file mode 100644 index 9fab444bf..000000000 --- a/docs/python_api/datasource/_datasource.md +++ /dev/null @@ -1,24 +0,0 @@ ---- -title: Data Source ---- - -# Data Source - -This section contains information about the data sources used in Open Targets Genetics. - -We use GnomAD v4.0 as a source for variant annotation and GnomAD v2.1.1 as a source for linkage disequilibrium (LD) information (described in the **GnomAD** section). - -We rely on Open Targets as a source for the list of targets and the Gold Standard training set (described in the **Open Targets** section). - -## Study Sources - -1. GWAS catalog -2. FinnGen - -## Molecular QTLs - -1. eQTL catalogue - -## Interaction / Interval-based Experiments - -We integrate a list of studies that focus on interaction and interval-based investigations, shedding light on the intricate relationships between genetic elements and their functional implications. For more detils see section **"Intervals"**. diff --git a/docs/python_api/datasources/_datasources.md b/docs/python_api/datasources/_datasources.md new file mode 100644 index 000000000..c06bef177 --- /dev/null +++ b/docs/python_api/datasources/_datasources.md @@ -0,0 +1,37 @@ +--- +title: Data Sources +--- + +# Data Sources + +This section contains information about the data source harmonisation tools available in Open Targets Gentropy. + +## GWAS study sources + +1. [GWAS catalog](gwas_catalog/_gwas_catalog.md) (with or without full summary statistics) +1. [FinnGen](finngen/_finngen.md) + +## Molecular QTLs + +1. [GTEx (eQTL catalogue)](eqtl_catalogue/_eqtl_catalogue.md) + +## Interaction / Interval-based Experiments + +1. [Intervals](intervals/_intervals.md)-based datasets, informing about the relationships between genetic elements and their functional implications. + +## Variant annotation/validation + +1. [GnomAD](gnomad/_gnomad.md) v4.0 +1. GWAS catalog harmonisation pipeline [more info](https://www.ebi.ac.uk/gwas/docs/methods/summary-statistics#_harmonised_summary_statistics_data) + +## Linkage desiquilibrium + +1. [GnomAD](gnomad/_gnomad.md) v2.1.1 LD matrixes (7 ancestries) + +## Locus-to-gene gold standard + +1. [Open Targets training set](open_targets/l2g_gold_standard.md) + +## Gene annotation + +1. [Open Targets Platform Target Dataset](open_targets/target.md) (derived from Ensembl) diff --git a/docs/python_api/datasource/eqtl_catalogue/_eqtl_catalogue.md b/docs/python_api/datasources/eqtl_catalogue/_eqtl_catalogue.md similarity index 100% rename from docs/python_api/datasource/eqtl_catalogue/_eqtl_catalogue.md rename to docs/python_api/datasources/eqtl_catalogue/_eqtl_catalogue.md diff --git a/docs/python_api/datasource/eqtl_catalogue/study_index.md b/docs/python_api/datasources/eqtl_catalogue/study_index.md similarity index 100% rename from docs/python_api/datasource/eqtl_catalogue/study_index.md rename to docs/python_api/datasources/eqtl_catalogue/study_index.md diff --git a/docs/python_api/datasource/eqtl_catalogue/summary_stats.md b/docs/python_api/datasources/eqtl_catalogue/summary_stats.md similarity index 100% rename from docs/python_api/datasource/eqtl_catalogue/summary_stats.md rename to docs/python_api/datasources/eqtl_catalogue/summary_stats.md diff --git a/docs/python_api/datasource/finngen/_finngen.md b/docs/python_api/datasources/finngen/_finngen.md similarity index 100% rename from docs/python_api/datasource/finngen/_finngen.md rename to docs/python_api/datasources/finngen/_finngen.md diff --git a/docs/python_api/datasource/finngen/study_index.md b/docs/python_api/datasources/finngen/study_index.md similarity index 100% rename from docs/python_api/datasource/finngen/study_index.md rename to docs/python_api/datasources/finngen/study_index.md diff --git a/docs/python_api/datasource/gnomad/_gnomad.md b/docs/python_api/datasources/gnomad/_gnomad.md similarity index 100% rename from docs/python_api/datasource/gnomad/_gnomad.md rename to docs/python_api/datasources/gnomad/_gnomad.md diff --git a/docs/python_api/datasource/gnomad/gnomad_ld.md b/docs/python_api/datasources/gnomad/gnomad_ld.md similarity index 100% rename from docs/python_api/datasource/gnomad/gnomad_ld.md rename to docs/python_api/datasources/gnomad/gnomad_ld.md diff --git a/docs/python_api/datasource/gnomad/gnomad_variants.md b/docs/python_api/datasources/gnomad/gnomad_variants.md similarity index 100% rename from docs/python_api/datasource/gnomad/gnomad_variants.md rename to docs/python_api/datasources/gnomad/gnomad_variants.md diff --git a/docs/python_api/datasource/gwas_catalog/_gwas_catalog.md b/docs/python_api/datasources/gwas_catalog/_gwas_catalog.md similarity index 100% rename from docs/python_api/datasource/gwas_catalog/_gwas_catalog.md rename to docs/python_api/datasources/gwas_catalog/_gwas_catalog.md diff --git a/docs/python_api/datasource/gwas_catalog/associations.md b/docs/python_api/datasources/gwas_catalog/associations.md similarity index 100% rename from docs/python_api/datasource/gwas_catalog/associations.md rename to docs/python_api/datasources/gwas_catalog/associations.md diff --git a/docs/python_api/datasource/gwas_catalog/study_index.md b/docs/python_api/datasources/gwas_catalog/study_index.md similarity index 100% rename from docs/python_api/datasource/gwas_catalog/study_index.md rename to docs/python_api/datasources/gwas_catalog/study_index.md diff --git a/docs/python_api/datasource/gwas_catalog/study_splitter.md b/docs/python_api/datasources/gwas_catalog/study_splitter.md similarity index 100% rename from docs/python_api/datasource/gwas_catalog/study_splitter.md rename to docs/python_api/datasources/gwas_catalog/study_splitter.md diff --git a/docs/python_api/datasource/gwas_catalog/summary_statistics.md b/docs/python_api/datasources/gwas_catalog/summary_statistics.md similarity index 100% rename from docs/python_api/datasource/gwas_catalog/summary_statistics.md rename to docs/python_api/datasources/gwas_catalog/summary_statistics.md diff --git a/docs/python_api/datasource/intervals/_intervals.md b/docs/python_api/datasources/intervals/_intervals.md similarity index 100% rename from docs/python_api/datasource/intervals/_intervals.md rename to docs/python_api/datasources/intervals/_intervals.md diff --git a/docs/python_api/datasource/intervals/andersson.md b/docs/python_api/datasources/intervals/andersson.md similarity index 100% rename from docs/python_api/datasource/intervals/andersson.md rename to docs/python_api/datasources/intervals/andersson.md diff --git a/docs/python_api/datasource/intervals/javierre.md b/docs/python_api/datasources/intervals/javierre.md similarity index 100% rename from docs/python_api/datasource/intervals/javierre.md rename to docs/python_api/datasources/intervals/javierre.md diff --git a/docs/python_api/datasource/intervals/jung.md b/docs/python_api/datasources/intervals/jung.md similarity index 100% rename from docs/python_api/datasource/intervals/jung.md rename to docs/python_api/datasources/intervals/jung.md diff --git a/docs/python_api/datasource/intervals/thurman.md b/docs/python_api/datasources/intervals/thurman.md similarity index 100% rename from docs/python_api/datasource/intervals/thurman.md rename to docs/python_api/datasources/intervals/thurman.md diff --git a/docs/python_api/datasource/open_targets/_open_targets.md b/docs/python_api/datasources/open_targets/_open_targets.md similarity index 100% rename from docs/python_api/datasource/open_targets/_open_targets.md rename to docs/python_api/datasources/open_targets/_open_targets.md diff --git a/docs/python_api/datasource/open_targets/l2g_gold_standard.md b/docs/python_api/datasources/open_targets/l2g_gold_standard.md similarity index 100% rename from docs/python_api/datasource/open_targets/l2g_gold_standard.md rename to docs/python_api/datasources/open_targets/l2g_gold_standard.md diff --git a/docs/python_api/datasource/open_targets/target.md b/docs/python_api/datasources/open_targets/target.md similarity index 100% rename from docs/python_api/datasource/open_targets/target.md rename to docs/python_api/datasources/open_targets/target.md diff --git a/docs/python_api/method/_method.md b/docs/python_api/methods/_methods.md similarity index 100% rename from docs/python_api/method/_method.md rename to docs/python_api/methods/_methods.md diff --git a/docs/python_api/method/carma.md b/docs/python_api/methods/carma.md similarity index 100% rename from docs/python_api/method/carma.md rename to docs/python_api/methods/carma.md diff --git a/docs/python_api/method/clumping.md b/docs/python_api/methods/clumping.md similarity index 100% rename from docs/python_api/method/clumping.md rename to docs/python_api/methods/clumping.md diff --git a/docs/python_api/method/coloc.md b/docs/python_api/methods/coloc.md similarity index 100% rename from docs/python_api/method/coloc.md rename to docs/python_api/methods/coloc.md diff --git a/docs/python_api/method/ecaviar.md b/docs/python_api/methods/ecaviar.md similarity index 100% rename from docs/python_api/method/ecaviar.md rename to docs/python_api/methods/ecaviar.md diff --git a/docs/python_api/method/l2g/_l2g.md b/docs/python_api/methods/l2g/_l2g.md similarity index 100% rename from docs/python_api/method/l2g/_l2g.md rename to docs/python_api/methods/l2g/_l2g.md diff --git a/docs/python_api/method/l2g/evaluator.md b/docs/python_api/methods/l2g/evaluator.md similarity index 100% rename from docs/python_api/method/l2g/evaluator.md rename to docs/python_api/methods/l2g/evaluator.md diff --git a/docs/python_api/method/l2g/feature_factory.md b/docs/python_api/methods/l2g/feature_factory.md similarity index 100% rename from docs/python_api/method/l2g/feature_factory.md rename to docs/python_api/methods/l2g/feature_factory.md diff --git a/docs/python_api/method/l2g/model.md b/docs/python_api/methods/l2g/model.md similarity index 100% rename from docs/python_api/method/l2g/model.md rename to docs/python_api/methods/l2g/model.md diff --git a/docs/python_api/method/l2g/trainer.md b/docs/python_api/methods/l2g/trainer.md similarity index 100% rename from docs/python_api/method/l2g/trainer.md rename to docs/python_api/methods/l2g/trainer.md diff --git a/docs/python_api/method/ld_annotator.md b/docs/python_api/methods/ld_annotator.md similarity index 100% rename from docs/python_api/method/ld_annotator.md rename to docs/python_api/methods/ld_annotator.md diff --git a/docs/python_api/method/pics.md b/docs/python_api/methods/pics.md similarity index 100% rename from docs/python_api/method/pics.md rename to docs/python_api/methods/pics.md diff --git a/docs/python_api/step/_step.md b/docs/python_api/step/_step.md deleted file mode 100644 index e19fd5fcc..000000000 --- a/docs/python_api/step/_step.md +++ /dev/null @@ -1,7 +0,0 @@ ---- -title: Step ---- - -# Step - -This section provides description for the `Step` class. Each `Step` uses its own set of Methods. diff --git a/docs/python_api/steps/_steps.md b/docs/python_api/steps/_steps.md new file mode 100644 index 000000000..9c6d7679c --- /dev/null +++ b/docs/python_api/steps/_steps.md @@ -0,0 +1,7 @@ +--- +title: Step +--- + +# Step + +This section provides description for the `Step` class. Each `Step` uses its own set of Methods and Datasets and implements the logic necessary to read a set of inputs, perform the transformation and write the outputs. All steps are available through the command line interface when running the `gentropy` command. diff --git a/docs/python_api/step/colocalisation.md b/docs/python_api/steps/colocalisation.md similarity index 100% rename from docs/python_api/step/colocalisation.md rename to docs/python_api/steps/colocalisation.md diff --git a/docs/python_api/step/eqtl_catalogue.md b/docs/python_api/steps/eqtl_catalogue.md similarity index 100% rename from docs/python_api/step/eqtl_catalogue.md rename to docs/python_api/steps/eqtl_catalogue.md diff --git a/docs/python_api/step/finngen_studies.md b/docs/python_api/steps/finngen_studies.md similarity index 100% rename from docs/python_api/step/finngen_studies.md rename to docs/python_api/steps/finngen_studies.md diff --git a/docs/python_api/step/finngen_sumstat_preprocess.md b/docs/python_api/steps/finngen_sumstat_preprocess.md similarity index 100% rename from docs/python_api/step/finngen_sumstat_preprocess.md rename to docs/python_api/steps/finngen_sumstat_preprocess.md diff --git a/docs/python_api/step/gene_index.md b/docs/python_api/steps/gene_index.md similarity index 100% rename from docs/python_api/step/gene_index.md rename to docs/python_api/steps/gene_index.md diff --git a/docs/python_api/step/gwas_catalog_curation.md b/docs/python_api/steps/gwas_catalog_curation.md similarity index 100% rename from docs/python_api/step/gwas_catalog_curation.md rename to docs/python_api/steps/gwas_catalog_curation.md diff --git a/docs/python_api/step/gwas_catalog_inclusion.md b/docs/python_api/steps/gwas_catalog_inclusion.md similarity index 100% rename from docs/python_api/step/gwas_catalog_inclusion.md rename to docs/python_api/steps/gwas_catalog_inclusion.md diff --git a/docs/python_api/step/gwas_catalog_ingestion.md b/docs/python_api/steps/gwas_catalog_ingestion.md similarity index 100% rename from docs/python_api/step/gwas_catalog_ingestion.md rename to docs/python_api/steps/gwas_catalog_ingestion.md diff --git a/docs/python_api/step/gwas_catalog_sumstat_preprocess.md b/docs/python_api/steps/gwas_catalog_sumstat_preprocess.md similarity index 100% rename from docs/python_api/step/gwas_catalog_sumstat_preprocess.md rename to docs/python_api/steps/gwas_catalog_sumstat_preprocess.md diff --git a/docs/python_api/step/l2g.md b/docs/python_api/steps/l2g.md similarity index 100% rename from docs/python_api/step/l2g.md rename to docs/python_api/steps/l2g.md diff --git a/docs/python_api/step/ld_clump.md b/docs/python_api/steps/ld_clump.md similarity index 100% rename from docs/python_api/step/ld_clump.md rename to docs/python_api/steps/ld_clump.md diff --git a/docs/python_api/step/ld_index.md b/docs/python_api/steps/ld_index.md similarity index 100% rename from docs/python_api/step/ld_index.md rename to docs/python_api/steps/ld_index.md diff --git a/docs/python_api/step/pics.md b/docs/python_api/steps/pics.md similarity index 100% rename from docs/python_api/step/pics.md rename to docs/python_api/steps/pics.md diff --git a/docs/python_api/step/variant_annotation_step.md b/docs/python_api/steps/variant_annotation_step.md similarity index 100% rename from docs/python_api/step/variant_annotation_step.md rename to docs/python_api/steps/variant_annotation_step.md diff --git a/docs/python_api/step/variant_index_step.md b/docs/python_api/steps/variant_index_step.md similarity index 100% rename from docs/python_api/step/variant_index_step.md rename to docs/python_api/steps/variant_index_step.md diff --git a/docs/python_api/step/variant_to_gene_step.md b/docs/python_api/steps/variant_to_gene_step.md similarity index 100% rename from docs/python_api/step/variant_to_gene_step.md rename to docs/python_api/steps/variant_to_gene_step.md diff --git a/docs/python_api/step/window_based_clumping.md b/docs/python_api/steps/window_based_clumping.md similarity index 100% rename from docs/python_api/step/window_based_clumping.md rename to docs/python_api/steps/window_based_clumping.md diff --git a/mkdocs.yml b/mkdocs.yml index ad928ec7b..d25076a73 100644 --- a/mkdocs.yml +++ b/mkdocs.yml @@ -1,4 +1,4 @@ -site_name: Open Targets Genetics +site_name: Open Targets Gentropy nav: - Home: index.md diff --git a/pyproject.toml b/pyproject.toml index 849a8f6f7..30713de46 100644 --- a/pyproject.toml +++ b/pyproject.toml @@ -2,7 +2,7 @@ name = "gentropy" # !! version is managed by semantic_release version = "0.0.0" -description = "Open targets Genetics Portal Python ETL" +description = "Open Targets python framework for post-GWAS analysis" authors = ["Open Targets core team"] license = "Apache-2.0" readme = "README.md" diff --git a/src/gentropy/dataset/dataset.py b/src/gentropy/dataset/dataset.py index 5a282ce35..401c5d6c6 100644 --- a/src/gentropy/dataset/dataset.py +++ b/src/gentropy/dataset/dataset.py @@ -18,7 +18,7 @@ @dataclass class Dataset(ABC): - """Open Targets Genetics Dataset. + """Open Targets Gentropy Dataset. `Dataset` is a wrapper around a Spark DataFrame with a predefined schema. Schemas for each child dataset are described in the `schemas` module. """ diff --git a/src/gentropy/dataset/study_locus_overlap.py b/src/gentropy/dataset/study_locus_overlap.py index ee1e81b32..5f839bd9c 100644 --- a/src/gentropy/dataset/study_locus_overlap.py +++ b/src/gentropy/dataset/study_locus_overlap.py @@ -21,6 +21,7 @@ class StudyLocusOverlap(Dataset): This dataset captures pairs of overlapping `StudyLocus`: that is associations whose credible sets share at least one tagging variant. !!! note + This is a helpful dataset for other downstream analyses, such as colocalisation. This dataset will contain the overlapping signals between studyLocus associations once they have been clumped and fine-mapped. """ diff --git a/src/gentropy/ld_index.py b/src/gentropy/ld_index.py index dfdd90306..cb260977d 100644 --- a/src/gentropy/ld_index.py +++ b/src/gentropy/ld_index.py @@ -11,6 +11,7 @@ class LDIndexStep: """LD index step. !!! warning "This step is resource intensive" + Suggested params: high memory machine, 5TB of boot disk, no SSDs. """ diff --git a/src/gentropy/method/colocalisation.py b/src/gentropy/method/colocalisation.py index b0699b88f..fd56398f0 100644 --- a/src/gentropy/method/colocalisation.py +++ b/src/gentropy/method/colocalisation.py @@ -100,6 +100,7 @@ class Coloc: | H4 | both traits are associated and share the same single causal variant | !!! warning "Bayes factors required" + Coloc requires the availability of Bayes factors (BF) for each variant in the credible set (`logBF` column). """ diff --git a/src/gentropy/method/pics.py b/src/gentropy/method/pics.py index bcf17280a..e5ed5f2c6 100644 --- a/src/gentropy/method/pics.py +++ b/src/gentropy/method/pics.py @@ -185,6 +185,7 @@ def finemap( """Run PICS on a study locus. !!! info "Study locus needs to be LD annotated" + The study locus needs to be LD annotated before PICS can be calculated. Args: From 2d8b08bf6671210618f4cb027c2e805210bca795 Mon Sep 17 00:00:00 2001 From: Daniel Suveges
diff --git a/docs/howto/run_step_in_cli.md b/docs/howto/run_step_in_cli.md
index 07ede526d..965c7e079 100644
--- a/docs/howto/run_step_in_cli.md
+++ b/docs/howto/run_step_in_cli.md
@@ -41,4 +41,4 @@ In most occassions, some mandatory values will be required to run the step. For
gentropy step=gene_index step.target_path=/path/to/target step.gene_index_path=/path/to/gene_index
```
-You can find more about the available steps in the [documentation](../python_api/step/_step.md).
+You can find more about the available steps in the [documentation](../python_api/steps/_steps.md).
diff --git a/docs/index.md b/docs/index.md
index 31762c17e..26070ab3d 100644
--- a/docs/index.md
+++ b/docs/index.md
@@ -21,7 +21,7 @@ hide:
[![image](https://github.com/opentargets/gentropy/actions/workflows/release.yaml/badge.svg)](https://opentargets.github.io/gentropy/)
[![codecov](https://codecov.io/gh/opentargets/gentropy/branch/main/graph/badge.svg?token=5ixzgu8KFP)](https://codecov.io/gh/opentargets/gentropy)
[![License](https://img.shields.io/badge/License-Apache_2.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
-[![pre-commit.ci status](https://results.pre-commit.ci/badge/github/opentargets/gentropy/main.svg)](https://results.pre-commit.ci/badge/github/opentargets/gentropy)
+[![DOI](https://zenodo.org/badge/DOI/10.5281/zenodo.10527086.svg)](https://doi.org/10.5281/zenodo.10527086)
---
diff --git a/docs/python_api/_python_api.md b/docs/python_api/_python_api.md
index 02c1941af..9952aa56f 100644
--- a/docs/python_api/_python_api.md
+++ b/docs/python_api/_python_api.md
@@ -4,7 +4,7 @@ title: Python API
Open Targets Gentropy is a Python package to facilitate the interpretation and analysis of GWAS and functional genomic studies for target identification. The package contains a toolkit for the harmonisation, statistical analysis and prioritisation of genetic signals to assist drug discovery.
-The Overall architecture of the package distinguishes between:
+The overall architecture of the package distinguishes between:
- [**Data Sources**](datasources/_datasources.md): data sources harmonisation tools
- [**Datasets**](datasets/_datasets.md): data model
diff --git a/docs/python_api/datasources/_datasources.md b/docs/python_api/datasources/_datasources.md
index c06bef177..036dbcb37 100644
--- a/docs/python_api/datasources/_datasources.md
+++ b/docs/python_api/datasources/_datasources.md
@@ -8,7 +8,7 @@ This section contains information about the data source harmonisation tools avai
## GWAS study sources
-1. [GWAS catalog](gwas_catalog/_gwas_catalog.md) (with or without full summary statistics)
+1. [GWAS Catalog](gwas_catalog/_gwas_catalog.md) (with or without full summary statistics)
1. [FinnGen](finngen/_finngen.md)
## Molecular QTLs
diff --git a/docs/python_api/datasources/finngen/summary_stats.md b/docs/python_api/datasources/finngen/summary_stats.md
new file mode 100644
index 000000000..ba6f6c263
--- /dev/null
+++ b/docs/python_api/datasources/finngen/summary_stats.md
@@ -0,0 +1,5 @@
+---
+title: Study Index
+---
+
+::: gentropy.datasource.finngen.summary_stats.FinnGenSummaryStats
diff --git a/docs/python_api/steps/colocalisation.md b/docs/python_api/steps/colocalisation.md
index 17a0dfd9a..76ecf7150 100644
--- a/docs/python_api/steps/colocalisation.md
+++ b/docs/python_api/steps/colocalisation.md
@@ -1,5 +1,5 @@
---
-title: Colocalisation
+title: colocalisation
---
::: gentropy.colocalisation.ColocalisationStep
diff --git a/docs/python_api/steps/eqtl_catalogue.md b/docs/python_api/steps/eqtl_catalogue.md
index 32f2aa257..17ea6a6f6 100644
--- a/docs/python_api/steps/eqtl_catalogue.md
+++ b/docs/python_api/steps/eqtl_catalogue.md
@@ -1,5 +1,5 @@
---
-title: eQTL Catalogue
+title: eqtl_catalogue
---
::: gentropy.eqtl_catalogue.EqtlCatalogueStep
diff --git a/docs/python_api/steps/finngen_studies.md b/docs/python_api/steps/finngen_studies.md
index 09da6f7c8..1ec4394f4 100644
--- a/docs/python_api/steps/finngen_studies.md
+++ b/docs/python_api/steps/finngen_studies.md
@@ -1,5 +1,5 @@
---
-title: FinnGen Studies
+title: finngen_studies
---
::: gentropy.finngen_studies.FinnGenStudiesStep
diff --git a/docs/python_api/steps/finngen_sumstat_preprocess.md b/docs/python_api/steps/finngen_sumstat_preprocess.md
index 17b44b95e..57f27658e 100644
--- a/docs/python_api/steps/finngen_sumstat_preprocess.md
+++ b/docs/python_api/steps/finngen_sumstat_preprocess.md
@@ -1,5 +1,5 @@
---
-title: FinnGen Preprocess Summary Stats
+title: finngen_sumstat_preprocess
---
::: gentropy.finngen_sumstat_preprocess.FinnGenSumstatPreprocessStep
diff --git a/docs/python_api/steps/gene_index.md b/docs/python_api/steps/gene_index.md
index bae77b5aa..a0808dcad 100644
--- a/docs/python_api/steps/gene_index.md
+++ b/docs/python_api/steps/gene_index.md
@@ -1,5 +1,5 @@
---
-title: Gene Index
+title: gene_index
---
::: gentropy.gene_index.GeneIndexStep
diff --git a/docs/python_api/steps/gwas_catalog_curation.md b/docs/python_api/steps/gwas_catalog_curation.md
index 51aa72970..8512e39fc 100644
--- a/docs/python_api/steps/gwas_catalog_curation.md
+++ b/docs/python_api/steps/gwas_catalog_curation.md
@@ -1,5 +1,5 @@
---
-title: Apply in-house curation on GWAS Catalog studies
+title: gwas_catalog_study_curation
---
::: gentropy.gwas_catalog_study_curation.GWASCatalogStudyCurationStep
diff --git a/docs/python_api/steps/gwas_catalog_inclusion.md b/docs/python_api/steps/gwas_catalog_inclusion.md
index 43abbb171..0a8c33b30 100644
--- a/docs/python_api/steps/gwas_catalog_inclusion.md
+++ b/docs/python_api/steps/gwas_catalog_inclusion.md
@@ -1,5 +1,5 @@
---
-title: Generate inclusion and exclusions lists for GWAS Catalog study ingestion.
+title: gwas_catalog_study_curation
---
::: gentropy.gwas_catalog_study_inclusion.GWASCatalogStudyInclusionGenerator
diff --git a/docs/python_api/steps/gwas_catalog_ingestion.md b/docs/python_api/steps/gwas_catalog_ingestion.md
index 17df73fec..69ea92479 100644
--- a/docs/python_api/steps/gwas_catalog_ingestion.md
+++ b/docs/python_api/steps/gwas_catalog_ingestion.md
@@ -1,5 +1,5 @@
---
-title: GWAS Catalog
+title: gwas_catalog_ingestion
---
::: gentropy.gwas_catalog_ingestion.GWASCatalogIngestionStep
diff --git a/docs/python_api/steps/gwas_catalog_sumstat_preprocess.md b/docs/python_api/steps/gwas_catalog_sumstat_preprocess.md
index 39c64a882..3b0422050 100644
--- a/docs/python_api/steps/gwas_catalog_sumstat_preprocess.md
+++ b/docs/python_api/steps/gwas_catalog_sumstat_preprocess.md
@@ -1,5 +1,5 @@
---
-title: GWAS Catalog sumstat preprocess
+title: gwas_catalog_sumstat_preprocess
---
::: gentropy.gwas_catalog_sumstat_preprocess.GWASCatalogSumstatsPreprocessStep
diff --git a/docs/python_api/steps/l2g.md b/docs/python_api/steps/l2g.md
index d2b9e290e..847569e36 100644
--- a/docs/python_api/steps/l2g.md
+++ b/docs/python_api/steps/l2g.md
@@ -1,5 +1,5 @@
---
-title: Locus-to-gene (L2G)
+title: locus_to_gene
---
::: gentropy.l2g.LocusToGeneStep
diff --git a/docs/python_api/steps/ld_clump.md b/docs/python_api/steps/ld_clump.md
index 00fc8f2e4..fea44f807 100644
--- a/docs/python_api/steps/ld_clump.md
+++ b/docs/python_api/steps/ld_clump.md
@@ -1,5 +1,5 @@
---
-title: LD-based clumping
+title: ld_based_clumping
---
::: gentropy.ld_based_clumping.LDBasedClumpingStep
diff --git a/docs/python_api/steps/ld_index.md b/docs/python_api/steps/ld_index.md
index eba826266..bf8b9b58e 100644
--- a/docs/python_api/steps/ld_index.md
+++ b/docs/python_api/steps/ld_index.md
@@ -1,5 +1,5 @@
---
-title: LD Index
+title: ld_index
---
::: gentropy.ld_index.LDIndexStep
diff --git a/docs/python_api/steps/pics.md b/docs/python_api/steps/pics.md
index 5654489f6..aacc6fbaf 100644
--- a/docs/python_api/steps/pics.md
+++ b/docs/python_api/steps/pics.md
@@ -1,5 +1,5 @@
---
-title: PICS
+title: pics
---
::: gentropy.pics.PICSStep
diff --git a/docs/python_api/steps/variant_annotation_step.md b/docs/python_api/steps/variant_annotation_step.md
index e8d7c2c3f..e65a071b2 100644
--- a/docs/python_api/steps/variant_annotation_step.md
+++ b/docs/python_api/steps/variant_annotation_step.md
@@ -1,5 +1,5 @@
---
-title: Variant Annotation
+title: variant_annotation
---
::: gentropy.variant_annotation.VariantAnnotationStep
diff --git a/docs/python_api/steps/variant_index_step.md b/docs/python_api/steps/variant_index_step.md
index e38fd8206..8a36f097f 100644
--- a/docs/python_api/steps/variant_index_step.md
+++ b/docs/python_api/steps/variant_index_step.md
@@ -1,5 +1,5 @@
---
-title: Variant Index
+title: variant_index
---
::: gentropy.variant_index.VariantIndexStep
diff --git a/docs/python_api/steps/variant_to_gene_step.md b/docs/python_api/steps/variant_to_gene_step.md
index 16db10e8e..1a3e56af8 100644
--- a/docs/python_api/steps/variant_to_gene_step.md
+++ b/docs/python_api/steps/variant_to_gene_step.md
@@ -1,5 +1,5 @@
---
-title: Variant-to-gene
+title: variant_to_gene
---
::: gentropy.v2g.V2GStep
diff --git a/docs/python_api/steps/window_based_clumping.md b/docs/python_api/steps/window_based_clumping.md
index bbcd2c0d8..f33057c71 100644
--- a/docs/python_api/steps/window_based_clumping.md
+++ b/docs/python_api/steps/window_based_clumping.md
@@ -1,5 +1,5 @@
---
-title: Window-based clumping
+title: window_based_clumping
---
::: gentropy.window_based_clumping.WindowBasedClumpingStep
From 1e0912f94aaa634930a37afdd57ff319984df1ff Mon Sep 17 00:00:00 2001
From: David Ochoa
-
+