Exposome Data Challenge 2021

NOTE for selected talks: Read this file to see how to upload your R code (or the code used in any other language: Python, Matlab, ...).

Exposome Data Challenge 2021

The exposome, described as "the totality of human environmental exposures from conception onwards", recognizes that individuals are exposed simultaneously to a multitude of different environmental factors and takes a holistic approach to the discovery of etiological factors for disease. The exposome’s main advantage over traditional ‘one-exposure-one-disease’ study approaches is that it provides an unprecedented conceptual framework for the study of multiple environmental hazards (urban, chemical, lifestyle, social) and their combined effects.

The objective of this event (described here) is to promote innovative statistical, data science, or other quantitative approaches to studying the health effects of complex high-throughput measurement of exposure indicators (exposome). Detailed challenge examples are given on this link.

These are the availalbe datasets to propose data analyses to address any challenge:

Exposome data (n=1301): Rdata file without missings and with missings containing three objects:
- 1 object for exposures: exposome
- 1 object for covariates: covariates
- 1 object for outcomes: phenotype

The three tables can be linked using ID variable. See the codebook for variable description (variable name, domain, type of variable, transformation, ...)

omic data: Exposome and omic data can be linked using ID variable.
- Proteome: ExpressionSet called metabol_serum of 1170 individuals and 39 proteins (log-transformed) that are annotated in the ExpressionSet object (use fData(proteome) after loading Biobase Bioconductor package).
- Serum Metabolome: ExpressionSet called metabol_serum of 1198 individuals and 177 metabolites (log-transformed) (see here for a descripton).
- Urine Metabolome: ExpressionSet called metabol_urine of 1192 individuals and 44 metabolites (see here for a descripton).
- Gene expression: ExpressionSet called genexpr (see here what an ExpressionSet is) of 1007 individuals and 28,738 transcripts with annotated gene symbols.
- Methylation: GenomicRatioSet called methy (see here what a GenomicRatioSet is) of 918 individuals and 386,518 CpGs

The variables that are available in the metadata are:

ID: identification number

e3_sex: gender (male, female)

age_sample_years: age (in years)

h_ethnicity_cauc: caucasic? (yes, no)

ethn_PC1: first PCA to address population stratification

ethn_PC2: second PCA to address population stratification

Cell-type estimates (only for methylation): NK_6, Bcell_6, CD4T_6, CD8T_6, Gran_6, Mono_6

Name		Name	Last commit message	Last commit date
Latest commit History 100 Commits
R_Code_Presentations		R_Code_Presentations
.gitignore		.gitignore
ExposomeDataChallenge2021.Rproj		ExposomeDataChallenge2021.Rproj
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Exposome Data Challenge 2021

About

Releases

Packages

Languages

License

johnlpearce/ExposomeDataChallenge2021

Folders and files

Latest commit

History

Repository files navigation

Exposome Data Challenge 2021

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages