
Benchmarking the translational potential of spatial gene expression prediction from histology

This benchmarking pipeline provides a comprehensive evaluation of methods that predict spot-based spatial gene expression from histology images.

We employed a hierarchy of evaluation categories:

(1) within-image spatial gene expression (SGE) prediction performance for lower-resolution spatial transcriptomics (ST) and higher-resolution 10x Visium data;
(2) cross-study model generalisability, evaluated by applying models trained on ST data to Visium tissues and to TCGA images, the latter to assess whether the models are useful for prediction on existing H&E images;
(3) clinical translational impact through the prediction of survival outcomes and canonical pathological regions using predicted SGE from TCGA;
(4) usability of the methods, encompassing code, documentation and the manuscript;
(5) computational efficiency.

Processed Data

The processed datasets required for reproduction are available on Zenodo and can be accessed via this DOI link:
https://doi.org/10.5281/zenodo.14602489
Please download and store them in the appropriate directories as required by the scripts.

Reproduction Steps

We provide the code to reproduce the evaluation results and figures from our work. Please follow the order of the .Rmd files to process your raw prediction data and obtain the results:

00-CombineDat.Rmd contains an example dataset and the code used to calculate several evaluation metrics between the predicted SGE and the ground truth.
01-BenchmarkUsability.Rmd contains the code used to generate the usability plot for each method.
02-BenchmarkPredictedExprs.Rmd contains the code used to generate the ST and 10x Visium SGE metrics.
03-BenchmarkTCGA.Rmd contains the code used to perform survival analysis using TCGA data.
04-BenchmarkRanks.Rmd contains the code used to rank each method based on six categories: ST SGE prediction, Visium SGE prediction, model generalisability, clinical impact, usability, and efficiency. The rankings are visualized using a funky heatmap.
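
To make the metric and ranking steps above concrete, here is a minimal sketch in Python. It is not the repository's code, which lives in the .Rmd files. The choice of per-gene Pearson correlation as the metric, the mean-based ranking, and all names (`pearson`, `per_gene_correlation`, `rank_methods`, `method_1`, `GENE_A`, and so on) are assumptions made for illustration, and the toy values are made up.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return float("nan")  # correlation is undefined for a constant vector
    return sxy / math.sqrt(sxx * syy)


def per_gene_correlation(pred, truth):
    """Correlate predicted and measured expression gene by gene.

    Both arguments map a gene name to a list of per-spot values,
    with spots in the same order in both.
    """
    return {g: pearson(pred[g], truth[g]) for g in truth if g in pred}


def rank_methods(scores):
    """Rank methods by score, higher is better; rank 1 is the best."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return {method: i + 1 for i, method in enumerate(ordered)}


# Toy ground truth and predictions (hypothetical values, 4 spots, 2 genes).
truth = {"GENE_A": [1.0, 2.0, 3.0, 4.0], "GENE_B": [4.0, 3.0, 2.0, 1.0]}
preds = {
    "method_1": {"GENE_A": [1.1, 1.9, 3.2, 3.8], "GENE_B": [3.9, 3.1, 2.2, 0.8]},
    "method_2": {"GENE_A": [2.0, 1.0, 4.0, 3.0], "GENE_B": [1.0, 2.0, 3.0, 4.0]},
}

# Summarise each method by its mean per-gene correlation, then rank.
mean_corr = {}
for method, pred in preds.items():
    per_gene = per_gene_correlation(pred, truth)
    mean_corr[method] = sum(per_gene.values()) / len(per_gene)

ranks = rank_methods(mean_corr)
print(mean_corr)
print(ranks)
```

In this sketch a constant gene yields NaN, which would propagate into the mean; a real analysis would need to decide how to treat such genes. The benchmark itself ranks methods across six categories rather than on a single score.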

Reference

If you have any questions, particularly regarding data processing, please contact [email protected]. We welcome any suggestions and comments.

Wang, C., & Chan, A. (2025). Benchmarking the translational potential of spatial gene expression prediction from histology (3.0). Zenodo. https://doi.org/10.5281/zenodo.14602489

Repository: SydneyBioX/HEtoSGEBench