We recommend using `conda` to create a virtual environment. If you want to (re)train the models, your system needs the CUDA dependencies; please use the `environment.yaml` file for the installation:
```bash
conda env create -f environment.yaml
conda activate molbind
```
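Since (re)training requires CUDA, you can check that the GPU is visible from the freshly created environment. This is a minimal sketch assuming PyTorch is installed via `environment.yaml`:

```python
import torch

if torch.cuda.is_available():
    # Report the first visible GPU, e.g. "NVIDIA A100-SXM4-40GB"
    print(torch.cuda.get_device_name(0))
else:
    print("No CUDA device found; (re)training will not work on this machine.")
```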
The simulated spectra were compiled from IBM's Multimodal Spectroscopic Dataset. (WIP 🏗️) Run `molbind-get-datasets` from the command line to download the data.
Your `.env` file should look like this:

```bash
WANDB_PROJECT="<your-wandb-project-name>"
WANDB_ENTITY="<your-wandb-account-name>"
TOKENIZERS_PARALLELISM=False
```
After you have defined your system variables in `.env`, the file is read into the script as follows:

```python
from dotenv import load_dotenv

# Load the environment variables defined in your .env file
load_dotenv("path/to/.env")
```
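Once loaded, the values are exposed as ordinary environment variables. For example, to check that they were picked up (variable names taken from the `.env` template above):

```python
import os

# After load_dotenv(), the .env values are available via the standard os interface
wandb_project = os.getenv("WANDB_PROJECT")
wandb_entity = os.getenv("WANDB_ENTITY")
print(wandb_project, wandb_entity)
```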
The experiment configs can be found in `config`. For example, to run `train.py` with the `train/ir_simulated_large_dataset` experiment:

```bash
python train.py 'experiment="train/ir_simulated_large_dataset"'
```
The training script writes checkpoints to `experiments/checkpoints/<run-code-name>/<checkpoint-file-name>.ckpt`.
To find all three checkpoints used in this work, please access the supplementary information on Zenodo.
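To take a quick look at a downloaded checkpoint, you can inspect it with plain PyTorch. This is a minimal sketch assuming the `.ckpt` files are standard PyTorch Lightning checkpoints; the path below is a placeholder:

```python
import torch

# Placeholder path; substitute the checkpoint you downloaded or trained
ckpt = torch.load(
    "experiments/checkpoints/<run-code-name>/<checkpoint-file-name>.ckpt",
    map_location="cpu",
)
# Lightning checkpoints are dictionaries holding the weights and training metadata
print(ckpt.keys())
```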
To run the metrics on these experiments:
```bash
python retrieval.py 'experiment="metrics/ir_simulated_large_dataset"'
```
The training script was run on 4 NVIDIA A100-40GB GPUs; the retrieval script was run on a single NVIDIA A100-40GB GPU.
This work was funded by the Carl-Zeiss Foundation. In addition, this work was partly funded by the SOL-AI project funded as part of the Helmholtz Foundation Model Initiative of the Helmholtz Association. Moreover, this work was supported by Helmholtz AI computing resources (HAICORE) of the Helmholtz Association’s Initiative and Networking Fund through Helmholtz AI.