Computational Prediction of De-Esterification by Human Carboxylesterases 1 and 2

Authors: Rüthemann Peter, André Fischer

Description

Computational prediction of de-esterification by human carboxylesterases 1 and 2 using Random Forests (RFs) trained on a literature-derived dataset.

This repository contains all structures used, as SDF or MAE files, together with the data set used for training. Predictions for the training set and for the two external test sets, EXT_A and EXT_B, are also provided.

Two scripts, Train.py and Prediction.py, generate the Random Forest (RF) model and reproduce the results in Table 1.

Dataset:
    - input/datasets.csv:       Data sets with descriptor data and labels (y_true)

Random forest model:
    - input/features.csv:       Selected features used for training
    - input/parameters.yaml:    Hyperparameters required to initialise the sklearn Random Forest
    - model/RF_classifier.pkl:  Random forest model stored as pickle file

Result files:
    - prediction/TRAIN.csv:     Dataset with predictions for training set
    - prediction/EXT_A.csv:     Dataset with predictions for external test set A
    - prediction/EXT_B.csv:     Dataset with predictions for external test set B
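
Before running either script, it can help to confirm that the files listed above are present. The following is a minimal sketch and not part of the repository: the helper name `missing_files` is hypothetical, and the paths are copied from the lists above. It assumes the script is run from the repository root.

```python
from pathlib import Path

# Paths copied from the file lists in this README.
EXPECTED_FILES = [
    "input/datasets.csv",
    "input/features.csv",
    "input/parameters.yaml",
    "model/RF_classifier.pkl",
    "prediction/TRAIN.csv",
    "prediction/EXT_A.csv",
    "prediction/EXT_B.csv",
]


def missing_files(repo_root="."):
    """Return the expected files that are not found under repo_root."""
    root = Path(repo_root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


# Report anything missing relative to the current working directory.
print("Missing files:", missing_files("."))
```

An empty list means every file named in this README was found.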

Setup

Install conda

Create the conda environment from environment.yml
conda env create -f environment.yml

Training

Activate environment
conda activate ester-prediction

Train the model on the training set
python3 Train.py
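
The internals of Train.py are not described here, so the following is only a sketch of the general workflow the file list suggests: initialise an sklearn Random Forest from a set of hyperparameters, fit it, and store it as a pickle file. The hyperparameter values and the synthetic data are placeholders chosen for illustration; the real values are in input/parameters.yaml and the real descriptors in input/datasets.csv.

```python
import pickle
import tempfile
from pathlib import Path

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

# Placeholder hyperparameters (assumption); the real ones are in input/parameters.yaml.
params = {"n_estimators": 50, "max_depth": 5, "random_state": 0}

# Synthetic stand-in for the descriptor matrix and the y_true labels.
X, y = make_classification(n_samples=200, n_features=10, random_state=0)

# Initialise the classifier from the hyperparameter dictionary and fit it.
model = RandomForestClassifier(**params)
model.fit(X, y)

# Store the fitted model as a pickle file, then load it back.
with tempfile.TemporaryDirectory() as tmp:
    model_path = Path(tmp) / "RF_classifier.pkl"
    with open(model_path, "wb") as fh:
        pickle.dump(model, fh)
    with open(model_path, "rb") as fh:
        restored = pickle.load(fh)

# The restored model should reproduce the original predictions.
same = bool((restored.predict(X) == model.predict(X)).all())
print("Round trip preserved predictions:", same)
```

Pickle files should only be loaded from trusted sources, and a pickled sklearn model is generally expected to be loaded with the same sklearn version that wrote it, which is one reason to use the provided conda environment.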

Prediction

Activate environment
conda activate ester-prediction

Run predictions for the training set and the two external test sets
python3 Prediction.py
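
Once prediction files exist, classification metrics can be recomputed from them. The sketch below uses only the standard library and makes several assumptions: labels are binary and encoded as 0/1, the true labels are in a column named `y_true` (as mentioned in the dataset description), and the predictions are in a column named `y_pred`, which is a hypothetical name. Which metrics Table 1 reports is not stated here; accuracy and the Matthews correlation coefficient (MCC) are shown as common choices.

```python
import csv
import io
import math


def confusion_counts(rows, true_col="y_true", pred_col="y_pred"):
    """Count TP, TN, FP, FN for binary labels encoded as 0/1."""
    tp = tn = fp = fn = 0
    for row in rows:
        truth, pred = int(row[true_col]), int(row[pred_col])
        if (truth, pred) == (1, 1):
            tp += 1
        elif (truth, pred) == (0, 0):
            tn += 1
        elif (truth, pred) == (0, 1):
            fp += 1
        elif (truth, pred) == (1, 0):
            fn += 1
        else:
            raise ValueError(f"Labels must be 0 or 1, got {truth}, {pred}")
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of correctly classified rows."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient; 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


# Tiny in-memory stand-in for a file such as prediction/EXT_A.csv.
example_csv = io.StringIO("y_true,y_pred\n1,1\n1,1\n1,0\n0,0\n0,0\n0,1\n")
counts = confusion_counts(csv.DictReader(example_csv))
print("TP, TN, FP, FN:", counts)
print("accuracy:", round(accuracy(*counts), 3))
print("MCC:", round(mcc(*counts), 3))
```

To apply this to a real file, open it with `open(path, newline="")`, pass `csv.DictReader(fh)` to `confusion_counts`, and set `pred_col` to the actual name of the prediction column.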

Repository: lillgroup/ester-prediction
