-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Privacy preserving prediction of molecular properties #90
Comments
Hello janweinreich, Thank you for your bounty proposition! Our team will review and add comments in your issue! In the meantime:
Talk soon, |
Good news @janweinreich, this bounty is accepted! |
thank you getting started right away! |
Work is in full progress! However, we wanted to notify you that we are considering pivoting to a different target: For instance, how much of a substance is absorbed in the liver? The reasons we want to change the dataset are:
All other points of the proposal would remain unchanged! |
It's ok with your updates, could you update the main issue to reflect this changes? |
thank you, I made the minor changes to the main post of this issue. Most of the code is there we just need to clean and document it well. Will contact you as soon as this is done! |
Hey Jan Regarding the HF space: in general, it’s already very nice, very clear, very straight to the point, it’s an excellent bounty!
Regarding https://github.com/vaultchem/molvault:
Once again: very nice work that you’ve done, I can’t wait we can let our marketing publish about it! |
Thanks for your feedback!
|
Thanks!
|
Sure I can see if these molecules are in the test set and add the reference values. No problem, rerunning the models as I write with new version of concrete-ml. Timings will be updated accordingly |
Thank you for your patience @bcm-at-zama ! Fixes
(script for timing test,
Questions
The idea was to publish the repo of the bounty under CC-BY. We want to make sure to comply with Zamas' policy, including future developments that may lead to commercial use.
Thank you! |
|
Category:
Application
Overview
We at VaultChem, a startup combining encryption and chemistry, aim to use Zamas
Concrete ML
library for FHE inference of molecular properties. Consider a scenario where pharma companyA
is interested in predicting properties of candidate molecules, before phase I clinical trials.In particular, understanding the processes of absorption, distribution, metabolism, and excretion (ADME) is crucial for determining a drug candidate's concentration profile at its action site, significantly impacting the drug's effectiveness.
However,
A
does not have sufficient data available for reliable ML predictions. Instead,A
will securely obtain predictions on molecular data from an untrusted partyB
that owns a secret database and an ML model with sufficient training data. This is only possible using FHE to guarantee partyA
will not reveal the secret query to partyB
.We will simulate this scenario using open-source chemistry datasets. We will provide tools (based on cheminformatics
rdkit
andconcrete-ml
) to give an end-to-end solution to the problem of privacy-preserving prediction. We will deploy the app to hugging face (similar to the FHE image filter) and provide detailed tutorials/notebooks that explain each step. Finally, in comparing againstsklearn
implementations, we will also investigate the accuracy versus computational cost trade-off as computational screening in cheminformatics may require fast predictions on thousands of molecules. We provide an outlook on how to account for increased computational costs due to FHE inference in the case of molecular data.Total Reward: 3500 € (split by milestones)
Description
concrete ml
in the field of chemistry and pharmaceutical data as well as making a demo available on Huggingface.Milestones
RdKit
for use with ML.References
The text was updated successfully, but these errors were encountered: