Source code for https://arxiv.org/abs/2005.11248 "Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics"

Last update: Nov 15, 2022

Overview

Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics

This work was published in Nature Biomedical Engineering on March 11, 2021.

URL : https://www.nature.com/articles/s41551-021-00689-x

De novo therapeutic design is challenged by a vast chemical repertoire and multiple constraints, e.g., high broad-spectrum potency and low toxicity. This project proposes CLaSS (Controlled Latent attribute Space Sampling) - an efficient computational method for attribute-controlled generation of molecules, which leverages guidance from classifiers trained on an informative latent space of molecules modeled using a deep generative autoencoder. We screen the generated molecules for additional key attributes by using deep learning classifiers in conjunction with novel features derived from atomistic simulations.
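The classifier-guided sampling idea can be illustrated with a toy rejection sampler. This is a minimal sketch under stated assumptions, not the repository's implementation: the prior is a standard Gaussian, the classifier toy_classifier is a hypothetical logistic function of the first latent coordinate, and multiple attributes are assumed to be conditionally independent given z, so that their probabilities multiply. A latent point z drawn from the prior is accepted with probability equal to the product of the classifier outputs, which yields samples from a density proportional to p(z) times the attribute probabilities.

```python
import math
import random


def toy_classifier(z):
    """Hypothetical attribute classifier: P(attribute = 1 | z).

    A logistic function of the first latent coordinate stands in for a
    classifier trained on the autoencoder's latent space.
    """
    return 1.0 / (1.0 + math.exp(-4.0 * z[0]))


def sample_prior(dim, rng):
    """Draw one latent vector from a standard Gaussian (assumed prior)."""
    return [rng.gauss(0.0, 1.0) for _ in range(dim)]


def rejection_sample(n_samples, dim, classifiers, rng, max_tries=100_000):
    """Keep latent vectors with probability prod_i P(a_i = 1 | z)."""
    accepted = []
    tries = 0
    while len(accepted) < n_samples and tries < max_tries:
        tries += 1
        z = sample_prior(dim, rng)
        accept_prob = 1.0
        for clf in classifiers:
            accept_prob *= clf(z)  # each factor lies in [0, 1]
        if rng.random() < accept_prob:
            accepted.append(z)
    return accepted


rng = random.Random(0)
samples = rejection_sample(200, 4, [toy_classifier], rng)
# Accepted samples are shifted toward the region the classifier favors.
mean_z0 = sum(z[0] for z in samples) / len(samples)
```

In a full pipeline, each accepted latent vector would then be passed through the autoencoder's decoder to obtain a candidate sequence; that step is omitted here.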

Setup

amp_gen.yml lists the required dependencies for the project.
Use it to create a conda environment for running this project. Command: conda-env create -f amp_gen.yml

Usage

Phase 1: Autoencoder (VAE/WAE) Training

Run ./run.sh to train with the default config from cfg.py. Because cfg.runname=default, the output goes to output/default and tb/default.
Run python main.py --tiny 1 for a fast test with the default config.
Alternatively, run the individual scripts explicitly:
- python main.py --phase 1
- python static_eval.py --config_json output/dir/config_overrides.json
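As a rough illustration of how a config_overrides.json file could relate to the defaults and to the run-specific output directories, here is a minimal sketch. It is an assumption about the general pattern, not the repository's code: apply_overrides is a hypothetical helper, and every key except runname is invented for the example. The repository's actual handling in cfg.py may differ.

```python
import json


def apply_overrides(defaults, overrides):
    """Return a new config with `overrides` merged recursively into `defaults`."""
    merged = dict(defaults)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = apply_overrides(merged[key], value)
        else:
            merged[key] = value
    return merged


# Hypothetical defaults; only `runname` is taken from the README.
defaults = {"runname": "default", "phase": 1, "model": {"kind": "wae", "z_dim": 100}}

# Stand-in for the contents of a config_overrides.json file.
overrides = json.loads('{"runname": "tiny_test", "model": {"z_dim": 16}}')

cfg = apply_overrides(defaults, overrides)

# Outputs are grouped by run name, mirroring output/default and tb/default.
output_dir = f"output/{cfg['runname']}"
tb_dir = f"tb/{cfg['runname']}"
```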

Phase 2: CLaSS (Controlled Latent attribute Space Sampling)

python sample_pipeline.py --config_json output/default/config_overrides.json --samples_outfn_prefix samples --Q_select_amppos 0

Data:

The data_processing/data directory contains short versions of the data files required by the data curation script data_processing/create_datasets.py.
For the full datasets, download the publicly available data files from the following links:
UNIPROT: [https://www.uniprot.org/uniprot/?query=reviewed:yes] and [https://www.uniprot.org/uniprot/?query=reviewed:no]
SATPDB: [http://crdd.osdd.net/raghava/satpdb/]
DBAASP: [https://dbaasp.org]
AMPEP: [https://cbbio.cis.um.edu.mo/software/AmPEP/]
ToxinPred: [https://webs.iiitd.edu.in/raghava/toxinpred/dataset.php]

Related Visualization Tools

Peptide Walker : https://peptide-walk.mybluemix.net
CogMol Drug Exploration: https://covid19-mol.mybluemix.net

Citations

Please cite the following articles:

@article{das2020accelerating,
  title={Accelerating Antimicrobial Discovery with Controllable Deep Generative Models and Molecular Dynamics},
  author={Das, Payel and Sercu, Tom and Wadhawan, Kahini and Padhi, Inkit and Gehrmann, Sebastian and Cipcigan, Flaviu and Chenthamarakshan, Vijil and Strobelt, Hendrik and Santos, Cicero dos and Chen, Pin-Yu and others},
  journal={arXiv preprint arXiv:2005.11248},
  year={2020}
}

@article{chenthamarakshan2020cogmol,
  title={CogMol: Target-specific and selective drug design for COVID-19 using deep generative models},
  author={Chenthamarakshan, Vijil and Das, Payel and Hoffman, Samuel C and Strobelt, Hendrik and Padhi, Inkit and Lim, KW and others},
  journal={arXiv preprint arXiv:2004.01215},
  year={2020}
}


Owner

International Business Machines
