Official implementation of "Generating 3D Molecules for Target Protein Binding"

Last update: Dec 07, 2022

Related tags

Overview

Generating 3D Molecules for Target Protein Binding

This is the official implementation of the GraphBP method proposed in the following paper.

Meng Liu, Youzhi Luo, Kanji Uchino, Koji Maruhashi, and Shuiwang Ji. "Generating 3D Molecules for Target Protein Binding".

Requirements

We include key dependencies below. The versions we used are in the parentheses. Our detailed environmental setup is available in environment.yml.

PyTorch (1.9.0)
PyTorch Geometric (1.7.2)
rdkit-pypi (2021.9.3)
biopython (1.79)
openbabel (3.3.1)

Preparing Data

Download and extract the CrossDocked2020 dataset:

wget https://bits.csb.pitt.edu/files/crossdock2020/CrossDocked2020_v1.1.tgz -P data/crossdock2020/
tar -C data/crossdock2020/ -xzf data/crossdock2020/CrossDocked2020_v1.1.tgz
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_train0_fixed.types -P data/crossdock2020/
wget https://bits.csb.pitt.edu/files/it2_tt_0_lowrmsd_mols_test0_fixed.types -P data/crossdock2020/

Note: (1) The unzipping process could take a lot of time. Unzipping on SSD is much faster!!! (2) Several samples in the training set cannot be processed by our code. Hence, we recommend replacing the it2_tt_0_lowrmsd_mols_train0_fixed.types file with a new one, where these samples are deleted. The new one is available here.

Split data files:

python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_train0_fixed.types data/crossdock2020
python scripts/split_sdf.py data/crossdock2020/it2_tt_0_lowrmsd_mols_test0_fixed.types data/crossdock2020

Run

Train GraphBP from scratch:

CUDA_VISIBLE_DEVICES=${you_gpu_id} python main.py

Note: GraphBP can be trained on a 48GB GPU with batchsize=16. Our trained model is avaliable here.

Generate atoms in the 3D space with the trained model:

CUDA_VISIBLE_DEVICES=${you_gpu_id} python main_gen.py

Postprocess and then save the generated molecules:

CUDA_VISIBLE_DEVICES=${you_gpu_id} python main_eval.py

Reference

@article{liu2022graphbp,
      title={Generating 3D Molecules for Target Protein Binding},
      author={Meng Liu and Youzhi Luo and Kanji Uchino and Koji Maruhashi and Shuiwang Ji},
      journal={arXiv preprint arXiv:2204.09410},
      year={2022},
}

Official implementation of "Generating 3D Molecules for Target Protein Binding"

Related tags

Overview

Generating 3D Molecules for Target Protein Binding

Requirements

Preparing Data

Run

Reference

Owner

DIVE Lab, Texas A&M University

a delightful machine learning tool that allows you to train, test and use models without writing code

Code for the CVPR 2021 paper "Triple-cooperative Video Shadow Detection"

A tensorflow implementation of GCN-LPA

Styled text-to-drawing synthesis method. Featured at the 2021 NeurIPS Workshop on Machine Learning for Creativity and Design

JUSTICE: A Benchmark Dataset for Supreme Court’s Judgment Prediction

ComPhy: Compositional Physical Reasoning ofObjects and Events from Videos

Breast cancer is been classified into benign tumour and malignant tumour.

A Pytorch implementation of CVPR 2021 paper "RSG: A Simple but Effective Module for Learning Imbalanced Datasets"

A library for uncertainty representation and training in neural networks.

Reference implementation for Deep Unsupervised Learning using Nonequilibrium Thermodynamics

A simple implementation of Kalman filter in single object tracking

fklearn: Functional Machine Learning

Applying CLIP to Point Cloud Recognition.

patchmatch和patchmatchstereo算法的python实现

Official implementation of the paper Momentum Capsule Networks (MoCapsNet)

AdamW optimizer for bfloat16 models in pytorch.

Kaggle: Cell Instance Segmentation

Job Assignment System by Real-time Emotion Detection

[ICCV 2021] Amplitude-Phase Recombination: Rethinking Robustness of Convolutional Neural Networks in Frequency Domain

CZU-MHAD: A multimodal dataset for human action recognition utilizing a depth camera and 10 wearable inertial sensors