jeah-z / liGAN

Deep generative models of 3D grids for structure-based drug discovery

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

liGAN

liGAN methods diagram

liGAN is a PyTorch project for structure-based drug discovery with deep generative models of atomic density grids.

Dependencies

  • numpy
  • pandas
  • scikit-image
  • torch
  • openbabel
  • rdkit
  • molgrid
  • gnina

NOTE: Please be aware that the current version of molgrid provided through pip/conda is incompatible with conda openbabel, and you will likely get segmentation faults if you install them both through conda.

A molgrid conda build recipe is in the works (see https://github.com/mattragoza/conda-molgrid), but for now you can use this environment to build libmolgrid from source.

Usage

Generating molecules

To generate molecules, you must first download the pretrained model weights:

sh download_weights.sh

Then just run the generate.py script with the default configuration file:

python3 generate.py config/generate.config

Training a model

To train a model from scratch, you must first download the full Crossdocked2020 data set:

sh download_data.sh

More info about this data set can be found here.

Then you can run the train.py script with the default configuration file:

python3 train.py config/train.config

Citation

@article{ragoza2022chemsci,
	title={{Generating 3D molecules conditional on receptor binding sites with deep generative models}},
	author={Matthew Ragoza and Tomohide Masuda and David Ryan Koes},
	journal={Chem Sci},
	month={Feb},
	day={7},
	year={2022},
	volume={13},
	pages={2701--2713},
	doi={10.1039/D1SC05976A},
	abstract={The goal of structure-based drug discovery is to find small molecules that bind to a given target protein. Deep learning has been used to generate drug-like molecules with certain cheminformatic properties, but has not yet been applied to generating 3D molecules predicted to bind to proteins by sampling the conditional distribution of protein–ligand binding interactions. In this work, we describe for the first time a deep learning system for generating 3D molecular structures conditioned on a receptor binding site. We approach the problem using a conditional variational autoencoder trained on an atomic density grid representation of cross-docked protein–ligand structures. We apply atom fitting and bond inference procedures to construct valid molecular conformations from generated atomic densities. We evaluate the properties of the generated molecules and demonstrate that they change significantly when conditioned on mutated receptors. We also explore the latent space learned by our generative model using sampling and interpolation techniques. This work opens the door for end-to-end prediction of stable bioactive molecules from protein structures with deep learning.},
}

About

Deep generative models of 3D grids for structure-based drug discovery

License:GNU General Public License v2.0


Languages

Language:Python 99.4%Language:Shell 0.6%