rocke2020 / cycpeptmp

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CycPeptMP

License: MIT

About

  • Python implementation of CycPeptMP.

  • CycPeptMP is an accurate and efficient method for predicting the membrane permeability of cyclic peptides.

  • We designed features for cyclic peptides at the atom, monomer, and peptide levels to concurrently capture both the local sequence variations and global conformational changes in cyclic peptides. We also applied data augmentation techniques at three scales to enhance model training efficiency.

    framework

Requirements

  • Python: 3.9.6
  • Numpy: 1.25.0
  • Pandas: 1.4.4
  • Pytorch: 2.0.0 (CUDA: 11.7)
  • RDKit: 2022.09.5
  • Mordred: 1.2.0
  • MOE: 2019.01 (commercial software)

Dataset

  • The original cyclic peptide structure (SMILES) and experimentally determined membrane permeability (LogPexp) used in this study were all sourced from CycPeptMPDB.
  • Selected PAMPA datasets used in this research are summarized in all_data.csv.

Code

  • EXAMPLE.ipynb

    Jupyter notebook with an example of prediction.

  • atoms_model.py

    Transformer-based atom model using Node, Bond, Graph, and Conf created from atoms_input.py. The maximum number of heavy atoms in the input is 128.

  • monomers_model.py

    CNN-based monomer model using 16 monomer features created from monomers_input.py. The maximum number of monomers in the input is 16.

  • peptides_model.py

    MLP-based peptide model using 16 peptide features and 2048-bit Morgan fingerprint.

Pretrained weights

  • Weights of CycPeptMP (60 times augmentation) for three validation runs (Fusion-60_cv*.cpt).
  • Weights of fusion model with no augmentation (Fusion-1_cv*.cpt) and 20 times augmentation (Fusion-20_cv*.cpt) for three validation runs in ablation studies.

Reference

Contact

About

License:MIT License


Languages

Language:Python 100.0%