EBGU / DPAM

A domain parser for Alphafold models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DPAM

A Domain Parser for Alphafold Models

DPAM: A Domain Parser for AlphaFold Models (https://www.biorxiv.org/content/10.1101/2022.09.22.509116v1, accepted by Protein Science)

Updates:

Upload domain parser results for six model organisms. (2022-12-6)

Replace Dali with Foldseek for initial hits searching. (2022-11-30)

Fix a bug in analyze_PDB.py which prevents the proper usage of Dali results. (2022-10-31)

Prerequisites:

Software and packages

Please add above software to environment path for DPAM. We also provide a script check_dependencies.py to check if above programs can be found.

Supporting database:

We provide a script download_all_data.sh that can be used to download all of these databases.

bash download_all_data.sh <DOWNLOAD_DIR>

After downloading the databases, please decompress files. All supporting database files should be put in the same directory and the directory should be provided to DPAM.py as <datadir>. The <datadir> should have the following structure and files.

<datadir>/
    ECOD70/
    ecod_domain_info/
    ECOD_foldseek_DB/
    ecod_weights/
    pdb70/
    UniRef30_2022_02/
    ecod.latest.domains
    ECOD_length
    ECOD_norms
    ECOD_pdbmap

Installation

git clone https://github.com/CongLabCode/DPAM.git

conda install -c qianlabcode dpam

Usage

python DPAM.py <input_cif/pdb> <input_pae> <accession> <output_dir> <threads> <datadir>

Future improvments

  • Incoperate mmseq to improve search speed
  • Provide public server and incoperate with ECOD database

About

A domain parser for Alphafold models

License:GNU General Public License v3.0


Languages

Language:Python 97.2%Language:Shell 2.8%