matteoferla / Fragment-hit-follow-up-chemistry

A collection of notebooks and scripts for the prediction of follow-up compounds in

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Scripts and pipelines to elaborate fragment hits

🚧 This repository is under construction 🚧

Herein sections with the icon 🚧 are incomplete.

In notebooks/unsorted there are notebooks that need annotating.

A collection of notebooks and scripts to elaborate fragment hits. I used for elaboration chemistry for XChem targets, ie. small fragments identified crystallographically.

It is not intended as an exhaustive repository of all possible methods. For now I am adding my code (:construction:).

Furthermore, this is narrowly focused on the hit discovery stage of drug discovery, before any activity data can be determined and any that does exist are deep in noise (so no talk of QSAR, activity cliffs, ADMET etc etc). Many scripts will be Diamond Light Source XChem specific.

For now see: :octocat:: EV D68-3C protease or :octocat:: EV A71-2A protease.

Read the Docs https img shields io pypi pyversions fragment_elaboration_scripts logo python https img shields io pypi v fragment_elaboration_scripts logo python https img shields io pypi dm fragment_elaboration_scripts logo python https img shields io github license matteoferla fragment_elaboration_scripts logo github https img shields io github last commit matteoferla rdkit_to_params logo github https img shields io github commit activity m matteoferla rdkit_to_params logo github

Colab notebooks

Given a Uniprot ID make a sequence logo of the cleavage sites (viral polyprotein) colab

Given a Fragalysis valid upload set, fix it to pick the best Fragalysis template and do minor tweaks to make it fit there. I.e. pretend an induced fit compound binds by lock and key. colab.

For the preparation of an upload file colab.

Scripts

These scripts in fragment_elaboration_scripts. Some contain in their docstring a description of the script and a yaml block for installation with conda if needed.

I have used somely via Python in Jupyter notebooks, I have converted them to CLI scripts for the convenience of others. But I have not tested them all.

Installation (without dependencies)

These can be installed with pip:

pip install fragment_elaboration_scripts

or individually fetched from the repo (which means it will be the latest):

pip install gist_import

For example:

from gist_import import GistImporter
from types import ModuleType
from typing import List
from rdkit import Chem

url = 'https://raw.githubusercontent.com/matteoferla/Fragment-hit-follow-up-chemistry/main/fragment_elaboration_scripts/fragment.py'
frag: ModuleType = GistImporter.from_github(url).to_module()
mols: List[Chem.Mol] = frag.fragment(...)
mols: List[Chem.Mol] = frag.remove_duplicated(...)

Fragalysis

See fragalysis_targetting_scripts.md.

Utilities

See utilities_scripts.md.

Elaboration pipelines

See elaboration_scripts.md.

Other

For the questionnaire given to user to direct which target analyses MF uses see questionnaire.md.

Dataset generation

combined XChem libraries is a CSV file of the compounds in the XChem libraries.

The raw code to generate it is notebooks/combine_libraries.ipynb.

NB. Some compounds are in two or more libraries. Each row is a library entry. See also_in column for duplicates.

About

A collection of notebooks and scripts for the prediction of follow-up compounds in

License:MIT License


Languages

Language:Jupyter Notebook 90.4%Language:Python 9.1%Language:Shell 0.4%Language:Batchfile 0.1%Language:Makefile 0.0%