tairsha / taxa-specific-substitution-matrix

Algorithms for the generation of substitution matrices from intra-taxa variation data. Implemented and published on human genetic variation data.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

What is taxa-specific-substitution-matrix?

This is a collection of Python implemented bioinformatic algorithms for the generation of substitution matrices from intra-taxa genetic variation data. It allows one to infer a basic set of substitution matrices that may be genrated from one's data. These algorithms were implemented and published on human genetic variation data, but may be used widely for any variational dataset.

Usage examples can be found @ usage_examples.ipynb

What's in here?

deduce_substitution_matrices.py

This module can be used to convert aggregated genetic variant information into substitution matrices. You may create substitution matrices in all resolutios: nucleotide substitution matrix, codon substitution matrix, and amino-acid substitution matrix.

calculate_ha.py

This module can be used to construct an amino-acid substitution matrix from a codon substitution matrix which is taken to the specified power.

project_utils.py

Utilities to support other modules

Please cite

Evolutionary and Functional Lessons from Human-Specific Amino-Acid Substitution Matrices Tair Shauli, Nadav Brandes, Michal Linial bioRxiv 2020.05.09.086009; doi: https://doi.org/10.1101/2020.05.09.086009

About

Algorithms for the generation of substitution matrices from intra-taxa variation data. Implemented and published on human genetic variation data.

License:MIT License


Languages

Language:Jupyter Notebook 93.1%Language:Python 6.9%