gabe-foley-thesis / ancestral_agreement

Calculate the level of agreement between two ancestral reconstructions from differing tools

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Ancestral Agreement

Ancestral Agreement is a method, inspired by Bland-Altman plots, for comparing the agreement between two sets of predicted biological ancestral sequences.

Each pair of ancestral protein sequences between two sets are converted into quantitative predictions for a range of metrics and the differences between sets are plotted against the evolutionary age of each ancestor. Results are easily interpretable and highlight whether particular methods consistently disagree, whether specific nodes are inconsistently inferred, and specific evolutionary times beyond which methods start to diverge.

Installation

  1. Install MongoDB

https://docs.mongodb.com/manual/installation/

You can set the name of the database you want to create in aa/configs/mongoconfig.py and then when you run the application it will create the database for you.

  1. Clone the repository
git clone github.com/gabe-foley-thesis/ancestral_agreement
cd ancestral_agreement
  1. Install requirements

Recommended to do this within a conda or virtual environment

pip install -r requirements.txt

Running the program

python aa/routes.py

You can now open a web browser and go to http://localhost:5000 -

This gives you the upload form which you can enter your data into.

Inputs required are a phylogenetic tree labelled with ancestral node positions (Newick format) and two sets of ancestor sequences from different prediction methods / settings that correspond to the labelled ancestral node positions (FASTA format).

See /files/test_6_1.aln, test_6_2.aln, and test_6.nwk for the expected format.

Viewing the example data

There is an example data set for a CYP2U1 ancestral reconstruction using either GRASP or FastML. It gets loaded automatically into the Mongo database when you run the app.

If you just want to view the example data then navigate to http://localhost:5000/dash and it will load automatically. Just select the two data sets from the dropdown menus.

Notes

After you're loaded the files you need to select the data sets you're interested in viewing from the drop down menu.

Currently the phylogenetic tree must be identical for each method.

About

Calculate the level of agreement between two ancestral reconstructions from differing tools


Languages

Language:Python 84.8%Language:JavaScript 7.0%Language:CSS 5.1%Language:HTML 3.0%Language:Dockerfile 0.1%