yupaulk / BacteriophageHostPrediction

Code and database related to our manuscript "Predicting bacteriophage hosts based on receptor-binding proteins"

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins

This repository contains the code and database related to our manuscript "Predicting bacteriophage hosts based on sequences of annotated receptor-binding proteins", published in Scientific Reports (January 2021). This research is funded by a PhD fellowship strategic basic research from the Research Foundation – Flanders (FWO), grant number 1S69520N.

Access the research paper via the following link: https://www.nature.com/articles/s41598-021-81063-4

More specifically, this repository contains the following files:

  • RBP_database.csv contains the collected RBP sequences as described in Materials & Methods.

  • RBP_functions.py is a Python script containing all the necessary manually implemented functions for the various analyses carried out in this study.

  • RBP_alignment.jl is a Julia script used to compute pairwise alignments between the sequences to explore the diversity and assess redundancy in the database.

  • RBP_alignmentscores.txt is .txt file containing the alignment scores if you prefer not to compute them with the provided Julia script.

  • RBP_host_prediction.ipynb is an IPython notebook containing all the analyses, model constructions and evaluations described in Materials & Methods.

About

Code and database related to our manuscript "Predicting bacteriophage hosts based on receptor-binding proteins"

License:MIT License


Languages

Language:Jupyter Notebook 95.7%Language:Python 3.7%Language:Julia 0.5%