frenkowski / MSA_SARS-CoV-2

Multiple Sequence Alignment (MSA) of SARS-Cov2 sequences.

Home Page:https://frenkowski.github.io/MSA_SARS-CoV-2/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Multiple Sequence Alignment (MSA) Analysis of SARS-Cov2 sequences

Team

Ali Manan (mohammadalimanan+github@gmail.com)
Francesco Porto (francescoporto97+github@gmail.com)
Francesco Stranieri (frenkowski+github@gmail.com)

Abstract

In this report we analyze Multiple Sequence Alignments (MSAs) obtained from 18 samples of European COVID-19 patients and we compare against 3 samples from China, in order to figure out how similar or different they are. We also present our tool for detecting differences in a MSA and building a phylogenetic tree from a subset of these differences. By analyzing the results of our tool on the provided samples, we make an attempt to either confirm or deny various claims about the origin of the outbreak in Europe.

There are 3 parts to this project:

  1. We look for differences between a reference sequence and other sequences in a MSA; we obtain a file containing information on these differences (i.e. position, type, etc.)
  2. We compare codons in annotated genes to figure out whether CDS are still translatable or not. We also obtain a list of codon-differences, along with their relative amino-acids.
  3. We obtain a phylogenetic tree from a character matrix obtained from the output of the previous part.

Instructions

  • Place a MSA in the /Alignments folder.
  • Set the reference name in /Src/findDifferencesMSA.py (line 255).
  • Run the following command: python3 runScript.py
  • The resulting files should be /Outputs/Part_X.

NOTE: currently only works under *NIX.

About

Multiple Sequence Alignment (MSA) of SARS-Cov2 sequences.

https://frenkowski.github.io/MSA_SARS-CoV-2/


Languages

Language:Python 100.0%