fransua / aligner

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Reads sequences from file fasta format, and align them. 
If input sequences are nucleotidic translate them using a given genetic code,
aligns amino-acids sequences, and revert alignment to nucleotide sequences.

Alignment:
----------
Can use all of these aligners (if available on your computer):
  * muscle (1)
  * mafft (2)
  * dialign-tx (3)
  * probcons (4)

Also gives the option to align inverted sequences.

Masking:
--------
Once generated your alignments, you can merge them with t-coffee (5). 
Also a cut-off can be applied here, in order 
to mask unstable regions of the alignment 
 -> Mask, here consists in replacing codons by "nnn" (lower case)

Clean alignments with trimal (6), but do not remove poorly aligned columns, it also
masks them, this time "NNN" (upper case).

Rreferences:
------------
 (1) Edgar R C 2004 
     MUSCLE: multiple sequence alignment with high accuracy and high throughput. 
     Nucleic acids research 32 1792–7
 (2) Katoh K, Kuma K-ichi, Toh H and Miyata T 2005 
     MAFFT version 5: improvement in accuracy of multiple sequence alignment. 
     Nucleic acids research 33 511–8
 (3) Subramanian A R, Kaufmann M and Morgenstern B 2008 
     DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment. 
     Algorithms for molecular biology : AMB 3 6
 (4) Do C B, Mahabhashyam M S P, Brudno M and Batzoglou S 2005 
     ProbCons: Probabilistic consistency-based multiple sequence alignment. 
     Genome research 15 330–40
 (5) Notredame C 2010 
     Computing multiple sequence/structure alignments with the T-coffee package. 
     Current protocols in bioinformatics / editoral board, Andreas D. Baxevanis ... [et al.] Chapter 3 Unit 3.8.1–25
 (6) Capella-Gutiérrez S, Silla-Martínez J M and Gabaldón T 2009 
     trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. 
     Bioinformatics (Oxford, England) 25 1972–3

About


Languages

Language:Python 84.4%Language:HTML 15.6%