JLSteenwyk / latch_wf_seq_to_tree

codebase for latch workflow that infers the evolutionary history for a set of sequences



MAFFT -> ClipKIT -> IQ-TREE 2


Workflow author: Jacob L. Steenwyk


This is an implementation of a 'typical' workflow for inferring the evolutionary history among a set of sequences.


Seq_to_tree


About

The seq_to_tree workflow goes from a multi-FASTA file of sequences to a phylogenetic tree. Specifically, it aligns the sequences using MAFFT, trims the alignment using ClipKIT, and then infers the evolutionary history of the sequences using IQ-TREE 2.

Users can modify many, but not all, of the parameters used to run each program. These are described in detail below. Only two inputs are required: a name for the output directory and the multi-FASTA file. Default values are used for every other option that is left unset.
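For readers who want to see what the three steps look like outside of Latch, the following is a minimal sketch of an equivalent command-line pipeline in Python. It is not the workflow's actual implementation: the function names (`build_commands`, `run_pipeline`), the intermediate file names, and the choice of `mafft --auto` are assumptions made for illustration, and the only inputs are the two required ones named above (the multi-FASTA file and the output directory). It also assumes that `mafft`, `clipkit`, and `iqtree2` are installed and on the `PATH`.

```python
from pathlib import Path
import subprocess


def build_commands(fasta, out_dir):
    """Return the three pipeline steps as argument lists; nothing is executed here."""
    out = Path(out_dir)
    stem = Path(fasta).stem
    aligned = out / f"{stem}.mafft"           # hypothetical name for the alignment
    trimmed = out / f"{stem}.mafft.clipkit"   # hypothetical name for the trimmed alignment
    return [
        # MAFFT writes the alignment to standard output, so it is redirected to a file.
        {"argv": ["mafft", "--auto", str(fasta)], "stdout": str(aligned)},
        # ClipKIT takes the alignment as a positional argument and -o for the output.
        {"argv": ["clipkit", str(aligned), "-o", str(trimmed)], "stdout": None},
        # IQ-TREE 2 reads the trimmed alignment (-s) and writes files under --prefix.
        {"argv": ["iqtree2", "-s", str(trimmed), "--prefix", str(out / stem)],
         "stdout": None},
    ]


def run_pipeline(fasta, out_dir):
    """Run alignment, trimming, and tree inference in order, stopping on any failure."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    for step in build_commands(fasta, out_dir):
        if step["stdout"] is not None:
            with open(step["stdout"], "w") as handle:
                subprocess.run(step["argv"], stdout=handle, check=True)
        else:
            subprocess.run(step["argv"], check=True)
```

Calling `run_pipeline("seqs.fasta", "results")` would run each tool with its own defaults. Building the commands separately from running them keeps the step order easy to inspect before anything is executed.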


Citations

If you found seq_to_tree useful, please cite:

  • MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability. Katoh & Standley 2013, Molecular Biology and Evolution. doi: 10.1093/molbev/mst010
  • ClipKIT: a multiple sequence alignment trimming software for accurate phylogenomic inference. Steenwyk et al. 2020, PLoS Biology. doi: 10.1371/journal.pbio.3001007
  • IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Minh et al. 2020, Molecular Biology and Evolution. doi: 10.1093/molbev/msaa015


