sauloal / cnidaria

CNIDARIA: fast, reference-free phylogenomic clustering

Home Page:https://github.com/sauloal/cnidaria/wiki

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CNIDARIA: fast, reference-free phylogenomic clustering

[https://travis-ci.org/sauloal/cnidaria](Travis Continuous Integration Build Status)

Manual & Wiki: https://github.com/sauloal/cnidaria/wiki

Cnidaria does not need to be compiled but if you really want to, please refer to INSTALL.md.

Motivation: Identification of biological specimens is a major requirement for a range of applications. Reference-free methods analyse unprocessed sequencing data without relying on prior knowledge, but these do not scale to arbitrarily large genomes and arbitrarily large phylogenetic distances.

Results: We present Cnidaria, a practical tool for clustering genomic and transcriptomic data with no limitation on ge-nome size or phylogenetic distances. We successfully simultaneously clustered 169 genomic and transcriptomic datasets from 4 kingdoms, achieving 100% accuracy at supra-species level and 78% accuracy for species level.

Availability and Implementation: Cnidaria is written in C++ and Python and is available at http://www.ab.wur.nl/cnidaria

Contact: sauloal@gmail.com

Manual: https://sauloal.github.io/cnidaria/

About

CNIDARIA: fast, reference-free phylogenomic clustering

https://github.com/sauloal/cnidaria/wiki

License:MIT License


Languages

Language:Python 66.1%Language:Jupyter Notebook 19.7%Language:C++ 7.1%Language:C 4.0%Language:Shell 1.6%Language:JavaScript 0.6%Language:Makefile 0.4%Language:R 0.2%Language:Groff 0.1%Language:CSS 0.0%Language:HTML 0.0%Language:M4 0.0%Language:Fortran 0.0%Language:Objective-C 0.0%