This repository contains data used in "Fast and Accurate Distance-based Phylogenetic Placement using Divide and Conquer".
The `multigene` directory contains WoL-based data sets and scripts used in this paper. The `singlegene` directory contains the RNAsim-VS data set and scripts. A README file with installation and running instructions is included with each data set.
Some of the data sets require downloading large archives. Use the following Dryad archive to access large datasets:
https://doi.org/doi:10.6076/D1M59N
Archive Name | Description |
---|---|
wol.tar.bz2 | Web of Life dataset containing 10575 species and 381 genes. The archive contains the data and the results for the phylogenetic placement experiment based on the best marker genes. |
sequences.tar.gz | Gene sequences from Web of Life that are used in the WoL-denovo analysis. |
alignments.tar.bz2 | Marker gene alignments for the Web of Life dataset, which contains 10575 species and 381 marker genes. |
rnasim_qscal.tar.xz | Simulated single gene RNASim-QS dataset. |
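
The archives above use three different compression formats (bzip2, gzip, and xz). As a minimal sketch, assuming the archives have already been downloaded from Dryad into the current directory, they could be unpacked with Python's standard `tarfile` module, which detects the compression automatically. The `extract_archive` helper and the `data` output directory are hypothetical names used for illustration only; they are not part of this repository.

```python
import tarfile
from pathlib import Path

# Archive names as listed in the table above.
ARCHIVES = [
    "wol.tar.bz2",
    "sequences.tar.gz",
    "alignments.tar.bz2",
    "rnasim_qscal.tar.xz",
]


def extract_archive(archive, dest="."):
    """Extract a tar archive into dest and return its member names.

    Mode "r:*" lets tarfile pick the decompressor (gzip, bzip2, or xz)
    from the file contents rather than from the extension.
    """
    dest_path = Path(dest)
    dest_path.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive, mode="r:*") as tar:
        names = tar.getnames()
        tar.extractall(path=dest_path)
    return names


if __name__ == "__main__":
    # "data" is an assumed output directory; only archives that are
    # present locally are extracted.
    for name in ARCHIVES:
        if Path(name).exists():
            members = extract_archive(name, dest="data")
            print(f"{name}: extracted {len(members)} entries")
```

Only extract archives obtained from the Dryad link above, since `extractall` writes whatever paths the archive contains.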