mrcardholder's starred repositories
hyperloglog
HyperLogLog and HyperLogLog++ implementation in Go/Golang.
genometools
GenomeTools genome analysis system.
rna-seq-diff-exprn
RNA-Sequencing data differential expression analysis pipeline. Performs: genome coverage (via bedtools and HTSeq), generates Circos code and plots, differential expression (via DESeq and NOISeq), structural variant detection (e.g. fusion genes, via SVDetect) and differential exon usage (via DEXSeq).
genoset-norovirus
Determines if a genome is immune to norovirus (rs601338)
WiggleTools
Basic operations on the space of numerical functions defined on the genome using lazy evaluators for flexibility and efficiency
SimSeq
An illumina paired-end and mate-pair short read simulator. This project attempts to model as many of the quirks that exist in Illumina data as possible. Some of these quirks include the potential for chimeric reads, and non-biotinylated fragment pull down in mate-pair libraries . Additionally the program provides the ability to model both site and base specific error, and scripts are provided to train this error model on real datasets. My hope in creating this program is to generate as realistic data as possible to assist in assessing the accuracy of genome assembly tools.
genomesunzipped
Code for Genomes Unzipped
trait-o-matic
An open-source tool to find and classify phenotypic correlations for variations in whole genomes. (For deployment, use the production branch.)
RetroSeq
RetroSeq is a bioinformatics tool that searches for mobile element insertions from aligned reads in a BAM file and a library of reference transposable elements. Please read the wiki page (link below) for usage instructions. Also, there is a page on the wiki describing how the 1000 genomes CEU trio was carried out with the files and parameters used for the various steps.
chromozoom
ChromoZoom is a fast, fluid web-based genome browser