James van Alstine's repositories
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
breakseq2
BreakSeq2: Ultrafast and accurate nucleotide-resolution analysis of structural variants
bwa
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
CNVnator
a tool for CNV discovery and genotyping from depth-of-coverage by mapped reads
gatk
Official code repository for GATK versions 4 and up
pypgx
A Python package for pharmacogenomics research
samtools
Tools (written in C using htslib) for manipulating next-generation sequencing data
tern
The SQL Fan's Migrator
TileDB
The Universal Storage Engine
TileDB-FastQ
FastQ ingestor using TileDB storage format
TileDB-VCF
Efficient variant-call data storage and retrieval library using the TileDB storage library.