ekimb / arewebioyet.github.io

Rust in bioinformatics and computational biology

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Are we bio yet?

🎉 Yes! 🎉

(but many projects are still work-in-progress)

The Rust bioinformatics ecosystem contains general, easy-to-use crates like bio, along with a plethora of crates for specific tasks.

Ecosystem

Libraries

Here you can find all sorts of bioinformatics crates that were created by the Rust community:

  • bio - Implementations of many useful bioinformatics data structures and algorithms, including pattern matching, alignment, suffix arrays, BWT, FM-Index, and parsers for common file types.
  • coitrees - Cache oblivious interval tree implementation for very fast overlap queries of a static set of integer intervals, with genomic intervals in mind.
  • needletail - Fast FASTX parsing and k-mer methods in Rust.
  • rust-htslib - Provides HTSlib bindings and a high level Rust API for reading and writing BAM files.
  • triple_accel - Rust edit distance routines accelerated using SIMD. Supports fast Hamming, Levenshtein, restricted Damerau-Levenshtein, etc. distance calculations and string search.
  • fastq-rs - A fast parser for FASTQ.
  • seq_io - FASTA and FASTQ parsing and writing in Rust.

Tools

Here you can find all sorts of bioinformatics tools that were created by the Rust community:

  • bcl2fastr - Faster bcl2fastq implementation.
  • noodles-squab - Noodles squab performs gene expression quantification by counting the number of aligned records that intersects a set of features. Output can be the raw counts or normalized counts in TPM (transcripts per million) or FPKM (fragments per kilobase per million mapped reads).
  • perbase - A highly parallelized utility for generating per-base level metrics.
  • prosolo - A variant caller for multiple displacement amplified DNA sequencing data from diploid single cells
  • sourmash - Quickly search, compare, and analyze genomic and metagenomic data sets.
  • varlociraptor - Varlociraptor implements a novel, unified fully uncertainty-aware approach to genomic variant calling in arbitrary scenarios.
  • finch - A genomic MinHashing implementation.
  • rust-mdbg - Minimizer-space de Bruijn graphs (mdBG) implementation for whole-genome assembly.

Contributing

You can use the editor on GitHub to contribute to this page. Feel free to list new bioinformatics crates! Of course, you can also contribute to the ecosystem by writing a new Rust crate.

About

Rust in bioinformatics and computational biology

License:Creative Commons Zero v1.0 Universal


Languages

Language:Ruby 100.0%