Chris Ha's repositories
aHash
aHash is a non-cryptographic hashing algorithm that uses the AES hardware instruction
awesome-data-deduplication
An awesome list of data deduplication use cases, papers, tools, and methods.
candle
Minimalist ML framework for Rust
charset-normalizer-rs_original
Truly universal encoding detector in pure Rust - port of Python version
chash
Consistent HashRing
CLRS-rs
CLRS pseudocode in rust
counter-rs
Simple object to count Rust iterables
dashmap
Blazing fast concurrent HashMap for Rust.
dolma
Data and tools for generating and inspecting OLMo pre-training data.
elara
Work-in-progress educational programming game
esaxx-rs
Bindings to copy of SentencePiece esaxx library (fast suffix array and frequent substrings).
highway-rs
Native Rust port of Google's HighwayHash, which makes use of SIMD instructions for a fast and strong hash function
itertools
Extra iterator adaptors, iterator methods, free functions, and macros.
llm
An ecosystem of Rust libraries for working with large language models
ocrs
A modern OCR engine, written in Rust
ogg
Ogg container decoder and encoder written in pure Rust
regex
An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
rfcs
RFCs for changes to Rust
rspdf
PDF library in Rust
sonic-rs
A fast Rust JSON library based on SIMD.
subset_sum
Solves subset sum problem and returns a set of decomposed integers.
suffix
Fast suffix arrays for Rust (with Unicode support).
Symphonia
Pure Rust multimedia format demuxing, tag reading, and audio decoding library
text-dedup
All-in-one text de-duplication
tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
unicode_names2
char <-> Unicode character name (maintained fork of huonw/unicode_names)
vers
very efficient rank and select