Chris Ha's repositories
CU_MSCS_Projects
Project portfolios as part of final and assignment projects from Colorado University MSCS classes
CLRS-rs
CLRS pseudocode in rust
ocrs
A modern OCR engine, written in Rust
charset-normalizer-rs_original
Truly universal encoding detector in pure Rust - port of Python version
rspdf
PDF library in Rust
elara
Work-in-progress educational programming game
esaxx-rs
Bindings to copy of SentencePiece esaxx library (fast suffix array and frequent substrings).
candle
Minimalist ML framework for Rust
subset_sum
Solves subset sum problem and returns a set of decomposed integers.
regex
An implementation of regular expressions for Rust. This implementation uses finite automata and guarantees linear time matching on all inputs.
itertools
Extra iterator adaptors, iterator methods, free functions, and macros.
highway-rs
Native Rust port of Google's HighwayHash, which makes use of SIMD instructions for a fast and strong hash function
tokenizers
đź’Ą Fast State-of-the-Art Tokenizers optimized for Research and Production
aHash
aHash is a non-cryptographic hashing algorithm that uses the AES hardware instruction
sonic-rs
A fast Rust JSON library based on SIMD.
dolma
Data and tools for generating and inspecting OLMo pre-training data.
unicode_names2
char <-> Unicode character name (maintained fork of huonw/unicode_names)
suffix
Fast suffix arrays for Rust (with Unicode support).
counter-rs
Simple object to count Rust iterables
text-dedup
All-in-one text de-duplication
awesome-data-deduplication
An awesome list of data deduplication use cases, papers, tools, and methods.
rfcs
RFCs for changes to Rust
vers
very efficient rank and select
llm
An ecosystem of Rust libraries for working with large language models
Symphonia
Pure Rust multimedia format demuxing, tag reading, and audio decoding library
ogg
Ogg container decoder and encoder written in pure Rust
chash
Consistent HashRing