Anirudh's repositories
ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
arrow-datafusion-comet
Apache Arrow DataFusion Comet Spark Accelerator
datafusion
Apache DataFusion SQL Query Engine
datasketch
MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog, HyperLogLog++, LSH Ensemble
datasketches-cpp
Core C++ Sketch Library
datasketches-java
Core Sketch Library.
datafusion-ballista
Apache Arrow Ballista Distributed Query Engine
datasketches-python
Apache datasketches
incubator-horaedb
HoraeDB is a high-performance, distributed, cloud native time-series database.
instructor
structured outputs for llms
luminal
Deep learning at the speed of light.
manticoresearch
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
outlines
Structured Text Generation
oxide-enzyme
Enzyme integration into Rust. Experimental, do not use.
vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
zed
Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter.