satanson's repositories
cpp_etudes
Smart tools for source code study: cpptree.pl, calltree.pl, javatree.pl, java_calltree.pl
cqf_implementation
Safe implementation of a CQF
cudf
cuDF - GPU DataFrame Library
emvb
Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024
fast-multi-join-sketch
Fast Cardinality Estimation of Multi-Join Queries Using Sketches
fe-plugins-auditloader
AuditLoader plugin for FE
gemma.cpp
Lightweight, standalone C++ inference engine for Google's Gemma models.
MS-DOS
The original sources of MS-DOS 1.25, 2.0, and 4.0 for reference purposes
OpenAurora
OpenAurora is a cloud-native database system prototype developed at Purdue University. It is an open-source version of Amazon Aurora. It is designed to enable more research in cloud-native database systems for our database community.
Python
All Algorithms implemented in Python
RaBitQ
[SIGMOD 2024] RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search
souffle
Soufflé is a variant of Datalog for tool designers crafting analyses in Horn clauses. Soufflé synthesizes a native parallel C++ program from a logic specification.
stan
🕵️ Haskell STatic ANalyser
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Trinity
EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"
unitycatalog
Open, Multi-modal Catalog for Data & AI
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
v
Simple, fast, safe, compiled language for developing maintainable software. Compiles itself in <1s with zero library dependencies. Supports automatic C => V translation. https://vlang.io
vectordb
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/