Christopher Akiki's repositories
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
arrow-site
Mirror of Apache Arrow site
awesome-huggingface
🤗 A list of wonderful open-source projects & applications integrated with Hugging Face libraries.
BlingFire
A lightning fast Finite State machine and REgular expression manipulation library.
blog
Public repo for HF blog posts
complexity-scaling
gzip Predicts Data-dependent Scaling Laws
DataFrames.jl
In-memory tabular data in Julia
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
dhlab-site
dhlab.yale.edu
dspy
DSPy: The framework for programming—not prompting—foundation models
fast_hdbscan
A fast multi-core implementation of HDBSCAN for low dimensional Euclidean spaces
gaia
Hugging Face and Pyserini interoperability
gradio
Create UIs for your machine learning model in Python in 3 minutes
huggingface.js
Utilities to use the Hugging Face Hub API
ir_datasets
Provides a common interface to many IR ranking datasets.
nnue-pytorch
Stockfish NNUE (Chess evaluation) trainer in Pytorch
PHATE
PHATE (Potential of Heat-diffusion for Affinity-based Transition Embedding) is a tool for visualizing high dimensional data.
pyterrier
A Python framework for performing information retrieval experiments, building on http://terrier.org/
re2
RE2 is a fast, safe, thread-friendly alternative to backtracking regular expression engines like those used in PCRE, Perl, and Python. It is a C++ library.
thisnotthat
A visual labeling system implemented in Jupyter widgets.
trl
Train transformer language models with reinforcement learning.
vectorizers
Vectorizers for a range of different data types
what-if-tool
Source code/webpage/demos for the What-If Tool