Zach Nussbaum's repositories
gpt4all.cpp
Locally run an Assistant-Tuned Chat-Style LLM
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
AIR-Bench
AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
arena
Code for the MTEB Arena
BERT4Rec-VAE-Pytorch
PyTorch implementation of BERT4Rec and Netflix VAE.
enformer-tensorflow-sonnet-training-script
The full training script for Enformer - TensorFlow Sonnet
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
llama_index
LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
LongEmbed
Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"
mlx_embedding_models
Run embeddings in MLX
mteb
MTEB: Massive Text Embedding Benchmark
nodelist-inflator
CLI tool to easily expand a list of hostnames (which include brackets) and write to a hostfile
pylate
Late Interaction Models Training & Retrieval
results
Data for the MTEB leaderboard
sentence-transformers
State-of-the-Art Text Embeddings
session_based_recommenders
Official repo for FF19: Session-based Recommender Systems
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch and TensorFlow 2.0.
Transformers4Rec
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and TensorFlow.