HazyResearch's repositories
ThunderKittens
Tile primitives for speedy kernels
data-centric-ai
Resources for Data Centric AI
aisys-building-blocks
Building blocks for foundation models.
legalbench
An open science effort to benchmark legal reasoning in foundation models
flash-fft-conv
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
eclair-agents
Automating enterprise workflows with multimodal agents
structured-nets
Structured matrices for compressing neural networks
wonderbread
WONDERBREAD benchmark + dataset for BPM tasks
based-evaluation-harness
A framework for few-shot evaluation of language models.
olive-evaluation-harness
A framework for few-shot evaluation of language models.
olive-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs