HazyResearch

We are a CS research group led by Prof. Chris Ré.

Palo Alto, CA

https://cs.stanford.edu/people/chrismre/

HazyResearch's repositories

ThunderKittens

Tile primitives for speedy kernels

Language: Cuda | License: MIT | 1881 32 32

data-centric-ai

Resources for Data Centric AI

Language: TeX | License: Apache-2.0 | 1105 69 7

safari

Convolutions for Sequence Modeling

Language: Assembly | License: Apache-2.0 | 872 34 40

meerkat

Creative interactive views of any dataset.

Language: Python | License: Apache-2.0 | 830 15 83

hyena-dna

Official implementation for HyenaDNA, a long-range genomic foundation model built with Hyena

Language: Assembly | License: Apache-2.0 | 618 21 68

hgcn

Hyperbolic Graph Convolutional Networks in PyTorch.

Language: Python | 604 26 48

m2

Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

Language: Assembly | License: Apache-2.0 | 543 20 32

evaporate

This repo contains data and code for the paper "Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes"

Language: Python | 484 18 25

manifest

Prompt programming with FMs.

Language: Python | License: Apache-2.0 | 440 22 36

aisys-building-blocks

Building blocks for foundation models.

legalbench

An open science effort to benchmark legal reasoning in foundation models

Language: Python | 366 47 13

flash-fft-conv

FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores

Language: C++ | License: Apache-2.0 | 293 16 25

based

Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"

Language: Python | License: Apache-2.0 | 218 16 12

lolcats

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Language: Python | License: Apache-2.0 | 201 20 6

zoology

Understand and test language model architectures on synthetic tasks.

Language: Python | License: Apache-2.0 | 172 14 23

hippo-code

Language: Python | License: Apache-2.0 | 171 20 11

domino

Language: Python | License: Apache-2.0 | 134 20 11

eclair-agents

Automating enterprise workflows with multimodal agents

Language: Jupyter Notebook | License: Apache-2.0 | 97 17 10

structured-nets

Structured matrices for compressing neural networks

Language: Python | License: Apache-2.0 | 67 17 8

train-tk

train with kittens!

Language: Python | 50 30

prefix-linear-attention

Language: Python | 45 15 1

skill-it

Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models

Language: Jupyter Notebook | License: Apache-2.0 | 42 12 1

wonderbread

WONDERBREAD benchmark + dataset for BPM tasks

Language: Jupyter Notebook | 21 13 1

aioli

Aioli: A unified optimization framework for language model data mixing

Language: Jupyter Notebook | License: Apache-2.0 | 18 130

based-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | 9 10

olive-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | 7 10

smoothie

Language: Jupyter Notebook | License: MIT | 1 140

axolive

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | 010

olive-recipes

Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.

Language: Jupyter Notebook | 000

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | 000