Bailin's starred repositories

llama.cpp

LLM inference in C/C++

tinygrad

You like pytorch? You like micrograd? You love tinygrad! ❤️

Language:PythonLicense:MITStargazers:24282Issues:266Issues:614

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:19928Issues:192Issues:2817

shap-e

Generate 3D objects conditioned on text or images

Language:PythonLicense:MITStargazers:11366Issues:238Issues:109

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8772Issues:116Issues:115

llama-recipes

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:7850Issues:68Issues:227

open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonLicense:BSD-3-ClauseStargazers:5210Issues:59Issues:86

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language:Jupyter NotebookLicense:MITStargazers:5132Issues:27Issues:25

cutlass

CUDA Templates for Linear Algebra Subroutines

Language:C++License:NOASSERTIONStargazers:4672Issues:106Issues:864

keops

KErnel OPerationS, on CPUs and GPUs, with autodiff and without memory overflows

Language:PythonLicense:MITStargazers:1007Issues:14Issues:298
Language:PythonLicense:Apache-2.0Stargazers:998Issues:10Issues:53

safari

Convolutions for Sequence Modeling

Language:AssemblyLicense:Apache-2.0Stargazers:842Issues:36Issues:38

m2

Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"

Language:AssemblyLicense:Apache-2.0Stargazers:501Issues:21Issues:24

H3

Language Modeling with the H3 State Space Model

Language:AssemblyLicense:Apache-2.0Stargazers:496Issues:32Issues:26

rellm

Exact structure out of any language model completion.

Language:PythonLicense:MITStargazers:492Issues:11Issues:4

attention_with_linear_biases

Code for the ALiBi method for transformer language models (ICLR 2022)

Language:PythonLicense:MITStargazers:480Issues:11Issues:18

SGEMM_CUDA

Fast CUDA matrix multiplication from scratch

Language:CudaLicense:MITStargazers:289Issues:2Issues:4

liquid-s4

Liquid Structural State-Space Models

Language:PythonLicense:Apache-2.0Stargazers:253Issues:15Issues:2

GenSim

GenSim: Generating Robotic Simulation Tasks via Large Language Models

Language:PythonLicense:MITStargazers:247Issues:7Issues:9

ModuleFormer

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Language Models (MoLM) ranging in scale from 4 billion to 8 billion parameters.

Language:PythonLicense:Apache-2.0Stargazers:216Issues:11Issues:5

llm_large_context

Large Sequence Modeling with Transformers

Language:PythonLicense:Apache-2.0Stargazers:180Issues:4Issues:5

zoology

Understand and test language model architectures on synthetic tasks.

Language:PythonLicense:Apache-2.0Stargazers:143Issues:15Issues:15

BiGS

Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE benchmark with subquadratic complexity in length (or without attention).

Language:PythonLicense:Apache-2.0Stargazers:105Issues:3Issues:1

explain-then-translate

Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations"

ConvRe

🤖ConvRe🤯: An Investigation of LLMs’ Inefficacy in Understanding Converse Relations (EMNLP 2023)

Language:PythonStargazers:20Issues:3Issues:0

pytorch_linear_rnn

Implementations of various linear RNN layers using pytorch and triton

Logical-and-abstract-reasoning

Evaluation on Logical Reasoning and Abstract Reasoning Challenges

Language:PythonLicense:MITStargazers:16Issues:3Issues:0

pyccg

A Python library for distantly supervised CCG parsing/learning, based off of NLTK's CCG implementation

Language:PythonStargazers:10Issues:1Issues:0