Learning Chip's starred repositories

open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

Language:CLicense:NOASSERTIONStargazers:14150Issues:172Issues:318

Llama-Chinese

Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8597Issues:81Issues:34

ZLUDA

CUDA on AMD GPUs

Language:RustLicense:Apache-2.0Stargazers:8218Issues:117Issues:149

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5526Issues:46Issues:73

diffusion-models-class

Materials for the Hugging Face Diffusion Models Course

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:3298Issues:101Issues:22

ctransformers

Python bindings for the Transformer models implemented in C/C++ using GGML library.

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1248Issues:12Issues:25

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:923Issues:41Issues:63

FRIDAY

An self-improving embodied conversational agent seamlessly integrated into the operating system to automate our daily tasks.

Language:PythonLicense:MITStargazers:851Issues:10Issues:9

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Language:PythonLicense:MITStargazers:681Issues:15Issues:58

LongLM

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Language:PythonLicense:MITStargazers:536Issues:9Issues:33

DiffusionFastForward

DiffusionFastForward: a free course and experimental framework for diffusion-based generative models

Language:Jupyter NotebookLicense:MITStargazers:530Issues:8Issues:8

LongChat

Official repository for LongChat and LongEval

Language:PythonLicense:Apache-2.0Stargazers:497Issues:10Issues:37

H2O

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

neural-speed

An innovative library for efficient LLM inference via low-bit quantization

Language:C++License:Apache-2.0Stargazers:299Issues:8Issues:44

stripedhyena

Repository for StripedHyena, a state-of-the-art beyond Transformer architecture

Language:PythonLicense:Apache-2.0Stargazers:240Issues:5Issues:7

inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

Language:C++License:MITStargazers:229Issues:7Issues:16

ao

torchao: PyTorch Architecture Optimization (AO). Performant kernels that work with PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:192Issues:6Issues:6

float8_experimental

This repository contains the experimental PyTorch native float8 training UX

Language:PythonLicense:BSD-3-ClauseStargazers:182Issues:25Issues:44

llm-swarm

Manage scalable open LLM inference endpoints in Slurm clusters

Language:PythonLicense:MITStargazers:178Issues:36Issues:0

mscclpp

MSCCL++: A GPU-driven communication stack for scalable AI applications

Language:C++License:MITStargazers:168Issues:17Issues:82

ssm-book

Interactive textbook on state-space models

Language:Jupyter NotebookLicense:MITStargazers:164Issues:15Issues:2

REST

REST: Retrieval-Based Speculative Decoding, NAACL 2024

Language:CLicense:Apache-2.0Stargazers:143Issues:6Issues:12

self-speculative-decoding

Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:108Issues:3Issues:17

TOVA

Token Omission Via Attention

Language:PythonLicense:Apache-2.0Stargazers:108Issues:3Issues:2

babilong

BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:90Issues:5Issues:1

CUDA-Programs

Examples from Programming in Parallel with CUDA

Language:CudaLicense:GPL-2.0Stargazers:90Issues:5Issues:0

LVEval

Repository of LV-Eval Benchmark

Language:PythonLicense:MITStargazers:32Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:17Issues:4Issues:0