Maozhou Ge's starred repositories
Awesome-LLM
Awesome-LLM: a curated list of Large Language Model resources
llama3-from-scratch
A llama3 implementation, one matrix multiplication at a time
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, with automatic mixed precision (including fp8) and easy-to-configure FSDP and DeepSpeed support
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
ThunderKittens
Tile primitives for speedy kernels
torchtitan
A native PyTorch library for large model training
Skywork
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. The model weights, training data, evaluation data, and evaluation methods have been open-sourced.
ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
long-context-attention
Sequence Parallel Attention for Long-Context LLM Training and Inference
nccl-rdma-sharp-plugins
RDMA and SHARP plugins for the NCCL library
modern-latex
A short guide to LaTeX that avoids legacy cruft.
ml-systems-papers
Curated collection of papers in machine learning systems
grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.