YP's starred repositories

NewPipe

A libre lightweight streaming front-end for Android.

Language:JavaLicense:GPL-3.0Stargazers:28994Issues:594Issues:8660

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:20093Issues:199Issues:108

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:17172Issues:204Issues:39

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Language:SystemVerilogStargazers:6274Issues:56Issues:19

hatchet

A distributed, fault-tolerant task queue

Language:GoLicense:MITStargazers:3344Issues:8Issues:107

Shiro

📜 A minimalist personal website embodying the purity of paper and freshness of snow.

Language:TypeScriptLicense:NOASSERTIONStargazers:2906Issues:11Issues:76

LapisCV

📃 开箱即用的 Obsidian / Typora 简历

Language:CSSLicense:MITStargazers:1861Issues:33Issues:8

so-large-lm

大模型基础: 一文了解大模型基础知识

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:1178Issues:27Issues:86

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1158Issues:25Issues:11

how-to-optim-algorithm-in-cuda

how to optimize some algorithm in cuda.

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:991Issues:22Issues:6

LlamaGym

Fine-tune LLM agents with online reinforcement learning

Language:PythonLicense:MITStargazers:924Issues:8Issues:7

llm-reasoners

A library for advanced large language model reasoning

Language:PythonLicense:Apache-2.0Stargazers:918Issues:14Issues:29

nanotron

Minimalistic large language model 3D-parallelism training

Language:PythonLicense:Apache-2.0Stargazers:834Issues:41Issues:58

How_to_optimize_in_GPU

This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.

Language:CudaLicense:Apache-2.0Stargazers:707Issues:12Issues:15

tutel

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Language:PythonLicense:MITStargazers:673Issues:14Issues:54

makeMoE

From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)

Language:Jupyter NotebookLicense:MITStargazers:527Issues:7Issues:3

LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing

LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.

Language:Jupyter NotebookLicense:MITStargazers:505Issues:14Issues:0

text-clustering

Easily embed, cluster and semantically label text datasets

Language:PythonLicense:Apache-2.0Stargazers:357Issues:35Issues:4

tiny-universe

《大模型白盒子构建指南》:一个全手搓的Tiny-Universe

glake

GLake: optimizing GPU memory management and IO transmission.

Language:C++License:Apache-2.0Stargazers:297Issues:5Issues:17

ipc

[Start here!] Flow-IPC - Modern C++ toolkit for high-speed inter-process communication (IPC)

Language:C++License:Apache-2.0Stargazers:244Issues:5Issues:12

ns3-ai

Enable the interaction between ns-3 and popular frameworks using Python, which mean you can train and test your AI algorithms in ns-3 without changing any frameworks you are using now!

Language:C++License:GPL-2.0Stargazers:205Issues:12Issues:81

Chinese-Resume-in-Typst

使用 Typst 编写的中文简历, 语法简洁, 样式美观, 开箱即用, 可选是否显示照片

cookbook

Deep learning for dummies. All the practical details and useful utilities that go into working with real models.

Language:PythonLicense:Apache-2.0Stargazers:188Issues:8Issues:10

scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Language:PythonLicense:Apache-2.0Stargazers:140Issues:5Issues:11

fms-fsdp

Demonstrate throughput of PyTorch FSDP

Language:PythonLicense:Apache-2.0Stargazers:73Issues:0Issues:0

ccml

simple autodiff library

Language:CStargazers:61Issues:2Issues:0