YP's starred repositories
LLM-PowerHouse-A-Curated-Guide-for-Large-Language-Models-with-Custom-Training-and-Inferencing
LLM-PowerHouse: Unleash LLMs' potential through curated tutorials, best practices, and ready-to-use code for custom training and inferencing.
ThunderKittens
Tile primitives for speedy kernels
tiny-universe
"A White-Box Guide to Building Large Models": a fully hand-built Tiny-Universe
torchtitan
A native PyTorch Library for large model training
llm-reasoners
A library for advanced large language model reasoning
Chinese-Resume-in-Typst
A Chinese résumé written in Typst: concise syntax, clean styling, ready to use out of the box, with an optional photo
scattermoe
Triton-based implementation of Sparse Mixture of Experts.
how-to-optim-algorithm-in-cuda
How to optimize various algorithms in CUDA.
text-clustering
Easily embed, cluster and semantically label text datasets
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
so-large-lm
Fundamentals of large models: understand the basics of large models in one article
How_to_optimize_in_GPU
A series of GPU optimization topics explaining in detail how to optimize CUDA kernels. It covers several basic kernel optimizations, including elementwise, reduce, SGEMV, and SGEMM; the performance of these kernels is at or near the theoretical limit.
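As an illustration of the kind of elementwise optimization such a series typically covers, here is a minimal sketch (not taken from the repo; kernel names are illustrative) comparing a naive one-element-per-thread kernel with a float4-vectorized variant that reduces the number of memory transactions:

```cuda
#include <cuda_runtime.h>

// Naive elementwise add: one float per thread.
__global__ void add_naive(const float* a, const float* b, float* c, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) c[i] = a[i] + b[i];
}

// Vectorized variant: each thread loads/stores a float4, so the kernel
// issues one 128-bit transaction where the naive version issues four
// 32-bit ones (assumes n is a multiple of 4 and pointers are aligned).
__global__ void add_vec4(const float4* a, const float4* b, float4* c, int n4) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n4) {
        float4 x = a[i], y = b[i];
        c[i] = make_float4(x.x + y.x, x.y + y.y, x.z + y.z, x.w + y.w);
    }
}
```

For memory-bound kernels like elementwise add, this style of vectorized access is often enough to approach the device's theoretical memory bandwidth.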