Yang's starred repositories
AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
lm-evaluation-harness
A framework for few-shot evaluation of language models.
matmulfreellm
Implementation for MatMul-free LM.
ThunderKittens
Tile primitives for speedy kernels
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Triton-Puzzles
Puzzles for learning Triton
zero-bubble-pipeline-parallelism
Zero Bubble Pipeline Parallelism
ParrotServe
[OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable
TiledKernel
TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.
sgemm_riscv
This project records the process of optimizing SGEMM (single-precision floating point General Matrix Multiplication) on the riscv platform.
rvv-kernels
RISCV Vector Kernel C/LLVM-IR generator
Ansor-AF-DS
This repository contains the figures, tables and source code in the ICS'24 paper: "Accelerated Auto-Tuning of GPU Kernels for Tensor Computations".