Zihao Ye's repositories
flashinfer-ai.github.io
Project website of FlashInfer project
Language:HTML000
metal-benchmarks
Apple GPU microarchitecture
MIT000
mlx
MLX: An array framework for Apple silicon
MIT000
nccl
Optimized primitives for collective multi-GPU communication
Language:C++NOASSERTION000
relax-sparse
Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.
smoothquant
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
uwsampl.github.io
The UW SAMPL group's website.
Language:HTMLNOASSERTION000
web-stable-diffusion
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.