Zihao Ye (yzh119)

Company: @uwsampl

Location: Seattle, WA

Home Page: https://homes.cs.washington.edu/~zhye/


Organizations
apache
dmlc
mlc-ai
uwsampa
uwsampl

Zihao Ye's repositories

bibfetch

Fetch bibtex entries from academic search engines like dblp.

Language: Python, License: GPL-3.0, Stargazers: 3, Issues: 2, Issues: 0

mirage

A multi-level tensor algebra superoptimizer

License: Apache-2.0, Stargazers: 2, Issues: 0, Issues: 0

punica

Serving multiple LoRA finetuned LLM as one

Language: Python, License: Apache-2.0, Stargazers: 2, Issues: 1, Issues: 0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 1, Issues: 0

relax

Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.

Language: Python, License: Apache-2.0, Stargazers: 1, Issues: 1, Issues: 0
Language: Cuda, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

envd

🏕️ Reproducible development environment for AI/ML

Language: Go, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

flashinfer-ai.github.io

Project website of FlashInfer project

Language: HTML, Stargazers: 0, Issues: 0, Issues: 0
Language: Cuda, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0
Language: Shell, Stargazers: 0, Issues: 1, Issues: 0

Magicube

Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

Language: C++, License: GPL-3.0, Stargazers: 0, Issues: 1, Issues: 0

metal-benchmarks

Apple GPU microarchitecture

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

mlx

MLX: An array framework for Apple silicon

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

mogan

Mogan Editor / 墨干编辑器

Language: Tcl, License: GPL-3.0, Stargazers: 0, Issues: 1, Issues: 0

nccl

Optimized primitives for collective multi-GPU communication

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

nnfusion

A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.

Language: C++, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

relax-sparse

Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

smoothquant

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language: Python, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

sputnik

A library of GPU kernels for sparse matrix operations.

Language: C++, License: Apache-2.0, Stargazers: 0, Issues: 2, Issues: 0

taco

The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs

Language: C++, License: NOASSERTION, Stargazers: 0, Issues: 3, Issues: 0
Language: Groovy, License: Apache-2.0, Stargazers: 0, Issues: 2, Issues: 0

triton

Development repository for the Triton language and compiler

Language: C++, License: MIT, Stargazers: 0, Issues: 1, Issues: 0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 4, Issues: 0

tvm-rfcs

A home for the final text of all TVM RFCs.

License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0
Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

uwsampl.github.io

The UW SAMPL group's website.

Language: HTML, License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0
Stargazers: 0, Issues: 2, Issues: 0

web-llm

Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0

web-stable-diffusion

Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 0, Issues: 1, Issues: 0