Junru Shao's repositories
Crowd-Track
A Yelp-like website back-ended with MySQL
cutlass_fpA_intB_gemm
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
cutlass
CUDA Templates for Linear Algebra Subroutines
DietCode
DietCode Code Release
dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
FastChat
The release repo for "Vicuna: An Open Chatbot Impressing GPT-4"
LazyVim
Neovim config for the lazy
numpy_dlpack
Example showing how to convert between Numpy and TVM's NDArray without copies.
pytorch-cifar
95.16% on CIFAR10 with PyTorch
relax
Temp repo for prototyping relax(relay next), the effort will be upstreamed. We use the wiki pages on this repo to host design docs.
scikit-build-core
A next generation Python CMake adaptor and Python API for plugins
tensile_rkutils
replacement kernel development tools for tensile.
tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
tophub
tophub autotvm log distro
web-llm
Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.