Tianqi Chen's starred repositories
open-webui
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
flashinfer
FlashInfer: Kernel Library for LLM Serving
jetson-inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
cutlass_fpA_intB_gemm
A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer
tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
zeno-build
Build, evaluate, understand, and fix LLM-based apps
ChatLLM-Web
🗣️ Chat with LLM like Vicuna totally in your browser with WebGPU, safely, privately, and with no server. Powered by web llm.
open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset