Yan Yucheng's starred repositories
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ucasthesis
LaTeX Thesis Template for the University of Chinese Academy of Sciences
starcoder2
Home of StarCoder2!
torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
buddy-mlir
An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
flash-attention
Fast and memory-efficient exact attention
Triton-Compiler
Triton Compiler related materials.
tvm-gdb-commands
Small set of gdb commands for useful tasks in tvm