Yan Yucheng's starred repositories
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
ucasthesis
LaTeX Thesis Template for the University of Chinese Academy of Sciences
starcoder2
Home of StarCoder2!
torch-mlir
The Torch-MLIR project aims to provide first-class support from the PyTorch ecosystem to the MLIR ecosystem.
PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
Wrapper_VideoStation
Synology VideoStation and DLNA FFmpeg wrapper with AAC, DTS, EAC3 and TrueHD support via pipes (now with GStreamer support). It keeps full hardware video transcoding on Synology's FFmpeg and uses SynoCommunity's FFmpeg to transcode DTS, EAC3, TrueHD and AAC audio only when necessary.
buddy-mlir
An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
flash-attention
Fast and memory-efficient exact attention
Triton-Compiler
Materials related to the Triton compiler.
tvm-gdb-commands
Small set of gdb commands for useful tasks in tvm