Xiao's repositories
awesome-AI-system
Papers and their code for AI systems
raft-thesis-zh_cn
Chinese translation of the Raft doctoral dissertation
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FasterTransformer
Transformer-related optimization, including BERT and GPT
FlexFlow
A distributed deep learning framework that supports flexible parallelization strategies.
TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
6.S060-labs
Programming labs for 6.S060 (Foundations of Computer Security).
column_store_db
A column-store database for big-data storage systems.
DeepLearning-MuLi-Notes
Notes on the Dive into Deep Learning course by Mu Li
DeepLearning_LHY21_Notes
Study notes for Hung-yi Lee's 2021 Deep Learning course
hangzhou_mountain
A collection of hiking maps for Hangzhou
mlc-llm
Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
SE124-CSE-2021-Notes
Notes for Computer System Engineering (SE124), a course at the School of Software, Shanghai Jiao Tong University
tvm_mlir_learn
TVM learning notes