HanHui's starred repositories
AI-RecommenderSystem
该仓库尝试整理推荐系统领域的一些经典算法模型
text-embeddings-inference
A blazing fast inference solution for text embeddings models
MatrixSlow
A simple deep learning framework in pure python for purpose of learning in DL
annotated-transformer
An annotated implementation of the Transformer paper.
CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
Recommender-System
推荐系统综述
how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
The-Art-of-Linear-Algebra-zh-CN
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone", 线性代数的艺术中文版, 欢迎PR.
cherry-markdown
✨ A Markdown Editor
text-generation-inference
Large Language Model Text Generation Inference