xiaobochen123

xiaobochen123

Geek Repo

Github PK Tool:Github PK Tool

xiaobochen123's repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sglang

SGLang is yet another fast serving framework for large language models and vision language models.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

License:MITStargazers:0Issues:0Issues:0

onnx

Open standard for machine learning interoperability

Language:PureBasicLicense:MITStargazers:0Issues:0Issues:0
Language:C++Stargazers:1Issues:0Issues:0

cutlass

CUDA Templates for Linear Algebra Subroutines

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

YHs_Sample

Yinghan's Code Sample

License:GPL-3.0Stargazers:0Issues:0Issues:0

CppTemplateTutorial

中文的C++ Template的教学指南。与知名书籍C++ Templates不同,该系列教程将C++ Templates作为一门图灵完备的语言来讲授,以求帮助读者对Meta-Programming融会贯通。(正在施工中)

Stargazers:1Issues:0Issues:0

optimizer

Actively maintained ONNX Optimizer

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0