linxiaobo's repositories
Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
GPL-3.0000
bhook
Baidu Hook
Language:C++000
cutlass
CUDA Templates for Linear Algebra Subroutines
Language:C++NOASSERTION000
iree
A retargetable MLIR-based machine learning compiler and runtime toolkit.
Language:C++Apache-2.0000
json
JSON for Modern C++
Language:C++MIT000
llm-export
llm-export can export llm model to onnx.
Language:PythonApache-2.0000
timeflies
Compute the time of Model
Language:Python000
xbyak
a JIT assembler for x86(IA-32)/x64(AMD64, x86-64) MMX/SSE/SSE2/SSE3/SSSE3/SSE4/FPU/AVX/AVX2/AVX-512 by C++ header
Language:C++BSD-3-Clause000