LiuXinyu's starred repositories
system-design-questions
Problem statements on System Design and Software Architecture as part of Arpit's System Design Masterclass
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
wordcloud2.js
Tag cloud/Wordle presentation on 2D canvas or HTML
commandline
The best C# command line parser that brings standardized *nix getopt style, for .NET. Includes F# support
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
cppbestpractices
Collaborative Collection of C++ Best Practices. This online resource is part of Jason Turner's collection of C++ Best Practices resources. See README.md for more information.
FasterTransformer
Transformer related optimization, including BERT, GPT
trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
hack-SysML
The road to hack SysML and become an system expert
onnxruntime-extensions
onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime
flatbuffers
FlatBuffers: Memory Efficient Serialization Library
Awesome-Model-Quantization
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (papers, repositories) that are missed by the repo.