Ma Mingfei's repositories
op_bench-py
performance benchmark for pytorch operators
convnet-benchmark-py
PyTorch convnet performance benchmark
detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
pytorch_block_sparse
Fast Block Sparse Matrices for Pytorch
gen-efficientnet-pytorch
Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS
lite-transformer
[ICLR 2020] Lite Transformer with Long-Short Range Attention
whisper.cpp
Port of OpenAI's Whisper model in C/C++
bench_sdpa
smoke test and benchmark for sdpa
ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
cpuinfo
CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS)
extension-cpp
C++ extensions in PyTorch
gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
Informer2020
The GitHub repository for the paper "Informer" accepted by AAAI 2021.
llama.cpp
Port of Facebook's LLaMA model in C/C++
llm.c
LLM training in simple, raw C/CUDA
pytorch_geometric
Graph Neural Network Library for PyTorch
SqueezeLLM
SqueezeLLM: Dense-and-Sparse Quantization
tutorials
PyTorch tutorials.