Aloha Li's starred repositories
flash-attention
Fast and memory-efficient exact attention
composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
recommenders-addons
Additional utils and helpers to extend TensorFlow when build recommendation systems, contributed and maintained by SIG Recommenders.
tensorflow
An Open Source Machine Learning Framework for Everyone
x-deeplearning
An industrial deep learning framework for high-dimension sparse data
diaosj.github.io
Keep writing
benchmark-models
benchmark models for TNN, ncnn, MNN
benchmarks
A benchmark framework for Tensorflow
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
how-to-optimize-gemm
row-major matmul optimization
FaceDetection-DSFD
腾讯优图高精度双分支人脸检测器