chengduo's repositories
BabelStream
STREAM, for lots of devices written in many programming models
concurrentqueue
A fast multi-producer, multi-consumer lock-free concurrent queue for C++11
cpp-subprocess
popen()-like C++ library with iostream support for stdio forwarding
CTranslate2
Optimized inference engine for OpenNMT models
DeepLearningFrameworks
Demo of running NNs across different frameworks
dlrm
An implementation of a deep learning recommendation model (DLRM)
grpc-go-pool
gRPC connection pool
learn-go-with-tests
Learn Go with test-driven development
LeetCodeAnimation
Demonstrates the ideas behind solving LeetCode problems in the form of animations.
llama.cpp
LLM inference in C/C++
llama2.c
Inference Llama 2 in one file of pure C
llm.c
LLM training in simple, raw C/CUDA
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT
models
Model configurations
oneflow
OneFlow is a performance-centered and open-source deep learning framework.
Paddle
PArallel Distributed Deep LEarning
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
pytorch-OpCounter
Count the MACs / FLOPs of your PyTorch model.
TurboTransformers
A fast and user-friendly tool for transformer inference on CPU and GPU
x-deeplearning
An industrial deep learning framework for high-dimension sparse data