chengduoZH

chengduo's repositories

BabelStream

STREAM, for lots of devices written in many programming models

Language: C++ | License: NOASSERTION | 020

benchmark-1

Language: Python | 000

book

Deep Learning 101 with PaddlePaddle

Language: HTML | 020

concurrentqueue

A fast multi-producer, multi-consumer lock-free concurrent queue for C++11

Language: C++ | License: NOASSERTION | 000

cpp-subprocess

popen()-like C++ library with iostream support for stdio forwarding

Language: M4 | License: MIT | 020

CTranslate2

Optimized inference engine for OpenNMT models

Language: C++ | License: MIT | 000

DeepLearningFrameworks

Demo of running NNs across different frameworks

Language: Jupyter Notebook | 020

dlrm

An implementation of a deep learning recommendation model (DLRM)

License: MIT | 000

FluidDoc

Documentations for PaddlePaddle

Language: Shell | 020

grpc-go-pool

grpc connection pool

Language: Go | License: MIT | 020

LARK

LAnguage Representations Kit

Language: Python | License: Apache-2.0 | 020

learn-go-with-tests

Learn Go with test-driven development

Language: Go | License: MIT | 020

LeetCodeAnimation

Demonstrate all the questions on LeetCode in the form of animation. (Presents the approach to solving LeetCode problems in animated form.)

Language: Java | 020

librime

Rime Input Method Engine, the core library

Language: C++ | License: BSD-3-Clause | 020

llama.cpp

LLM inference in C/C++

License: MIT | 000

llama2.c

Inference Llama 2 in one file of pure C

License: MIT | 000

llm.c

LLM training in simple, raw C/CUDA

License: MIT | 000

marian

Fast Neural Machine Translation in C++

Language: C++ | License: NOASSERTION | 020

Megatron-LM

Ongoing research training transformer language models at scale, including: BERT

Language: Python | License: NOASSERTION | 020

models

Model configurations

Language: Python | License: Apache-2.0 | 000

oneflow

OneFlow is a performance-centered and open-source deep learning framework.

License: Apache-2.0 | 000

Paddle

PArallel Distributed Deep LEarning

Language: C++ | License: Apache-2.0 | 000

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language: C++ | License: NOASSERTION | 020

pytorch-CycleGAN-and-pix2pix

Image-to-Image Translation in PyTorch

Language: Python | License: NOASSERTION | 020

pytorch-OpCounter

Count the MACs / FLOPs of your PyTorch model.

Language: Python | License: MIT | 020

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language: Python | License: Apache-2.0 | 020

TensorRT

TensorRT is a C++ library for high performance inference on NVIDIA GPUs and deep learning accelerators.

Language: C++ | License: Apache-2.0 | 020

TurboTransformers

a fast and user-friendly tool for transformer inference on CPU and GPU

Language: C++ | License: NOASSERTION | 020

vearch

A distributed system for efficient similarity search of embedding vectors

Language: Go | License: NOASSERTION | 020

x-deeplearning

An industrial deep learning framework for high-dimension sparse data

Language: PureBasic | License: Apache-2.0 | 020