Cheng(Kit) CHEN's repositories
benchmarks
Benchmark code
cuda-samples
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
DeepLearningExamples
Deep Learning Examples
DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Language:PythonApache-2.0000
generative-recommenders
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
Language:PythonApache-2.0000
llama
Inference code for LLaMA models
Language:PythonNOASSERTION000
mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Go, Javascript and more
Language:C++NOASSERTION000
Paddle
PArallel Distributed Deep LEarning
Language:C++Apache-2.0000
tensorflow
An Open Source Machine Learning Framework for Everyone
Language:C++Apache-2.0000