dream's repositories
HugeCTR
HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models
Paddle
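PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (the PaddlePaddle core framework: high-performance single-machine and distributed training and cross-platform deployment for deep learning and machine learning)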
tensorflow
An Open Source Machine Learning Framework for Everyone
abseil-cpp
Abseil Common Libraries (C++)
asyncplusplus
Async++ concurrency framework for C++11
ByteTransformer
Optimized BERT transformer inference on NVIDIA GPUs. https://arxiv.org/abs/2210.03052
Clustered-Embedding-Learning
Code for the paper "Clustered Embedding Learning for Recommender Systems"
cub
Cooperative primitives for CUDA C++.
CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
docs
Documentation for PaddlePaddle
fairscale
PyTorch extensions for high performance and large scale training.
FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
LargeBatchCTR
Large batch training of CTR models based on DeepCTR with CowClip.
MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch
Megatron-LM
Ongoing research training transformer models at scale
mini-lsm
A tutorial of building an LSM-Tree storage engine in a week!
nccl
Optimized primitives for collective multi-GPU communication
nccl-tests
NCCL Tests
nvidia_tensorflow
An Open Source Machine Learning Framework for Everyone
OptEmbed
This repository contains the PyTorch implementation of the CIKM 2022 research-track paper "OptEmbed: Learning Optimal Embedding Table for Click-through Rate Prediction".
PaddleRec
Recommendation Algorithm: a large-scale recommendation algorithm library containing classic and state-of-the-art recommender system algorithms, including LR, Wide&Deep, DSSM, TDM, MIND, Word2Vec, Bert4Rec, DeepWalk, SSR, AITM, DSIN, SIGN, IPREC, GRU4Rec, Youtube_dnn, NCF, GNN, FM, FFM, DeepFM, DCN, DIN, DIEN, DLRM, MMOE, PLE, ESMM, ESCMM, MAML, xDeepFM, DeepFEFM, NFM, AFM, RALM, DMR, GateNet, NAML, DIFM, Deep Crossing, PNN, BST, AutoInt, FGCNN, FLEN, Fibinet, ListWise, DeepRec, ENSFM, TiSAS, AutoFIS, and more.
PaddleTest
PaddlePaddle TestSuite
recommenders-addons
Additional utils and helpers to extend TensorFlow when building recommendation systems, contributed and maintained by SIG Recommenders.
triton
Development repository for the Triton language and compiler
xla
A machine learning compiler for GPUs, CPUs, and ML accelerators