Mr-Nineteen's starred repositories
RecSysPapers
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
code-samples
Source code examples from the Parallel Forall Blog
generative-recommenders
Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).
Time-Series-Library
A Library for Advanced Deep Time Series Models.
iTransformer
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
parallel-hashmap
A family of header-only, very fast and memory-friendly hashmap and btree containers.
gpu-sum-reduction
CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.
how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
CUDALibrarySamples
CUDA Library Samples
onnx-simplifier
Simplify your onnx model
CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
flash-attention
Fast and memory-efficient exact attention
trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
stable-diffusion-webui
Stable Diffusion web UI
onnxruntime-training-examples
Examples for using ONNX Runtime for model training.
onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
HierarchicalKV
HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of HierarchicalKV is to store key-value feature-embeddings on high-bandwidth memory (HBM) of GPUs and in host memory. It also can be used as a generic key-value storage.