Mr-Nineteen

Location: Shanghai

Mr-Nineteen's starred repositories

RecSysPapers

A collection of classic and cutting-edge industry papers on recommendation, advertising, and search.

Language: Python | License: BSD-2-Clause | Stargazers: 1268 | Issues: 0

code-samples

Source code examples from the Parallel Forall Blog

Language: HTML | License: BSD-3-Clause | Stargazers: 1230 | Issues: 0

generative-recommenders

Repository hosting code used to reproduce results in "Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations" (https://arxiv.org/abs/2402.17152).

Language: Python | License: Apache-2.0 | Stargazers: 705 | Issues: 0

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language: Python | License: MIT | Stargazers: 6710 | Issues: 0

iTransformer

Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah

Language: Python | License: MIT | Stargazers: 1222 | Issues: 0

parallel-hashmap

A family of header-only, very fast and memory-friendly hashmap and btree containers.

Language: C++ | License: Apache-2.0 | Stargazers: 2517 | Issues: 0

gpu-sum-reduction

CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.

Language: Cuda | Stargazers: 35 | Issues: 0

how-to-optim-algorithm-in-cuda

How to optimize some algorithms in CUDA.

Language: Cuda | Stargazers: 1521 | Issues: 0

sampleQAT

Inference of quantization aware trained networks using TensorRT

Language: Python | License: Apache-2.0 | Stargazers: 77 | Issues: 0

CUDALibrarySamples

CUDA Library Samples

Language: Cuda | License: NOASSERTION | Stargazers: 1578 | Issues: 0

tensorrtx

Implementation of popular deep learning networks with TensorRT network definition API

Language: C++ | License: MIT | Stargazers: 6939 | Issues: 0

onnx-simplifier

Simplify your ONNX model

Language: C++ | License: Apache-2.0 | Stargazers: 3826 | Issues: 0

CUDA-Programming-Guide-in-Chinese

This is a Chinese translation of the CUDA programming guide

Stargazers: 1224 | Issues: 0

llama

Inference code for Llama models

Language: Python | License: NOASSERTION | Stargazers: 56090 | Issues: 0

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13828 | Issues: 0

trt-samples-for-hackathon-cn

Simple samples for TensorRT programming

Language: Python | License: Apache-2.0 | Stargazers: 1495 | Issues: 0

alpaca-rlhf

Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat

Language: Python | License: MIT | Stargazers: 106 | Issues: 0

optimate

A collection of libraries to optimise AI model performance

Language: Python | License: Apache-2.0 | Stargazers: 8379 | Issues: 0

Linly

Chinese-LLaMA 1&2 and Chinese-Falcon base models; ChatFlow Chinese dialogue model; Chinese OpenLLaMA model; NLP pre-training and instruction fine-tuning datasets

Language: Python | Stargazers: 3026 | Issues: 0

Fengshenbang-LM

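Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.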

Language: Python | License: Apache-2.0 | Stargazers: 4011 | Issues: 0

stable-diffusion-webui

Stable Diffusion web UI

Language: Python | License: AGPL-3.0 | Stargazers: 141492 | Issues: 0

onnxruntime-training-examples

Examples for using ONNX Runtime for model training.

Language: C# | License: MIT | Stargazers: 309 | Issues: 0

onnx-tensorrt

ONNX-TensorRT: TensorRT backend for ONNX

Language: C++ | License: Apache-2.0 | Stargazers: 2934 | Issues: 0

custom-op

Guide for building custom op for TensorFlow

Language: Smarty | License: Apache-2.0 | Stargazers: 378 | Issues: 0

io

Dataset, streaming, and file system extensions maintained by TensorFlow SIG-IO

Language: C++ | License: Apache-2.0 | Stargazers: 704 | Issues: 0

HierarchicalKV

HierarchicalKV is part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. Its key capability is storing key-value feature embeddings in the high-bandwidth memory (HBM) of GPUs and in host memory. It can also be used as generic key-value storage.

Language: Cuda | License: Apache-2.0 | Stargazers: 130 | Issues: 0

libcuckoo

A high-performance, concurrent hash table

Language: C++ | License: NOASSERTION | Stargazers: 1599 | Issues: 0

embedx

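embedx is a fully in-house, C++-based distributed embedding training and inference framework. It currently supports graph models, deep ranking models, recall models, and joint training of graph-and-ranking and graph-and-recall models, among others.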

Language: C++ | License: NOASSERTION | Stargazers: 299 | Issues: 0

libcds

A C++ library of Concurrent Data Structures

Language: C++ | License: BSL-1.0 | Stargazers: 2562 | Issues: 0