tiantang5156's starred repositories

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1987Issues:0Issues:0

torchsparse

[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.

Language:CudaLicense:MITStargazers:1159Issues:0Issues:0

pytorchvideo

A deep learning library for video understanding research.

Language:PythonLicense:Apache-2.0Stargazers:3240Issues:0Issues:0

yolo_slowfast

Yolov5+SlowFast: Realtime Action Detection Based on PytorchVideo

Language:PythonStargazers:420Issues:0Issues:0

detectron2

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:29550Issues:0Issues:0

Custom-ava-dataset_Custom-Spatio-Temporally-Action-Video-Dataset

Custom ava dataset, Multi-Person Video Dataset Annotation Method of Spatio-Temporally Actions

Language:PythonStargazers:95Issues:0Issues:0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Language:PythonLicense:Apache-2.0Stargazers:6414Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:22364Issues:0Issues:0

tinyflow

Tutorial code on how to build your own Deep Learning System in 2k Lines

Language:C++License:Apache-2.0Stargazers:2004Issues:0Issues:0

cuda_learning

learning how CUDA works

Language:CudaStargazers:115Issues:0Issues:0

Awesome-LLM-Inference

📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.

License:GPL-3.0Stargazers:2084Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:62701Issues:0Issues:0

CVPR2024-Papers-with-Code

CVPR 2024 论文和开源项目合集

Stargazers:17356Issues:0Issues:0

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:52317Issues:0Issues:0

Programmers-Overseas-Job-Interview-Handbook

🏂🏻 程序员海外工作/英文面试手册

Stargazers:4277Issues:0Issues:0

EmbedRoadAndCSInEU

嵌入式软件之路与欧陆CS留学工作

Stargazers:285Issues:0Issues:0

tinyflow

A simple deep learning framework that supports automatic differentiation and GPU acceleration.

Language:PythonLicense:MITStargazers:54Issues:0Issues:0

axolotl

Go ahead and axolotl questions

Language:PythonLicense:Apache-2.0Stargazers:7085Issues:0Issues:0

triton

Development repository for the Triton language and compiler

Language:C++License:MITStargazers:12115Issues:0Issues:0

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27608Issues:0Issues:0

lora_from_scratch

Implements Low-Rank Adaptation(LoRA) Finetuning from scratch

Language:Jupyter NotebookLicense:MITStargazers:57Issues:0Issues:0

self-llm

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6788Issues:0Issues:0

Tianji

天机是一款专注人情世故的大语言模型系统。您可以利用它进行涉及传统人情世故的任务,如何说好话、如何会来事儿等,以提升您的“情商”和"核心竞争能力"

Language:PythonLicense:Apache-2.0Stargazers:292Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

License:Apache-2.0Stargazers:2Issues:0Issues:0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:40140Issues:0Issues:0

ChatGLM-Finetuning

基于ChatGLM-6B、ChatGLM2-6B、ChatGLM3-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning、全参微调等

Language:PythonStargazers:2586Issues:0Issues:0

CUDA-Learn-Notes

🎉CUDA/C++ 笔记 / 大模型手撕CUDA / 技术博客,更新随缘: flash_attn、sgemm、sgemv、warp reduce、block reduce、dot product、elementwise、softmax、layernorm、rmsnorm、hist etc.

Language:CudaLicense:GPL-3.0Stargazers:928Issues:0Issues:0
Language:HTMLStargazers:43Issues:0Issues:0

Atom

[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

Language:CudaStargazers:229Issues:0Issues:0

web-llm

High-performance In-browser LLM Inference Engine

Language:TypeScriptLicense:Apache-2.0Stargazers:11885Issues:0Issues:0