coderonion's repositories

awesome-yolo-object-detection

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

awesome-llm-and-aigc

🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.

awesome-cuda-and-hpc

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Language:ShellStargazers:4Issues:0Issues:0

awesome-ai4science

This repository lists some awesome public projects about AI4Science.

LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Language:CudaLicense:GPL-3.0Stargazers:2Issues:0Issues:0
Language:CudaStargazers:2Issues:0Issues:0

ai-infra-hpc

hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Language:CudaLicense:MITStargazers:1Issues:0Issues:0

chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Language:CudaLicense:MITStargazers:1Issues:0Issues:0

fast.cu

Fastest kernels written from scratch

Language:CudaLicense:MITStargazers:1Issues:0Issues:0

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:0Issues:0

GenPose2

[ECCV 2024] GenPose++: A generative category-level 6D object pose estimation and tracking approach proposed in Omni6DPose.

License:MITStargazers:1Issues:0Issues:0

KuiperLLama

校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。

Language:C++Stargazers:1Issues:0Issues:0

lite_llama

A light llama-like llm inference framework based on the triton kernel.

Language:PythonStargazers:1Issues:0Issues:0

MAYE

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Stargazers:1Issues:0Issues:0

nano-vllm

Nano vLLM

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

OpenManus

No fortress, purely open ground. OpenManus is Coming.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

SageAttention

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Language:CudaLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:CudaLicense:MITStargazers:1Issues:0Issues:0

TensorRT-YOLO

🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️

Language:C++License:GPL-3.0Stargazers:1Issues:0Issues:0

tilelang

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Language:C++License:MITStargazers:1Issues:0Issues:0

transformer-hyunwoongko

Transformer: PyTorch Implementation of "Attention Is All You Need"

Language:PythonStargazers:1Issues:0Issues:0
Language:PythonStargazers:1Issues:0Issues:0

Video-R1

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Language:PythonStargazers:1Issues:0Issues:0

Visual-RFT

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

VisualThinker-R1-Zero

Explore the Multimodal “Aha Moment” on 2B Model

Language:PythonStargazers:1Issues:0Issues:0

VLM-R1

Solve Visual Understanding with Reinforced VLMs

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

yoloe

YOLOE: Real-Time Seeing Anything

Language:PythonLicense:AGPL-3.0Stargazers:1Issues:0Issues:0