Beast code in Giters

coderonion's repositories

awesome-yolo-object-detection

🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.

1626 340

🚀🚀🚀 A collection of awesome public projects about Large Language Models (LLM), Vision Language Models (VLM), Vision Language Action (VLA) models, and AI Generated Content (AIGC), along with the related datasets and applications.

772 14 4

awesome-cuda-and-hpc

🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.

398 7 1

Qwen3

Qwen3 is the large language model series developed by the Qwen team, Alibaba Cloud.

Language: Shell 400

awesome-ai4science

This repository lists some awesome public projects about AI4Science.

2 10

LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA etc.🔥

Language: Cuda GPL-3.0 200

LLM-engineering

Language: Cuda 200

ai-infra-hpc

HPC tutorial covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more.

Language: Cuda MIT 100

chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Language: Python Apache-2.0 100

coderonion

1 10

DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Language: Cuda MIT 100

fast.cu

Fastest kernels written from scratch

Language: Cuda MIT 100

flash-attention

Fast and memory-efficient exact attention

Language: Python BSD-3-Clause 100

GenPose2

[ECCV 2024] GenPose++: A generative category-level 6D object pose estimation and tracking approach proposed in Omni6DPose.

MIT 100

KuiperLLama

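A good project for campus recruiting, fall and spring hiring, and internships: build, hands-on and from scratch, an LLM inference framework that supports LLama2/3 and Qwen2.5.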

Language: C++ 100

lite_llama

A lightweight Llama-like LLM inference framework based on Triton kernels.

Language: Python 100

MAYE

Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

100

nano-vllm

Nano vLLM

Language: Python MIT 100

OpenManus

No fortress, purely open ground. OpenManus is Coming.

Language: Python MIT 100

SageAttention

Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.

Language: Cuda Apache-2.0 100