coderonion's repositories
awesome-yolo-object-detection
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
awesome-llm-and-aigc
🚀🚀🚀 A collection of some awesome public projects about Large Language Model (LLM), Vision Language Model (VLM), Vision Language Action (VLA), AI Generated Content (AIGC), the related datasets and applications.
awesome-cuda-and-hpc
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
awesome-ai4science
This repository lists some awesome public projects about AI4Science.
ai-infra-hpc
HPC tutorials covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more.
flash-attention
Fast and memory-efficient exact attention
KuiperLLama
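A good project for campus recruiting (autumn and spring hiring) and internships: build, hands-on and from scratch, a large-model inference framework that supports LLama2/3 and Qwen2.5.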
lite_llama
A lightweight Llama-like LLM inference framework based on Triton kernels.
SageAttention
Quantized Attention that achieves speedups of 2.1-3.1x and 2.7-5.1x compared to FlashAttention2 and xformers, respectively, without losing end-to-end metrics across various models.
TensorRT-YOLO
🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️
transformer-hyunwoongko
Transformer: PyTorch Implementation of "Attention Is All You Need"
Visual-RFT
Official repository of "Visual-RFT: Visual Reinforcement Fine-Tuning"
VisualThinker-R1-Zero
Explore the Multimodal “Aha Moment” on 2B Model