Haian Huang(深度眸)'s repositories
awesome-mm-chat
多模态 MM +Chat 合集
mmdetection
OpenMMLab Detection Toolbox and Benchmark
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
DeepSpeedExamples
Example models using DeepSpeed
VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
Liger-Kernel
Efficient Triton Kernels for LLM Training
LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
lmms-eval
Accelerating the development of large multimodal models (LMMs) with lmms-eval
long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
MHA2MLA
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs
ms-swift
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, InternVL3, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).
ring-flash-attention
Ring attention implementation with flash attention
slime
slime is a LLM post-training framework aiming at scaling RL.
torchgpipe
A GPipe implementation in PyTorch
torchtitan
A native PyTorch Library for large model training
transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
trl
Train transformer language models with reinforcement learning.
verl
verl: Volcano Engine Reinforcement Learning for LLMs