hhaAndroid

User data from GitHub: https://github.com/hhaAndroid


NUAA

Shanghai

Haian Huang (深度眸)'s repositories

awesome-mm-chat

A collection of multimodal (MM) + Chat resources

Language: Python | 277 9 2

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language: Jupyter Notebook | License: Apache-2.0 | 27 0 0

xtuner

XTuner is a toolkit for efficiently fine-tuning LLMs

Language: Python | License: Apache-2.0 | 5 0 0

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook | License: Apache-2.0 | 2 0 0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | 1 0 0

DeepSpeedExamples

Example models using DeepSpeed

Language: Python | License: Apache-2.0 | 1 0 0

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | 1 0 0

ReAlign

Reformatted Alignment

Language: JavaScript | 1 0 0

VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting ~100 VLMs and 40+ benchmarks

Language: Python | License: Apache-2.0 | 1 0 0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.

Language: Python | License: MIT | 0 0 0

Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

Language: Python | License: MIT | 0 0 0

Liger-Kernel

Efficient Triton Kernels for LLM Training

Language: Python | License: BSD-2-Clause | 0 0 0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | 0 0 0

LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)

Language: Python | License: Apache-2.0 | 0 0 0

LLaVA

Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.

Language: Python | License: Apache-2.0 | 0 0 0

llava-phi

Language: Python | 0 0 0

LLMs-from-scratch

Implementing a ChatGPT-like LLM from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | 0 0 0

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language: Python | 0 0 0

long-context-attention

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Language: Python | License: Apache-2.0 | 0 0 0

lvlm-interpret

Language: Python | License: Apache-2.0 | 0 0 0

MHA2MLA

Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs

License: Apache-2.0 | 0 0 0

ml-ferret

Language: Python | License: NOASSERTION | 0 0 0

ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, InternVL3, Ovis2.5, Llava, GLM4v, Phi4, ...) (AAAI 2025).

License: Apache-2.0 | 0 0 0

ring-flash-attention

Ring attention implementation with flash attention

Language: Python | 0 0 0

slime

slime is an LLM post-training framework aimed at scaling RL.

Language: Python | License: Apache-2.0 | 0 0 0

torchgpipe

A GPipe implementation in PyTorch

Language: Python | License: BSD-3-Clause | 0 0 0

torchtitan

A native PyTorch Library for large model training

Language: Python | License: BSD-3-Clause | 0 0 0

transformers

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Language: Python | License: Apache-2.0 | 0 0 0

trl

Train transformer language models with reinforcement learning.

Language: Python | License: Apache-2.0 | 0 0 0

verl

verl: Volcano Engine Reinforcement Learning for LLMs

License: Apache-2.0 | 0 0 0