Fangkai Jiao (SparkJiao)

Company: NTU-NLP & I2R, A*STAR, Singapore

Location: Singapore

Home Page: jiaofangkai.com

Fangkai Jiao's starred repositories

dspy

DSPy: The framework for programming—not prompting—foundation models

Language: Python | License: MIT | Stargazers: 11803 | Issues: 114 | Issues: 474

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5215 | Issues: 59 | Issues: 86

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language: Python | License: Apache-2.0 | Stargazers: 5070 | Issues: 39 | Issues: 34

OLMo

Modeling, training, eval, and inference code for OLMo

Language: Python | License: Apache-2.0 | Stargazers: 4079 | Issues: 41 | Issues: 153

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 3726 | Issues: 45 | Issues: 228

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by the Qwen team at Alibaba Cloud.

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language: Jupyter Notebook | License: MIT | Stargazers: 2396 | Issues: 29 | Issues: 50

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python | License: Apache-2.0 | Stargazers: 1899 | Issues: 19 | Issues: 161

dlrover

DLRover: An Automatic Distributed Deep Learning System

Language: Python | License: NOASSERTION | Stargazers: 990 | Issues: 49 | Issues: 210

nanotron

Minimalistic large language model 3D-parallelism training

Language: Python | License: Apache-2.0 | Stargazers: 824 | Issues: 40 | Issues: 54

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 747 | Issues: 8 | Issues: 18

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language: Python | License: Apache-2.0 | Stargazers: 721 | Issues: 8 | Issues: 41

EAGLE

[ICML'24] EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty

Language: Python | License: Apache-2.0 | Stargazers: 524 | Issues: 10 | Issues: 61

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 520 | Issues: 18 | Issues: 14
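The repository ships optimized Triton kernels; purely as an illustration of the underlying linear-attention identity (the O(n²) softmax score matrix replaced by a positive feature map, here elu(x)+1 as in Katharopoulos et al.), a minimal NumPy sketch might look like:

```python
import numpy as np

def linear_attention(Q, K, V, eps=1e-6):
    """O(n*d*d_v) attention: phi(Q) @ (phi(K)^T V) instead of an n x n score matrix."""
    phi = lambda x: np.where(x > 0, x + 1.0, np.exp(x))  # elu(x) + 1, always positive
    Qf, Kf = phi(Q), phi(K)
    KV = Kf.T @ V                      # (d, d_v) summary of all keys/values
    Z = Qf @ Kf.sum(axis=0) + eps      # per-query normalizer (row sums of Qf Kf^T)
    return (Qf @ KV) / Z[:, None]

rng = np.random.default_rng(0)
Q, K, V = rng.normal(size=(3, 8, 4))   # hypothetical shapes: 8 tokens, dim 4
out = linear_attention(Q, K, V)        # (8, 4)
```

Because phi is positive, each output row is a convex combination of value rows, analogous to softmax attention; this associativity trick is what the repo's kernels implement efficiently (with chunking and causal masking, omitted here).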

lumos

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

Language: Python | License: MIT | Stargazers: 416 | Issues: 9 | Issues: 4

CoLLiE

Collaborative Training of Large Language Models in an Efficient Way

Language: Python | License: Apache-2.0 | Stargazers: 390 | Issues: 10 | Issues: 69

Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language: Python | License: Apache-2.0 | Stargazers: 222 | Issues: 8 | Issues: 12

zero-bubble-pipeline-parallelism

Zero Bubble Pipeline Parallelism

Language: Python | License: NOASSERTION | Stargazers: 196 | Issues: 5 | Issues: 12

HallusionBench

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

Language: Python | License: BSD-3-Clause | Stargazers: 183 | Issues: 4 | Issues: 10

Q-Instruct

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Language: Python | License: MIT | Stargazers: 162 | Issues: 2 | Issues: 22

RLCD

Reproduction of "RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment"

Language: Python | License: MIT | Stargazers: 54 | Issues: 5 | Issues: 3

SimulateBench

GPT as Human

Language: Python | Stargazers: 18 | Issues: 0 | Issues: 0

SeaEval

NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning

Language: Python | License: NOASSERTION | Stargazers: 12 | Issues: 0 | Issues: 1

dpo-trajectory-reasoning

Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".

Language: Python | Stargazers: 11 | Issues: 2 | Issues: 0

RLMEC

The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"

Language: Python | Stargazers: 6 | Issues: 0 | Issues: 0