SparkJiao

followers

following

stars

NTU-NLP & I2R, A*STAR, Singapore

Sinagpore

jiaofangkai.com

Fangkai Jiao's starred repositories

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language:PythonApache-2.025617 171 4136

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonMIT6024 66 150

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonMIT4385 49 284

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language:Jupyter NotebookMIT2426 32 53

EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language:PythonApache-2.02291 42 86

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language:PythonApache-2.02025 21 167

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Language:PythonApache-2.01756 41 282

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language:PythonApache-2.01603 34 256

LLMAgentPapers

Must-read Papers on LLM Agents.

megablocks

Language:PythonApache-2.01115 19 50

open-instruct

Language:PythonApache-2.01079 13 88

dlrover

DLRover: An Automatic Distributed Deep Learning System

Language:PythonNOASSERTION1062 50 217

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language:PythonMIT916 15 34

megablocks-public

Language:PythonApache-2.0857 90

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language:PythonApache-2.0789 8 18

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language:PythonApache-2.0738 8 41

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton

Language:PythonMIT703 19 23

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

EAGLE

Official Implementation of EAGLE

Language:PythonApache-2.0627 12 80

MS-AMP

Microsoft Automatic Mixed Precision Library

Language:PythonMIT479 11 57

lumos

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

Language:PythonMIT423 10 4

CoLLiE

Collaborative Training of Large Language Models in an Efficient Way

Language:PythonApache-2.0392 10 69

Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

MIT383 6 4

llamafia.github

Language:PythonApache-2.0294 21 2

Q-Instruct

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Language:PythonMIT176 2 24

OpenSource-LLMs-better-than-OpenAI

Listing all reported open-source LLMs achieving a higher score than proprietary, paying OpenAI models (ChatGPT, GPT-4).

RLCD

Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment

Language:PythonMIT57 7 3

ShareGPTs

Language:Python26 40

C-VQA

Counterfactual Reasoning VQA Dataset

Language:Python21 3 1

gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Language:PythonApache-2.0500