Fangkai Jiao (SparkJiao)

Company: NTU-NLP & I2R, A*STAR, Singapore

Location: Singapore

Home Page: jiaofangkai.com

Fangkai Jiao's starred repositories

LLaMA-Factory

Unify Efficient Fine-Tuning of 100+ LLMs

Language: Python | License: Apache-2.0 | Stargazers: 25617 | Issues: 171 | Issues: 4136

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language: Python | License: MIT | Stargazers: 6024 | Issues: 66 | Issues: 150

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language: Python | License: MIT | Stargazers: 4385 | Issues: 49 | Issues: 284

Pearl

A Production-ready Reinforcement Learning AI Agent Library brought by the Applied Reinforcement Learning team at Meta.

Language: Jupyter Notebook | License: MIT | Stargazers: 2426 | Issues: 32 | Issues: 53

EasyLM

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Language: Python | License: Apache-2.0 | Stargazers: 2291 | Issues: 42 | Issues: 86

lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Language: Python | License: Apache-2.0 | Stargazers: 2025 | Issues: 21 | Issues: 167

DeepSpeed-MII

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Language: Python | License: Apache-2.0 | Stargazers: 1756 | Issues: 41 | Issues: 282

TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

Language: Python | License: Apache-2.0 | Stargazers: 1603 | Issues: 34 | Issues: 256

LLMAgentPapers

Must-read Papers on LLM Agents.

Language: Python | License: Apache-2.0 | Stargazers: 1115 | Issues: 19 | Issues: 50
Language: Python | License: Apache-2.0 | Stargazers: 1079 | Issues: 13 | Issues: 88

dlrover

DLRover: An Automatic Distributed Deep Learning System

Language: Python | License: NOASSERTION | Stargazers: 1062 | Issues: 50 | Issues: 217

DeepSeek-MoE

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Language: Python | License: MIT | Stargazers: 916 | Issues: 15 | Issues: 34
Language: Python | License: Apache-2.0 | Stargazers: 857 | Issues: 9 | Issues: 0

llama-moe

⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training

Language: Python | License: Apache-2.0 | Stargazers: 789 | Issues: 8 | Issues: 18

alpaca_farm

A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.

Language: Python | License: Apache-2.0 | Stargazers: 738 | Issues: 8 | Issues: 41

flash-linear-attention

Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton

Language: Python | License: MIT | Stargazers: 703 | Issues: 19 | Issues: 23

Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

EAGLE

Official Implementation of EAGLE

Language: Python | License: Apache-2.0 | Stargazers: 627 | Issues: 12 | Issues: 80

MS-AMP

Microsoft Automatic Mixed Precision Library

Language: Python | License: MIT | Stargazers: 479 | Issues: 11 | Issues: 57

lumos

Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"

Language: Python | License: MIT | Stargazers: 423 | Issues: 10 | Issues: 4

CoLLiE

Collaborative Training of Large Language Models in an Efficient Way

Language: Python | License: Apache-2.0 | Stargazers: 392 | Issues: 10 | Issues: 69

Awesome-Reasoning-Foundation-Models

✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models

Language: Python | License: Apache-2.0 | Stargazers: 294 | Issues: 21 | Issues: 2

Q-Instruct

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Language: Python | License: MIT | Stargazers: 176 | Issues: 2 | Issues: 24

OpenSource-LLMs-better-than-OpenAI

Listing all reported open-source LLMs achieving a higher score than proprietary, paying OpenAI models (ChatGPT, GPT-4).

RLCD

Reproduction of "RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment"

Language: Python | License: MIT | Stargazers: 57 | Issues: 7 | Issues: 3
Language: Python | Stargazers: 26 | Issues: 4 | Issues: 0

C-VQA

Counterfactual Reasoning VQA Dataset

gpt_paper_assistant

GPT4 based personalized ArXiv paper assistant bot

Language: Python | License: Apache-2.0 | Stargazers: 5 | Issues: 0 | Issues: 0