peft

There are 2 repositories under the peft topic.

hiyouga / LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
agent ai chatglm fine-tuning gpt instruction-tuning language-model large-language-models llama llama3 llm lora mistral moe peft qlora quantization qwen rlhf transformers
Language:Python 34716
yangjianxin1 / Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
gpt alpaca llm baichuan llama lora qlora peft llama2 internlm chatglm qwen aquila mistral mixtral zephyr minicpm gemma llama3 qwen2
Language:Python 5874
modelscope / ms-swift
Use PEFT or Full-parameter to finetune 400+ LLMs or 100+ MLLMs. (LLM: Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, Gemma2, ...; MLLM: Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL2, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, Yi-VL, DeepSeek-VL, Phi3.5-Vision, ...)
agent deploy dpo internvl liger llama llama3 llava llm lora megatron minicpm-v modelscope multimodal peft pre-training qwen2 qwen2-vl reflection sft
Language:Python 4322
InternLM / xtuner
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
agent baichuan chatbot chatglm2 chatglm3 conversational-ai internlm large-language-models llama2 llama3 llava llm llm-training mixtral msagent peft phi3 qwen supervised-finetuning
Language:Python 3983
mymusise / ChatGLM-Tuning
A fine-tuning approach based on ChatGLM-6B + LoRA
chatglm chatgpt lora peft
Language:Python 3742
hiyouga / ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
alpaca chatglm chatglm2 chatgpt fine-tuning huggingface language-model lora peft pytorch qlora rlhf transformers
Language:Python 3671
stochasticai / xTuring
Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our discord community: https://discord.gg/TgHXuSJEk6
adapter alpaca deep-learning fine-tuning finetuning gen-ai generative-ai gpt-2 gpt-j language-model llama llm lora mistral mixed-precision peft quantization
Language:Python 2615
ashishpatel26 / LLM-Finetuning
LLM Finetuning with peft
falcon fine-tuning huggingface llama llama2 llm llms lora peft pytorch text-generation
Language:Jupyter Notebook 2169
zyds / transformers-code
A hands-on, step-by-step Huggingface Transformers course; the course videos are updated in sync on Bilibili and YouTube
huggingface peft transformers
Language:Jupyter Notebook 2095
lxe / simple-llm-finetuner
Simple UI for LLM Model Finetuning
ai gpt-2 gpt-3 huggingface huggingface-transformers llama llm peft pytorch
Language:Jupyter Notebook 2045
X-LANCE / SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Language:Python 588
zetavg / LLaMA-LoRA-Tuner
UI tool for fine-tuning and testing your own LoRA models based on LLaMA, GPT-J and more. One-click run on Google Colab. + A Gradio ChatGPT-like Chat UI to demonstrate your language models.
ai alpaca alpaca-lora google-colab gpt gpt-j language-model llama lora machine-learning peft
Language:Python 444
Guitaricet / relora
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
deep-learning distributed-training llama nlp peft transformer
Language:Jupyter Notebook 435
mindspore-courses / step_into_llm
MindSpore online courses: Step into LLM
llm natural-language-processing nlp large-language-models mindspore bert chatgpt codegeex gpt gpt2 instruction-tuning parallel-computing prompt-tuning rlhf chatglm chatglm2 llama llama2 moe peft
Language:Jupyter Notebook 431
Joyce94 / LLM-RLHF-Tuning
LLM Tuning with PEFT (SFT+RM+PPO+DPO with LoRA)
fine-tuning language-model llama llm lora peft ppo reinforcement-learning rlhf
Language:Python 373
TUDB-Labs / mLoRA
An Efficient "Factory" to Build Multiple LoRA Adapters
baichuan chatglm dpo finetune gpu llama llama2 llm lora mlora peft rlhf
Language:Python 272
km1994 / llms_paper
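This repository mainly records reading notes on top-conference papers relevant to LLM algorithm engineers (multimodal, PEFT, few-shot QA, RAG, LMM interpretability, Agents, CoT)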
agent llms lora peft qa rag
264
iamarunbrahma / finetuned-qlora-falcon7b-medical
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
chatbot qlora fine-tuning llm lora peft falcon falcon-7b mental-health conversational-ai chatbots healthcare
Language:Jupyter Notebook 242
jackaduma / Vicuna-LoRA-RLHF-PyTorch
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT but with Vicuna
chatgpt finetune gpt llama llm lora peft ppo pytorch reward-models rlhf vicuna vicuna-7b
Language:Python 208
jasonvanf / llama-trl
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
adapter chatgpt gpt gpt-4 llama lora peft ppo rlhf transformer trl
Language:Python 185
calpt / awesome-adapter-resources
Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning / Fine-Tuning
adapters awesome deep-learning natural-language-processing nlp parameter-efficient-learning parameter-efficient-tuning peft transformers
Language:Python 175
jianzhnie / open-chatgpt
The open source implementation of ChatGPT, Alpaca, Vicuna and RLHF Pipeline. Implementing a ChatGPT from scratch.
chatgpt gpt llm rlhf ppo llama stanford-alpaca lora peft
Language:Python 175
jackaduma / ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
chatglm chatglm-6b chatgpt deepspeed finetune gpt llama llm lora peft ppo pytorch reward-models rlhf
Language:Python 126
liuqidong07 / MOELoRA-peft
[SIGIR'24] The official implementation code of MOELoRA.
chatglm large-language-models low-rank-adaptation mixture-of-experts multi-task multitask-learning parameter-efficient-fine-tuning peft peft-fine-tuning-llm
Language:Python 125
ZhengxiangShi / DePT
[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"
fine-tuning language-model large-language-models natural-language-processing nlp nlp-machine-learning parameter-efficient-fine-tuning parameter-efficient-tuning peft prompt-tuning transfer-learning
Language:Python 94
simplifine-llm / Simplifine
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
cloud fine-tuning fine-tuning-llm finetuning-llms large-language-models llama llm llm-training open-source ai gpt instruction-tuning llama3 lora mistral moe peft phi qwen
Language:Python 85
BorealisAI / flora-opt
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
deep-learning flax jax large-language-models lora memory-efficient-tuning optax peft random-projection transformers
Language:Python 81
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
finetuning huggingface lora mistral-7b peft pytorch sentence-embeddings transformers
Language:Python 71
NisaarAgharia / Indian-LawyerGPT
Fine-Tuning Falcon-7B, LLAMA 2 with QLoRA to create an advanced AI model with a profound understanding of the Indian legal context.
falcon fine-tuning gpt huggingface-transformers large-language-models llama llama2 llms peft qlora
Language:Jupyter Notebook 67
ziplab / SPT
[ICCV 2023 oral] This is the official repository for our paper: "Sensitivity-Aware Visual Parameter-Efficient Fine-Tuning".
adapter lora parameter-efficient-fine-tuning peft prompt-tuning transfer-learning
Language:Python 64
jackaduma / Alpaca-LoRA-RLHF-PyTorch
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT but with Alpaca
alpaca chatgpt llama llm lora pytorch rlhf gpt finetune deepspeed peft ppo reward-models
Language:Python 56
zjohn77 / lightning-mlflow-hf
Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow
adapter azure-ml deep-learning hugging-face language-model llm lora mlflow nlp peft polars pytorch pytorch-lightning qlora
Language:Python 54
UCDvision / NOLA
Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"
large-language-models lora peft peft-fine-tuning-llm transformers vision
Language:Python 49
sharma-n / DAG_Scheduling
HEFT, randomHEFT and IPEFT algorithms for static list DAG Scheduling
dag dag-scheduling directed-acyclic-graph heft ipeft peft scheduling-algorithms
Language:Jupyter Notebook 48
Reason-Wang / flan-alpaca-lora
This repository contains the code to train Flan-T5 with Alpaca instructions and low-rank adaptation.
alpaca flan-t5 huggingface instruction-tuning lora low-rank peft pytorch t5
Language:Python 47
Gunale0926 / SORSA
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models
fine-tuning lora peft sorsa deep-learning machine-learning llama pytorch rwkv svd nlp python transformer
Language:Python 46
