There are 10 repositories under the qlora topic.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Fine-tuning & Reinforcement Learning for LLMs. Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, and TTS models 2x faster with 70% less VRAM.
Accessible large language models via k-bit quantization for PyTorch.
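The k-bit quantization that bitsandbytes provides can be illustrated with a toy example. The sketch below is a simplified symmetric absmax 4-bit scheme in plain Python, an assumption-laden illustration of the general idea, not the library's actual NF4 codebook or API:

```python
# Toy absmax 4-bit quantization: NOT the bitsandbytes implementation,
# just an illustration of the idea behind k-bit weight compression.

def quantize_4bit(weights):
    """Scale weights by their absolute maximum and round to integers in [-7, 7]."""
    absmax = max(abs(w) for w in weights) or 1.0
    q = [round(w / absmax * 7) for w in weights]
    return q, absmax

def dequantize_4bit(q, absmax):
    """Recover approximate float weights from the 4-bit integers and the scale."""
    return [v / 7 * absmax for v in q]

w = [0.5, -1.2, 0.0, 0.9]
q, scale = quantize_4bit(w)       # q = [3, -7, 0, 5], scale = 1.2
w_hat = dequantize_4bit(q, scale) # each value is off by at most scale/7
```

Real NF4 quantization replaces the uniform integer grid with a codebook shaped for normally distributed weights, but the quantize/scale/dequantize round-trip is the same pattern.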
Firefly: a training toolkit for large language models, supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Fine-tuning ChatGLM-6B with PEFT | Efficient ChatGLM fine-tuning based on PEFT
Learn about LLMs, LLMOps, and Vector DBs for free by designing, training, and deploying a real-time financial advisor LLM system ~ source code + video & reading materials
ChatGLM-6B fine-tuning and Alpaca fine-tuning
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
Easy and efficient fine-tuning of LLMs (supports Llama, Llama2, Llama3, Qwen, Baichuan, GLM, Falcon). Efficient quantized training and deployment of large models.
Sunsimiao: a Chinese medical large language model, providing safe, reliable, and accessible Chinese medical LLM capabilities
Firefly Chinese LLaMA-2 large model, supporting incremental pre-training of Baichuan2, Llama2, Llama, Falcon, Qwen, Baichuan, InternLM, Bloom, and other large models
End-to-end generative AI industry projects on LLMs with deployment | Awesome LLM Projects
Finetuning of Falcon-7B LLM using QLoRA on Mental Health Conversational Dataset
The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
Full-parameter fine-tuning, LoRA fine-tuning, and QLoRA fine-tuning of Llama3.
LongQLoRA: Extend the Context Length of LLMs Efficiently
Large language model fine-tuning for Bloom, OPT, GPT, GPT-2, LLaMA, LLaMA-2, CPM-Ant, and more
Fine-tune Chinese large language models with QLoRA, including ChatGLM, Chinese-LLaMA-Alpaca, and BELLE
Fine-tuning Falcon-7B and LLaMA 2 with QLoRA to create an advanced AI model with a profound understanding of the Indian legal context.
Use QLoRA to tune LLM in PyTorch-Lightning w/ Huggingface + MLflow
VerifAI: an initiative to build an open-source, easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using an a posteriori model)
Fine-tune any model on HF in less than 30 seconds
Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructGPT or ChatGPT, but at a much smaller scale.
Tuning the Finetuning: An exploration of achieving success with QLoRA
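Since most of these repositories build on LoRA/QLoRA adapters, the core idea is worth sketching: a frozen base weight matrix W is augmented with a trainable low-rank update B·A scaled by alpha/r. The snippet below is a hypothetical pure-Python illustration of that forward pass, not code taken from any repository listed here:

```python
# Minimal LoRA sketch: y = W x + (alpha/r) * B (A x).
# W is frozen; only the small matrices A (r x d_in) and B (d_out x r) train.

def matvec(M, x):
    """Multiply matrix M (list of rows) by vector x."""
    return [sum(m * xj for m, xj in zip(row, x)) for row in M]

def lora_forward(W, A, B, x, alpha=16, r=2):
    """Frozen base output plus the scaled low-rank adapter contribution."""
    base = matvec(W, x)
    delta = matvec(B, matvec(A, x))   # rank-r update routed through A then B
    s = alpha / r
    return [b + s * d for b, d in zip(base, delta)]
```

With B initialized to zeros (the standard LoRA initialization), the adapted model starts out exactly equal to the frozen base model; QLoRA applies the same update on top of a 4-bit-quantized W.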
A Gradio web UI for large language models. Supports LoRA/QLoRA fine-tuning, RAG (retrieval-augmented generation), and chat
Baichuan and Baichuan2 fine-tuning and Alpaca fine-tuning
Meta Llama 3 GenAI real-world use cases: an end-to-end implementation guide
An LLM training library for instruction-tuning.