Repositories under the supervised-finetuning topic:
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & Vertical Distillation of LLMs.
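The distillation algorithms surveyed there typically train a student to match a teacher's next-token distribution. A minimal, generic sketch of that white-box token-level objective (not the survey's code; tensor shapes and the temperature value are illustrative):

```python
# Generic token-level distillation: temperature-scaled KL between teacher and
# student next-token distributions. Shapes and values are illustrative.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL(teacher || student) over the vocabulary, averaged over the batch."""
    # logits: (batch, seq_len, vocab_size)
    s = F.log_softmax(student_logits / temperature, dim=-1)
    t = F.softmax(teacher_logits / temperature, dim=-1)
    # Scale by T^2 so gradient magnitude is comparable across temperatures.
    return F.kl_div(s, t, reduction="batchmean") * temperature ** 2

student_logits = torch.randn(2, 8, 32000, requires_grad=True)
teacher_logits = torch.randn(2, 8, 32000)
loss = distillation_loss(student_logits, teacher_logits)
loss.backward()
```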
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
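The core idea of "prompting an aligned LLM with nothing" is to feed only the chat template's pre-query prefix and let the model complete it with a synthetic user instruction. A hedged sketch of that step, assuming a Llama-3-style chat template (the model id and prefix string are illustrative, not this repo's exact setup):

```python
# Sample a synthetic instruction by generating from only the pre-query prefix.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3-8B-Instruct"  # assumption: any aligned chat model
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Everything up to where the user's message would normally begin.
prefix = "<|begin_of_text|><|start_header_id|>user<|end_header_id|>\n\n"
inputs = tok(prefix, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=1.0)
instruction = tok.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
print(instruction)  # a sampled synthetic user instruction
```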
Aligning Large Language Models with Human: A Survey
A curated collection of open-source SFT datasets, continuously updated
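For orientation, most open-source SFT datasets use an Alpaca-style record layout; field names vary by dataset, and this example is purely illustrative:

```python
# Illustrative Alpaca-style SFT record; not taken from any specific dataset.
sft_record = {
    "instruction": "Summarize the following paragraph in one sentence.",
    "input": "Supervised fine-tuning adapts a pretrained model to follow instructions...",
    "output": "SFT teaches a pretrained model to follow instructions using labeled examples.",
}

# Many trainers expect a single flattened prompt/response pair:
prompt = f"{sft_record['instruction']}\n\n{sft_record['input']}"
response = sft_record["output"]
```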
✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework
The official implementation of InstructERC
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
LogLLM: Log-based Anomaly Detection Using Large Language Models (system log anomaly detection)
[ACL 2025] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.
Demo code for fine-tuning a multimodal large language model with LLaMA-Factory
Code for the paper "Preserving Diversity in Supervised Fine-tuning of Large Language Models"
[NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"
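The label-masking choice this paper revisits can be sketched as follows (illustrative code, not the paper's implementation): conventional SFT masks prompt tokens with -100 so the loss covers only the response, whereas "loss over instructions" keeps the prompt tokens in the loss as well.

```python
# Toggle between response-only loss (standard SFT) and loss over instruction tokens.
import torch

IGNORE_INDEX = -100

def build_labels(input_ids, prompt_len, loss_over_instructions=False):
    labels = input_ids.clone()
    if not loss_over_instructions:
        labels[:prompt_len] = IGNORE_INDEX  # mask the instruction/prompt span
    return labels

input_ids = torch.tensor([101, 2054, 2003, 1996, 3007, 102, 3000, 102])
labels_masked = build_labels(input_ids, prompt_len=6)                              # response-only loss
labels_full = build_labels(input_ids, prompt_len=6, loss_over_instructions=True)   # loss on all tokens
```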
[NeurIPS 2025] The official repository of "Inst-IT: Boosting Multimodal Instance Understanding via Explicit Visual Prompt Instruction Tuning"
Official implementation for "Diffusion Instruction Tuning"
[AAAI 2025] Automatically Generating Numerous Context-Driven SFT Data for LLMs across Diverse Granularity
A LLaMA-Factory case study of fine-tuning Qwen2-VL for the culture and tourism domain (historical sites and museums)
Finetuning Google's Gemma Model for Translating Natural Language into SQL
LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.
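A minimal sketch of the LoRA setup such a project typically uses with Hugging Face PEFT; the model name, target modules, and hyperparameters below are assumptions, not this repo's actual configuration:

```python
# LoRA adapter setup with PEFT; hyperparameters and target modules are illustrative.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")
lora_config = LoraConfig(
    r=16,                                  # adapter rank
    lora_alpha=32,                         # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # attention projections (a common choice)
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()         # only the low-rank adapters are trainable
```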
Federated Fine-Tuning of LLMs on Apple Silicon with Flower.ai and MLX-LM
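Federated fine-tuning of this kind aggregates client updates with FedAvg. A generic sketch of that aggregation step (plain NumPy, not the Flower.ai or MLX-LM API; names and shapes are illustrative):

```python
# FedAvg: weighted average of client weights by number of local training examples.
import numpy as np

def fed_avg(client_updates):
    """client_updates: list of (weights_dict, num_examples)."""
    total = sum(n for _, n in client_updates)
    keys = client_updates[0][0].keys()
    return {k: sum(w[k] * (n / total) for w, n in client_updates) for k in keys}

clients = [
    ({"lm_head": np.ones((4, 4)) * 1.0}, 100),
    ({"lm_head": np.ones((4, 4)) * 3.0}, 300),
]
global_weights = fed_avg(clients)  # weighted toward the larger client: values = 2.5
```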
Python Project Sample for Demonstration
An LLM challenge to (i) fine-tune a pre-trained Hugging Face transformer model to build a code-generation language model, and (ii) build a retrieval-augmented generation (RAG) application using LangChain
Building an LLM with RLHF involves fine-tuning on human-labeled preferences. Based on "Learning to Summarize from Human Feedback", the pipeline uses supervised fine-tuning, reward modeling, and PPO to improve response quality and alignment.
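The reward-modeling step in that pipeline is usually trained with a pairwise ranking loss over chosen/rejected responses, -log(sigmoid(r_chosen - r_rejected)). A minimal sketch (illustrative, not this repo's code):

```python
# Pairwise ranking loss for a reward model scoring preferred vs. dispreferred responses.
import torch
import torch.nn.functional as F

def reward_ranking_loss(chosen_rewards, rejected_rewards):
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

chosen = torch.tensor([1.2, 0.7], requires_grad=True)    # scalar rewards for preferred responses
rejected = torch.tensor([0.3, 0.9], requires_grad=True)  # rewards for dispreferred responses
loss = reward_ranking_loss(chosen, rejected)
loss.backward()
```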
Various LMs/LLMs below 3B parameters (for now) trained with SFT (supervised fine-tuning) for several downstream tasks
Binary classification of pathological heartbeats from ECG signals using 1D CNNs in PyTorch
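A minimal 1D-CNN sketch for binary heartbeat classification from a fixed-length ECG window (illustrative architecture; the 187-sample window length is an assumption, not necessarily this repo's):

```python
# Two Conv1d blocks followed by a linear head producing a single logit per window.
import torch
import torch.nn as nn

class ECGConvNet(nn.Module):
    def __init__(self, seq_len=187):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv1d(1, 16, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool1d(2),
            nn.Conv1d(16, 32, kernel_size=5, padding=2), nn.ReLU(), nn.MaxPool1d(2),
        )
        self.classifier = nn.Linear(32 * (seq_len // 4), 1)  # single logit for binary output

    def forward(self, x):  # x: (batch, 1, seq_len)
        h = self.features(x)
        return self.classifier(h.flatten(1))

model = ECGConvNet()
logits = model(torch.randn(8, 1, 187))  # (8, 1)
loss = nn.BCEWithLogitsLoss()(logits.squeeze(1), torch.randint(0, 2, (8,)).float())
```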
This project streamlines the fine-tuning process, enabling you to leverage Llama-2's capabilities for your own projects.
Federated Supervised Fine-Tuning for Small Language Models (SLMs)
[EMNLP 2025 Main] JOLT-SQL: Joint Loss Tuning of Text-to-SQL with Confusion-aware Noisy Schema Sampling
[ACL 2025 Findings] Edit Once, Update Everywhere: A Simple Framework for Cross-Lingual Knowledge Synchronization in LLMs