sft

There are 0 repository under sft topic.

modelscope / swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
agent llm lora llama pre-training sft deploy multimodal dpo llava llama3 modelscope unsloth peft glm4 qwen2 internvl ollama mistral-nemo megatron
Language:Python 2530
ssbuild / chatglm_finetuning
chatglm 6b finetuning and alpaca finetuning
chatglm deep-learning lora pytorch p-tuning-v2 adalora sft freeze qlora ia3
Language:Python 1531
jerry1993-tech / Cornucopia-LLaMA-Fin-Chinese
聚宝盆(Cornucopia): 中文金融系列开源可商用大模型，并提供一套高效轻量化的垂直领域LLM训练框架(Pretraining、SFT、RLHF、Quantize等)
llama nlp chinese finance rlhf sft qa text-generation large-language-models transformers
Language:Python 574
ukairia777 / tensorflow-nlp-tutorial
tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.
tensorflow nlp natural-language-processing question-answering named-entity-recognition bert-ner bert nlp-tutorial keras-tutorial llm dpo llama sft huggingface transformers lora trainer
Language:Jupyter Notebook 502
choosewhatulike / trainable-agents
Code and datasets for "Character-LLM: A Trainable Agent for Role-Playing"
agent language-model llm roleplay sft large-language-models natural-language-processing character
Language:Python 397
0xsequence / erc-1155
Ethereum Semi Fungible Standard (ERC-1155)
erc1155 ethereum nft semi-fungible sft token-contract
Language:TypeScript 317
solv-finance / erc-3525
ERC-3525 Reference Implementation
sft solv erc-3525 erc3525
Language:Solidity 106
muyu42 / DataS
本项目旨在结合以往研究人员的代表性工作，从多个维度评估sft数据，并自动化过滤sft数据。
data-engineering llm-training sft
Language:Python 54
ssbuild / moss_finetuning
moss chat finetuning
adalora chat lora moss finetuing chatmoss sft qlora
Language:Python 50
movescriptions / movescriptions
https://twitter.com/MoveScriptions
blockchain inscription move sft
Language:Move 44
ElvenTools / elven-tools-cli
Elven Tools CLI - command line tool for launching NFTs collections on the MultiversX blockchain (Plus other tools).
elrond nft blockchain javascript nodejs cli multiversx sft
Language:TypeScript 25
wangclnlp / Vision-LLM-Alignment
This repo contains the codes for supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) designed for vision LLMs.
vision dpo llm rlhf sft ppo alignment reward mllm multi-model llava
Language:Python 19
Macielyoung / Baichuan-QLora
Finetune baichuan pretrained model with QLora method
baichuan-7b qlora sft
Language:Python 16
wangclnlp / DeepSpeed-Chat-Extension
This repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
deepspeed llm rlhf sft llama
Language:Python 16
rbga / Low_Density_Parity_Check_LDPC_Codes_-_MATLAB_Simulation
LDPC MATLAB simulation using BPSK + AWGN modulation decoded using Sum Product and Min Sum Algorithm
awgn awgn-channel ber bit-error-rate bpsk bpsk-modulation cyclic-codes decoder forward-error-correction gaussian-noise ldpc ldpc-codes matlab matrix minimum-sum sft simulation sum-product sum-product-algorithm
Language:MATLAB 14
taishan1994 / chinese_llm_sft
使用指令微调对大模型进行微调。
llm lora sft
Language:Python 7
DaehanKim / EasyRLHF
EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
language-model rlhf dpo instruction-tuning ipo sft rrhf
Language:Python 6
hlp-ai / miniChatGPT
Mini ChatGPT
chatgpt instructgpt ppo pytorch sft reward-model gpt2
Language:Python 6
AlekseyKorshuk / gai-project
Train expert conversational role-play LLMs with synthetic data
chatbot conversational-ai expert-models llm pipeline sft synthetic-data
Language:Python 5
THU-KEG / DICE
DICE: Detecting In-distribution Data Contamination with LLM's Internal State
benchmark data-contamination fine-tuning-llm gsm8k llm sft
Language:Python 5
ldclabs / ic-sft
A SFT (Semi-Fungible Token, implemented ICRC-7 and ICRC-37) canister smart contract on the Internet Computer.
icp nft sft icrc-37 icrc-7
Language:Rust 4
ElvenTools / elven-tools-sft-minter-sc
Elven Tools SFT Minter Smart Contract - launching SFTs collections on the MultiversX blockchain
multiversx sft smart-contracts blockchain rust
Language:Rust 3
Sophietje / SFTLearning
Testing the security of sanitizers by learning symbolic finite transducers
automata learning-algorithm sfa sft sanitizer sanitization transducers verification
Language:Java 2
dgomezde83 / Multifungible-library
MultiversX library for interacting with the MultiversX blockchain's Non-fungible tokens and Semi-fungible tokens.
cpp library multiversx nft sft
Language:C++ 1
Lamsoda1123 / GPT2_medium_finetune-lora-sft
It's a GPT2 finetune project based on peft and transformers. Although can provide quite a imporvement, however, the illusion and inteligent is terrible.
llm lora sft
Language:Python 1
sftchance
sftchance / sftchance
⚪ CHANCE IS A STUDY IN DECENTERED IDENTITY TOURISM AND THE A(E)FFECTS OF PRIVILEGE, ENTITLEMENT, AND CAPITAL, WITH BOUNDLESS MOBILITY ENABLED BY THE INTERNET.
sft sftchance semi-fungible
Language:TypeScript 1
SharathHebbar / Coding-Templates
Coding Templates
dpo sft coding-templates
Language:Jupyter Notebook 1
SharathHebbar / sft_mathgpt2
Supervised Fine tuning using TRL library
decoder gpt2 llm mathgpt sft text-generation transformers trl
Language:Jupyter Notebook 1
sunnynevarekar / LLM_Mistral_7b_SFT
Finetune Mistral 7b v1.0 on custom dataset
large-language-models llm mistral-7b qlora sft supervised-finetuning text-to-sql
Language:Jupyter Notebook 1
sft-logos
TauntonandSomersetNHSTrust / sft-logos
Somerset NHSFT's logos
nhs sft somerset
Language:Shell 1
tonyskapunk / sft-aur
Scripts to keep up with latest scaleft packages to build them for AUR
arch aur hacktoberfest linux sft
Language:Shell 1
dag0310 / Fast-SFTP-Folder-Uploader
Upload folders faster via SFTP by temporarily zipping on the client and unzipping on the host.
fast folder sft temporary unzip upload zip
Language:Python 0
jmaczan / c-137
🦙 Llama 2 7B fine-tuned to revive Rick
deep-learning fine-tuning finetuning llama-2 llama2 llm machine-learning nlp rick-and-morty rick-sanchez rickandmorty sft supervised-finetuning apple-m2 llama2-7b c-137 google-colab
Language:Jupyter Notebook 0
XpastaX / Instruction-Fusion
Advancing Prompt Evolution through Hybridization
codegeneration dataset llm sft
Language:Python 0
Nexdata-AI / 100000-Instruction-Following-Evaluation-SFT-for-Chinese-LLM-Text-Data
100000-Instruction-Following-Evaluation-SFT-for-Chinese-LLM-Text-Data
llm-training nlp large-language-models sft
PhilipMay / llm-data
LLM Training Data
llm sft
Language:Jupyter Notebook

sft

modelscope / swift

ssbuild / chatglm_finetuning

jerry1993-tech / Cornucopia-LLaMA-Fin-Chinese

ukairia777 / tensorflow-nlp-tutorial

choosewhatulike / trainable-agents

0xsequence / erc-1155

solv-finance / erc-3525

muyu42 / DataS

ssbuild / moss_finetuning

movescriptions / movescriptions

ElvenTools / elven-tools-cli

wangclnlp / Vision-LLM-Alignment

Macielyoung / Baichuan-QLora

wangclnlp / DeepSpeed-Chat-Extension

rbga / Low_Density_Parity_Check_LDPC_Codes_-_MATLAB_Simulation

taishan1994 / chinese_llm_sft

DaehanKim / EasyRLHF

hlp-ai / miniChatGPT

AlekseyKorshuk / gai-project

THU-KEG / DICE

ldclabs / ic-sft

ElvenTools / elven-tools-sft-minter-sc

Sophietje / SFTLearning

dgomezde83 / Multifungible-library

Lamsoda1123 / GPT2_medium_finetune-lora-sft

sftchance / sftchance

SharathHebbar / Coding-Templates

SharathHebbar / sft_mathgpt2

sunnynevarekar / LLM_Mistral_7b_SFT

TauntonandSomersetNHSTrust / sft-logos

tonyskapunk / sft-aur

dag0310 / Fast-SFTP-Folder-Uploader

jmaczan / c-137

XpastaX / Instruction-Fusion

Nexdata-AI / 100000-Instruction-Following-Evaluation-SFT-for-Chinese-LLM-Text-Data

PhilipMay / llm-data