There are 0 repository under dpo topic.
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO。
ms-swift: Use PEFT or Full-parameter to finetune 200+ LLMs or 15+ MLLMs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
Technical anaysis library for .NET
CodeUltraFeedback for aligning large language models to coding preferences
Various training, inference and validation code and results related to Open LLM's that were pretrained (full or partially) on the Dutch language.
Processings Register for DPO (GDPR) - GLPI Plugin
A Laravel package to simplify using DPO Payment API in your application. https://dpogroup.com
Data and models for the paper "Configurable Safety Tuning of Language Models with Synthetic Preference Data"
A open-source framework designed to adapt pre-trained Language Models (LLMs), such as Llama, Mistral, and Mixtral, to a wide array of domains and languages.
Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon
This is the DPO Pay plugin for WooCommerce.
This is the DPO Group plugin for Gravity Forms.
DPO Group Payment gateway PHP SDK
An open source collection of tools meant to simplify the life of data protection officers (DPOs) of large entities
Unofficial PHP wrapper for Direct Pay Online API
This repository contains the source code used for finetuning the LLM phi-2 with several frameworks, such as DPO.
We're improving Yi-9B-200K with a ton of new abilities for high performance in generalist and specialist tasks.
Privacy Mapping Open Source Software
replaced pandas-ta calls with numpy/numba functions to speed up calculating ema, tema, rsi, mfi, adx, dpo
This is the DPO Group plugin for Gravity Forms.
Proof-of-concept leveraging DPO loss to fine-tune a ResNet to classify images from CIFAR10 dataset.
Awesome tools and information for Data Protection Officers - GDPR professionals
Direct Preference Optimization of ChatGPT2 using TRL Library