rlaif

There are 6 repositories under the rlaif topic.

argilla-io / distilabel
⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
ai huggingface llms openai python rlaif rlhf synthetic-data synthetic-dataset-generation
Language: Python, 997 stars
mengdi-li / awesome-RLAIF
A continually updated list of literature on Reinforcement Learning from AI Feedback (RLAIF)
alignment llms rl rlaif rlhf
78 stars
vicgalle / zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
llm reinforcement-learning rlhf zero-shot reward-models trlx rlaif
Language: Python, 31 stars
holarissun / Prompt-OIRL
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
inverse-reinforcement-learning irl large-language-models llm offline-rl prompt-engineering rlaif rlhf offline-irl
Language: Python, 24 stars
vicgalle / distilled-self-critique
Distilled Self-Critique refines the outputs of an LLM with only synthetic data
llm rlaif synthetic-data self-critique
Language: Jupyter Notebook, 10 stars
vicgalle / awesome-rlaif
A curated and updated list of relevant articles and repositories on Reinforcement Learning from AI Feedback (RLAIF)
awesome language-model llm research rlaif rlhf
8 stars
