clam004 / rlhf

fine tuning natural language generation using a reinforcement learning signal

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

language_reinforce

fine tuning natural language generation using a reinforcement learning signal

python virtual environment

you@you chat-api % python3 -m venv venv
you@you chat-api % source venv/bin/activate
(venv) you@you chat-api % pip install --upgrade pip
(venv) you@you chat-api % pip install -r requirements.txt

About

fine tuning natural language generation using a reinforcement learning signal


Languages

Language:Jupyter Notebook 85.7%Language:Python 14.3%