jack139/RLHF_test
RLHF test
Reinforcement Learning from Human Feedback
RLHF on LM model
RLHF on RL model
Languages
Python: 95.0%
HTML: 4.8%
Shell: 0.2%
Procfile: 0.0%