Geek Repo
Company: Tsinghua
Reasonable Reward Evaluation of Large Language Models
LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization
This is the repo for RL-ViGen