Harry-mic

Harryis Wang's repositories

Reasonable Reward Evaluation of Large Language Models

Language: Python, License: MIT, 7 10

Language: Jupyter Notebook, 010

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Language: Python, License: MIT, 000

This is the repo for RL-ViGen

Language: Python, License: MIT, 000