Harryis Wang (Harry-mic)

Company: Tsinghua

Harryis Wang's repositories

TREvaL

Reasonable Reward Evaluation of Large Language Models

Language: Python | License: MIT | Stargazers: 6 | Issues: 0
Language: Jupyter Notebook | Stargazers: 0 | Issues: 0

la-mbda

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

RL-ViGen

This is the repo for RL-ViGen

Language: Python | License: MIT | Stargazers: 0 | Issues: 0