ruizhaogit's repositories

music

Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)

Language: Python | License: NOASSERTION | Stargazers: 35 | Issues: 5 | Issues: 3

EnergyBasedPrioritization

Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)

Language: Python | License: NOASSERTION | Stargazers: 32 | Issues: 2 | Issues: 2

maximum_entropy_population_based_training

Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination

mep

Maximum Entropy-Regularized Multi-Goal Reinforcement Learning (ICML 2019)

Language: Python | License: NOASSERTION | Stargazers: 23 | Issues: 3 | Issues: 1

GuessWhat-TemperedPolicyGradient

Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient (SLT 2018) (IJCAIw 2018)

Language: Lua | License: NOASSERTION | Stargazers: 8 | Issues: 2 | Issues: 0

MNIST-GuessNumber

Efficient Dialog Policy Learning via Positive Memory Retention (SLT 2018) (NIPSw 2018)

Language: Python | License: NOASSERTION | Stargazers: 6 | Issues: 2 | Issues: 0

PositiveMemoryRetention

Efficient Dialog Policy Learning via Positive Memory Retention (SLT 2018) (NIPSw 2018)

Language: Lua | License: NOASSERTION | Stargazers: 3 | Issues: 2 | Issues: 0

alf

Agent Learning Framework https://alf.readthedocs.io

Language: Python | License: Apache-2.0 | Stargazers: 2 | Issues: 0 | Issues: 0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 2 | Issues: 0

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0