ruizhaogit's repositories
EnergyBasedPrioritization
Energy-Based Hindsight Experience Prioritization (CoRL 2018) (Oral presentation, 7%)
maximum_entropy_population_based_training
Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination
GuessWhat-TemperedPolicyGradient
Learning Goal-Oriented Visual Dialog via Tempered Policy Gradient (SLT 2018) (IJCAIw 2018)
MNIST-GuessNumber
Efficient Dialog Policy Learning via Positive Memory Retention (SLT 2018) (NIPSw 2018)
PositiveMemoryRetention
Efficient Dialog Policy Learning via Positive Memory Retention (SLT 2018) (NIPSw 2018)