Zhizhou Ren's repositories
Hindsight-Goal-Generation
TensorFlow implementation for our paper "Exploration via Hindsight Goal Generation"
Randomized-Return-Decomposition
TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"
Adaptation-with-Noisy-OracLE
PyTorch implementation for our paper "Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation"
Doubly-Bounded-Q-Learning
TensorFlow implementation for our paper "On the Estimation Bias in Double Q-Learning"