Christopher Hesse's repositories
atari-demo
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
atari-reset
Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
deeptype
Code for the paper "DeepType: Multilingual Entity Linking by Neural Type System Evolution"
EPG
Code for the paper "Evolved Policy Gradients"
evolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
finetune-transformer-lm
Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
generating-reviews-discovering-sentiment
Code for "Learning to Generate Reviews and Discovering Sentiment"
glow
Code for reproducing results in "Glow: Generative Flow with Invertible 1x1 Convolutions"
gym
A toolkit for developing and comparing reinforcement learning algorithms.
iaf
Code for reproducing key results in the paper "Improving Variational Inference with Inverse Autoregressive Flow"
imitation
Code for the paper "Generative Adversarial Imitation Learning"
improved-gan
Code for the paper "Improved Techniques for Training GANs"
InfoGAN
Code for reproducing key results in the paper "InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets"
large-scale-curiosity
Code for the paper "Large-Scale Study of Curiosity-Driven Learning"
maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
mlsh
Code for the paper "Meta-Learning Shared Hierarchies"
mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
multiagent-competition
Code for the paper "Emergent Complexity via Multi-agent Competition"
multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
neural-gpu
Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms"
ot-gan
Code for the paper "Improving GANs Using Optimal Transport"
pixel
Code for a single pixel debate game from the paper "AI safety via debate" https://arxiv.org/abs/1805.00899
pixel-cnn
Code for the paper "PixelCNN++: A PixelCNN Implementation with Discretized Logistic Mixture Likelihood and Other Modifications"
retro
Retro Games in Gym
roboschool
Open-source software for robot simulation, integrated with OpenAI Gym.
signup-forms
Code for the paper "World of Bits: An Open-Domain Platform for Web-Based Agents"
spinningup
An educational resource to help anyone learn deep reinforcement learning.
supervised-reptile
Code for the paper "On First-Order Meta-Learning Algorithms"
vime
Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"
weightnorm
Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks"