Ilya Kostrikov's repositories
pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
pytorch-flows
PyTorch implementations of algorithms for density estimation
pytorch-trpo
PyTorch implementation of Trust Region Policy Optimization
pytorch-meta-optimizer
A PyTorch implementation of Learning to learn by gradient descent by gradient descent
pytorch-ddpg-naf
Implementation of algorithms for continuous control (DDPG and NAF).
TensorFlow-Pointer-Networks
TensorFlow implementation of Pointer Networks
Mine_tf2.0
MINE: Mutual Information Neural Estimation in pytorch
motion_imitation
Code accompanying the paper "Learning Agile Robotic Locomotion Skills by Imitating Animals"
Implicit-Q-Learning
PyTorch implementation of the implicit Q-learning algorithm (IQL)
unitree_sim
MuJoCo models for Unitree Robots
gym-wordle
Gym environment for playing Wordle with RL agents
oatomobile
A research framework for autonomous driving