safarzadeh-reza / Reinforcement-Learning-Code

A repository for codes of reinforcement learning algorithms with PyTorch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Reinforcement Learning Code with PyTorch

Papers

Algorithms

01. Model-Free Reinforcement Learning

Deep Q-Network (DQN)

Double DQN (DDQN)

Advantage Actor-Critic (A2C)

Asynchronous Advantage Actor-Critic (A3C)

Deep Deterministic Policy Gradient (DDPG)

Truncated Natural Policy Gradient (TNPG)

Trust Region Policy Optimization (TRPO)

TRPO + Generalized Advantage Estimator (GAE)

Proximal Policy Optimization (PPO)

PPO + Generalized Advantage Estimator (GAE)

Soft Actor-Critic (SAC)


02. Inverse Reinforcement Learning

Apprenticeship Learning via Inverse Reinforcement Learning (APP)

Maximum Entropy Inverse Reinforcement Learning (MaxEnt)

Generative Adversarial Imitation Learning (GAIL)

Variational Adversarial Imitation Learning (VAIL)


Learning curve

CartPole

Pendulum

Hopper


Reference

About

A repository for codes of reinforcement learning algorithms with PyTorch

License:MIT License


Languages

Language:Python 100.0%