Alireza Kazemipour's repositories
DIAYN-PyTorch
Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.
Discrete-SAC-PyTorch
PyTorch implementation of discrete version of Soft Actor-Critic.
Continuous-PPO
Proximal Policy Optimization (Continuous Version) in PyTorch.
NN-Without-Frameworks
Let's build Neural Networks from scratch.
Distributional-RL
Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.
Cycle-GAN-PyTorch
PyTorch implementation of the Cycle GAN paper.
DeepRL-Paradise
Comprehensive Deep RL Implementations
TRPO-PyTorch
Trust Region Policy Optimization in PyTorch.
A3C-ACER-PyTorch
Implementation of ACER and A3C in PyTorch.
A2C-SIL-TF2
TensorFlow2 implementation of Self-Imitation Learning (SIL) with Synchronous Advantage Actor-Critic (A2C).
Discrete-PPO
Implementation of the proximal policy optimization on the Atari environments.
homework_fall2021
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)
PyExpUtils
Experiment utility code, specifically designed for use with Compute Canada.
reinforcement_learning_an_introduction
Notes and exercise solutions for second edition of Sutton & Barto's book
TD3-PyTorch
Addressing Function Approximation Error in Actor-Critic Methods