Alireza Kazemipour (alirezakazemipour)

alirezakazemipour

Geek Repo

Company:University of Alberta

Location:Edmonton, AB

Home Page:alirezakazemipour.github.io

Github PK Tool:Github PK Tool

Alireza Kazemipour's repositories

DDPG-HER

Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.

DIAYN-PyTorch

Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.

Language:PythonLicense:MITStargazers:56Issues:2Issues:3

PPO-RND

Random network distillation on Montezuma's Revenge and Super Mario Bros.

Discrete-SAC-PyTorch

PyTorch implementation of discrete version of Soft Actor-Critic.

Language:PythonLicense:MITStargazers:25Issues:3Issues:1

Continuous-PPO

Proximal Policy Optimization (Continuous Version) in PyTorch.

NN-Without-Frameworks

Let's build Neural Networks from scratch.

Language:PythonLicense:MITStargazers:14Issues:2Issues:0

Distributional-RL

Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.

Language:PythonLicense:MITStargazers:6Issues:1Issues:0

DQN-HER

Implementation of the hindsight experience by DQN algorithm on the bit flip environment.

Language:PythonStargazers:6Issues:3Issues:0

Cycle-GAN-PyTorch

PyTorch implementation of the Cycle GAN paper.

Language:PythonStargazers:4Issues:2Issues:0

DeepRL-Paradise

Comprehensive Deep RL Implementations

License:MITStargazers:3Issues:1Issues:0

Rainbow

Combining Improvements in Deep Reinforcement Learning

TRPO-PyTorch

Trust Region Policy Optimization in PyTorch.

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

A3C-ACER-PyTorch

Implementation of ACER and A3C in PyTorch.

Language:PythonLicense:MITStargazers:1Issues:2Issues:0
Language:PythonLicense:MITStargazers:1Issues:1Issues:0
Language:Jupyter NotebookStargazers:1Issues:2Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1Issues:1Issues:0

A2C-SIL-TF2

TensorFlow2 implementation of Self-Imitation Learning (SIL) with Synchronous Advantage Actor-Critic (A2C).

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:1Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:2Issues:0
License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:2Issues:0

Discrete-PPO

Implementation of the proximal policy optimization on the Atari environments.

Language:PythonStargazers:0Issues:2Issues:0

homework_fall2021

Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)

Language:PythonStargazers:0Issues:0Issues:0

PyExpUtils

Experiment utility code, specifically designed for use with Compute Canada.

Language:PythonStargazers:0Issues:0Issues:0

reinforcement_learning_an_introduction

Notes and exercise solutions for second edition of Sutton & Barto's book

Language:TeXLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

TD3-PyTorch

Addressing Function Approximation Error in Actor-Critic Methods

Language:PythonLicense:GPL-3.0Stargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0