alirezakazemipour

Alireza Kazemipour's repositories

DDPG-HER

Implementation of the Deep Deterministic Policy Gradient and Hindsight Experience Replay.

Language:Python84 2 5

DIAYN-PyTorch

Diversity is All You Need: Learning Skills without a Reward Function in PyTorch.

Language:PythonMIT56 2 3

PPO-RND

Random network distillation on Montezuma's Revenge and Super Mario Bros.

Language:Python40 2 2

Discrete-SAC-PyTorch

PyTorch implementation of discrete version of Soft Actor-Critic.

Language:PythonMIT25 3 1

Continuous-PPO

Proximal Policy Optimization (Continuous Version) in PyTorch.

Language:Python24 2 2

NN-Without-Frameworks

Let's build Neural Networks from scratch.

Language:PythonMIT14 20

Distributional-RL

Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.

Language:PythonMIT6 10

DQN-HER

Implementation of the hindsight experience by DQN algorithm on the bit flip environment.

Language:Python6 30

Cycle-GAN-PyTorch

PyTorch implementation of the Cycle GAN paper.

Language:Python4 20

DeepRL-Paradise

Comprehensive Deep RL Implementations

MIT3 10

Rainbow

Combining Improvements in Deep Reinforcement Learning

Language:Python3 2 1

TRPO-PyTorch

Trust Region Policy Optimization in PyTorch.

Language:PythonMIT2 20

A3C-ACER-PyTorch

Implementation of ACER and A3C in PyTorch.

Language:PythonMIT1 20

ACKTR-PyTorch

Language:PythonMIT1 10

DDQN-Random-Network-Distillation

Language:Python1 20

DeepLearning-Collection

Language:Jupyter Notebook1 20

Parkinson-Disease-Classification

Language:Jupyter NotebookMIT1 10

A2C-SIL-TF2

TensorFlow2 implementation of Self-Imitation Learning (SIL) with Synchronous Advantage Actor-Critic (A2C).

Language:PythonMIT020

alirezakazemipour

010

alirezakazemipour.github.io

Language:JavaScriptMIT020

brett-daley.github.io

MIT000

Cartpole-RL

Language:Python020

Discrete-PPO

Implementation of the proximal policy optimization on the Atari environments.

Language:Python020

homework_fall2021

Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2021)

Language:Python000

PyExpUtils

Experiment utility code, specifically designed for use with Compute Canada.

Language:Python000

reinforcement_learning_an_introduction

Notes and exercise solutions for second edition of Sutton & Barto's book

Language:TeXMIT000

rl-prediction-template

Language:Python000

TD3-PyTorch

Addressing Function Approximation Error in Actor-Critic Methods

Language:PythonGPL-3.0020

Top-50-Crypto-Kaggle

Language:PythonMIT010

TreeBased-And-SVM-Classifiers

Language:Jupyter NotebookMIT010