Beast code in Giters

ImranRolo's repositories

deep-rl-docker

Docker image with OpenAI Gym, Baselines, MuJoCo and Roboschool, utilizing TensorFlow and JupyterLab.

Language:DockerfileMIT010

DeepMind-Advanced-Deep-Learning-and-Reinforcement-Learning

Advanced Deep Learning and Reinforcement Learning course taught at UCL in partnership with Deepmind

010

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonMIT010

PPO-Stein-Control-Variate

Proximal Policy Optimization with Stein Control Variates:

Language:PythonMIT010

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).

Language:PythonMIT010

RandWireNN

Implementation of: "Exploring Randomly Wired Neural Networks for Image Recognition"

Language:Python010

reinforcement-learning-algorithms

This repository contains most of classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, A3C, PPO, TRPO. (More algorithms are still in progress)

Language:PythonMIT000

Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials

Language:PythonMIT010

Variational_Discriminator_Bottleneck

Implementation (with some experimentation) of the paper titled "VARIATIONAL DISCRIMINATOR BOTTLENECK: IMPROVING IMITATION LEARNING, INVERSE RL, AND GANS BY CONSTRAINING INFORMATION FLOW" (arxiv -> https://arxiv.org/pdf/1810.00821.pdf)

Language:PythonMIT000

ImranRolo

ImranRolo's repositories

bayesian_neural_network

deep-rl-docker

DeepMind-Advanced-Deep-Learning-and-Reinforcement-Learning

gpt-2

ImranRolo.github.io

PPO-Stein-Control-Variate

pytorch-a2c-ppo-acktr

RandWireNN

reinforcement-learning-algorithms

Reinforcement-learning-with-tensorflow

Variational_Discriminator_Bottleneck