ImranRolo's repositories

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

deep-rl-docker

Docker image with OpenAI Gym, Baselines, MuJoCo and Roboschool, utilizing TensorFlow and JupyterLab.

Language:DockerfileLicense:MITStargazers:0Issues:1Issues:0

DeepMind-Advanced-Deep-Learning-and-Reinforcement-Learning

Advanced Deep Learning and Reinforcement Learning course taught at UCL in partnership with Deepmind

Stargazers:0Issues:1Issues:0

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Stargazers:0Issues:1Issues:0

PPO-Stein-Control-Variate

Proximal Policy Optimization with Stein Control Variates:

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pytorch-a2c-ppo-acktr

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

RandWireNN

Implementation of: "Exploring Randomly Wired Neural Networks for Image Recognition"

Language:PythonStargazers:0Issues:1Issues:0

reinforcement-learning-algorithms

This repository contains most of classic deep reinforcement learning algorithms, including - DQN, DDQN, Dueling Network, DDPG, A3C, PPO, TRPO. (More algorithms are still in progress)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

Variational_Discriminator_Bottleneck

Implementation (with some experimentation) of the paper titled "VARIATIONAL DISCRIMINATOR BOTTLENECK: IMPROVING IMITATION LEARNING, INVERSE RL, AND GANS BY CONSTRAINING INFORMATION FLOW" (arxiv -> https://arxiv.org/pdf/1810.00821.pdf)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0