aistrych's repositories

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

RecurrentHighwayNetworks

Recurrent Highway Networks - Implementations for Tensorflow, Torch7, Theano and Brainstorm

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

keras-rl

Deep Reinforcement Learning for Keras.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

residual_block_keras

Residual network block in Keras

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0

A3C

Asynchronous Advantage Actor-Critic (A3C) algorithms and Progressive Neural Networks implemented in TensorFlow.

Language: Python | Stargazers: 0 | Issues: 0

tensorflow_with_latest_papers

Implementation of Newest RNN and Seq2Seq Features

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

tf-agent

tensorflow reinforcement learning agents for OpenAI gym environments

Language: Python | Stargazers: 0 | Issues: 0

DNI-tensorflow

DNI (Decoupled Neural Interfaces using Synthetic Gradients) implementation in TensorFlow

Language: Python | Stargazers: 0 | Issues: 0

Atari

Persistent advantage learning dueling double DQN for the Arcade Learning Environment

Language: Lua | License: MIT | Stargazers: 0 | Issues: 0

ufcnn-keras

Implementation of UFCNN in Keras

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0

async_deep_reinforce

Asynchronous Methods for Deep Reinforcement Learning

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

highway-fcn

Simple fully-connected highway networks using TensorFlow and Fomoro.

Language: Python | Stargazers: 0 | Issues: 0

highway-cnn

Simple convolutional highway networks using TensorFlow and Fomoro.

Language: Python | Stargazers: 0 | Issues: 0

deep_trader

This project applies reinforcement learning to the stock market: an agent tries to learn to trade. The goal is to check whether the agent can learn to read the tape. The project is dedicated to my hero in life, the great Jesse Livermore.

Language: Python | Stargazers: 0 | Issues: 0

ufcnn

Implementation of Undecimated Fully Convolutional Neural Network for time series modeling

Language: Python | Stargazers: 0 | Issues: 0

keras-resnet

Residual networks implementation using Keras-1.0 functional API

Language: Python | Stargazers: 0 | Issues: 0

async_deep_reinforce-1

A3C implementation which trains an agent for a small MDP.

Language: Python | Stargazers: 0 | Issues: 0

RAM

"Recurrent Models of Visual Attention" in TensorFlow

Language: Python | Stargazers: 0 | Issues: 0

snli-entailment

attention model for entailment on SNLI corpus

Language: Python | Stargazers: 0 | Issues: 0

async-rl

Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

genstake-newer

Genstake - fork for historic purposes ;)

Language: C++ | License: MIT | Stargazers: 0 | Issues: 0

Associative_LSTM

LSTM with associative memory cells (http://arxiv.org/abs/1602.03032)

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

Asynchronous-Methods-for-Deep-Reinforcement-Learning

Based on a paper from Google DeepMind (http://arxiv.org/pdf/1602.01783v1.pdf), I developed a new version of DQN that explores with multiple threads instead of using memory replay. I used the one-step Q-learning pseudocode, and Pong can now be trained in less than 20 hours without any GPU or network distribution.

Language: Python | Stargazers: 0 | Issues: 0

resnet-tf

ResNet Implementation in TensorFlow

Language: Python | Stargazers: 0 | Issues: 0

DARQN

Deep Attention Recurrent Q-Network

Language: Lua | Stargazers: 0 | Issues: 0

DropoutUncertaintyDemos

What My Deep Model Doesn't Know...

Language: JavaScript | License: MIT | Stargazers: 0 | Issues: 0

highway-networks

An implementation of Highway Networks in Caffe

Language: C++ | License: NOASSERTION | Stargazers: 0 | Issues: 0