shangxing2015

shangxing2015

Geek Repo

0

following

0

stars

Github PK Tool:Github PK Tool

shangxing2015's repositories

mosquitto

Mosquitto

Language:CLicense:NOASSERTIONStargazers:0Issues:0Issues:0

TensorFlow-Examples

TensorFlow Tutorial and Examples for beginners

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

TensorFlow-Tutorials-for-Time-Series

TensorFlow Tutorial for Time Series Prediction

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

deep-rl

Collection of Deep Reinforcement Learning algorithms

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

dqn-atari

A TensorFlow based implementation of the DeepMind Atari playing "Deep Q Learning" Agent

Language:PythonStargazers:0Issues:0Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

deep-reinforcement-learning-papers

A list of recent papers regarding deep reinforcement learning

Stargazers:0Issues:0Issues:0

tensorflow-deepq

A deep Q learning demonstration using Google Tensorflow

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

async-rl

Tensorflow + Keras + OpenAI Gym implementation of 1-step Q Learning from "Asynchronous Methods for Deep Reinforcement Learning"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

async_deep_reinforce

Asynchronous Methods for Deep Reinforcement Learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

reinforcejs

Reinforcement Learning Agents in Javascript (Dynamic Programming, Temporal Difference, Deep Q-Learning, Stochastic/Deterministic Policy Gradients)

Language:HTMLStargazers:0Issues:0Issues:0

DQN-Atari-Tensorflow

Simplest Version of playing Atari with Deep Q Learning in Tensorflow

Language:PythonStargazers:0Issues:0Issues:0

Asynchronous-Methods-for-Deep-Reinforcement-Learning

Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.

Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

blog

for code created as part of http://studywolf.wordpress.com

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

coursera-android-labs

Skeletons and Tests - Programming Mobile Applications for Android Handheld Systems

Language:JavaLicense:MITStargazers:0Issues:0Issues:0