WeiWang's repositories
Tensorflow_2player_pong
A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow
Actor-Critic-cart-pole
cart-pole by Advantage Actor-Critic (A2C)
Asynchronous-Methods-for-Deep-Reinforcement-Learning
Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.
CommNet
Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736
coursera-introduction-to-recommender-systems
The course assignments for Introduction to Recommender Systems at University of Minnesota.
cs231n
Assignments of Stanford cs231n in spring 2017.
ddpg-pendulum
ddpg
deep-reinforcement-learning-papers
A list of recent papers regarding deep reinforcement learning
deep-rl
Collection of Deep Reinforcement Learning algorithms
DeepMind-Atari-Deep-Q-Learner-2Player
Multiagent Cooperation and Competition with Deep Reinforcement Learning
MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
Mixed-Policy-Asynchronous-Deep-Q-Learning
Deep-learning version of WoLF-PHC, GIGA-WoLF, WPL, EMA-QL and PGA-APP
Pong-game-kivy
A Pong desktop game for two players.
Pytorch-NCE
The Noise Contrastive Estimation for softmax output written in Pytorch
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
sundry-musings
A repository of various daydreams