WeiWang (wweichn)

wweichn

Geek Repo

Company:Zhejiang University

Github PK Tool:Github PK Tool

WeiWang's repositories

Tensorflow_2player_pong

A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow

Language:PythonStargazers:6Issues:2Issues:0

Actor-Critic-cart-pole

cart-pole by Advantage Actor-Critic (A2C)

Language:PythonStargazers:1Issues:0Issues:0

cs234

Assignments of Stanford cs234 in spring 2017.

Language:PythonStargazers:1Issues:2Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Asynchronous-Methods-for-Deep-Reinforcement-Learning

Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.

Language:PythonStargazers:0Issues:0Issues:0

CommNet

Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736

Language:LuaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

coursera-introduction-to-recommender-systems

The course assignments for Introduction to Recommender Systems at University of Minnesota.

Language:HTMLStargazers:0Issues:0Issues:0

cs231n

Assignments of Stanford cs231n in spring 2017.

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

deep-reinforcement-learning-papers

A list of recent papers regarding deep reinforcement learning

Stargazers:0Issues:0Issues:0

deep-rl

Collection of Deep Reinforcement Learning algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepMind-Atari-Deep-Q-Learner-2Player

Multiagent Cooperation and Competition with Deep Reinforcement Learning

Language:LuaStargazers:0Issues:0Issues:0

ijcai

writeup

Language:TeXStargazers:0Issues:2Issues:0

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

Stargazers:0Issues:0Issues:0

milq

Useful code of Manuel Ignacio López Quintero

Language:PHPLicense:MITStargazers:0Issues:2Issues:0

Mixed-Policy-Asynchronous-Deep-Q-Learning

Deep-learning version of WoLF-PHC, GIGA-WoLF, WPL, EMA-QL and PGA-APP

Language:PythonStargazers:0Issues:0Issues:0

Pong-game-kivy

A Pong desktop game for two players.

Language:PythonStargazers:0Issues:0Issues:0

Pytorch-NCE

The Noise Contrastive Estimation for softmax output written in Pytorch

License:MITStargazers:0Issues:0Issues:0

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

SR-GNN

Source code and datasets for the paper "Session-based Recommendation with Graph Neural Networks" (AAAI-19)

Language:PythonStargazers:0Issues:1Issues:0

sundry-musings

A repository of various daydreams

Language:PythonStargazers:0Issues:0Issues:0