wweichn

WeiWang's repositories

Tensorflow_2player_pong

A code reimplementation of DeepMind's "Multiagent Cooperation and Competition with Deep Reinforcement Learning" with Tensorflow

Language:Python6 20

Actor-Critic-cart-pole

cart-pole by Advantage Actor-Critic (A2C)

Language:Python100

cs234

Assignments of Stanford cs234 in spring 2017.

Language:Python1 20

Asynchronous-Methods-for-Deep-Reinforcement-Learning

Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.

Language:Python000

CommNet

Neural network model, suitable for multi-agent learning. https://arxiv.org/abs/1605.07736

Language:LuaNOASSERTION000

coursera-introduction-to-recommender-systems

The course assignments for Introduction to Recommender Systems at University of Minnesota.

Language:HTML000

cs231n

Assignments of Stanford cs231n in spring 2017.

Language:Jupyter Notebook000

ddpg-pendulum

ddpg

Language:Python000

deep-reinforcement-learning-papers

A list of recent papers regarding deep reinforcement learning

000

deep-rl

Collection of Deep Reinforcement Learning algorithms

Language:PythonMIT000

DeepMind-Atari-Deep-Q-Learner-2Player

Multiagent Cooperation and Competition with Deep Reinforcement Learning

Language:Lua000

ijcai

writeup

Language:TeX020

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

000

milq

Useful code of Manuel Ignacio López Quintero

Language:PHPMIT020

Mixed-Policy-Asynchronous-Deep-Q-Learning

Deep-learning version of WoLF-PHC, GIGA-WoLF, WPL, EMA-QL and PGA-APP

Language:Python000

Pong-game-kivy

A Pong desktop game for two players.

Language:Python000

Pytorch-NCE

The Noise Contrastive Estimation for softmax output written in Pytorch

MIT000

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookMIT000

SR-GNN

Source code and datasets for the paper "Session-based Recommendation with Graph Neural Networks" (AAAI-19)

Language:Python010

sundry-musings

A repository of various daydreams

Language:Python000