Shibi He's repositories

Model-Free-Episodic-Control

This is the implementation of paper Model Free Episodic Control

Language:PythonLicense:MITStargazers:37Issues:6Issues:4

Q-Optimality-Tightening

This is my implementation of the Optimality Tightening

Language:PythonLicense:MITStargazers:37Issues:3Issues:3

Stanford-CS231n-assignments

The assignments of CS231n finished by me

Language:Jupyter NotebookStargazers:20Issues:0Issues:0

Poker-Fictitious-Play

Fictitious Self-play & Reinforcement Learning

Language:PythonStargazers:19Issues:3Issues:0
Language:PythonStargazers:5Issues:0Issues:0

DQN_OpenAI_keras

This is the DQN implementation written by myself using OpenAI gym and keras.

Language:PythonStargazers:5Issues:5Issues:0

Machine_learning_Deng_Cai

Deng Cai's ML course

Language:MatlabStargazers:5Issues:0Issues:0
Language:PythonStargazers:2Issues:2Issues:0

deep_q_rl

Theano-based implementation of Deep Q-learning

Language:PythonLicense:BSD-3-ClauseStargazers:1Issues:2Issues:0

reinforcement-learning-an-introduction

Python code for Reinforcement Learning: An Introduction

Language:PythonLicense:Apache-2.0Stargazers:1Issues:2Issues:0
Language:PythonStargazers:0Issues:4Issues:0

async-rl

Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

atfm_bpr

Bayesian Personalized Ranking Model with Attribute-to-Feature Mappings for Cold-Start Recommendation

Language:PythonStargazers:0Issues:2Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0

DeepMind-Atari-Deep-Q-Learner

The original code from the DeepMind article + my tweaks

Language:LuaStargazers:0Issues:2Issues:0

dqn

This is a very basic DQN implementation, which uses OpenAI's gym environment and Keras/Theano neural networks.

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:0Issues:3Issues:0
Stargazers:0Issues:2Issues:0

neural-networks-and-deep-learning

Code samples for my book "Neural Networks and Deep Learning"

Language:PythonStargazers:0Issues:2Issues:0

paper-notes

Some notes of papers I have read

Stargazers:0Issues:2Issues:0
Stargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

stanford_dl_ex

Programming exercises for the Stanford Unsupervised Feature Learning and Deep Learning Tutorial

Language:MatlabLicense:MITStargazers:0Issues:2Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:2Issues:0

webgl-lessons

https://github.com/tparisi/webgl-lessons is now the officially maintained fork for this project

Language:HTMLLicense:MITStargazers:0Issues:2Issues:0
Language:JavaStargazers:0Issues:3Issues:0