Beast code in Giters

Shibi He's repositories

Model-Free-Episodic-Control

This is the implementation of paper Model Free Episodic Control

Language:PythonMIT37 6 4

Q-Optimality-Tightening

This is my implementation of the Optimality Tightening

Language:PythonMIT37 3 3

Stanford-CS231n-assignments

The assignments of CS231n finished by me

Language:Jupyter Notebook2000

Poker-Fictitious-Play

Fictitious Self-play & Reinforcement Learning

Language:Python19 30

Active_learning_version1

Language:Python5 40

DoubanMovieSearch

Language:Python500

DQN_OpenAI_keras

This is the DQN implementation written by myself using OpenAI gym and keras.

Language:Python5 50

Machine_learning_Deng_Cai

Deng Cai's ML course

Language:Matlab500

Machine-Learning-Classifying-Emails-into-Spams-and-Hams

Naive Bayes and Logistic Regression

Language:Python2 20

deep_q_rl

Theano-based implementation of Deep Q-learning

Language:PythonBSD-3-Clause1 20

reinforcement-learning-an-introduction

Python code for Reinforcement Learning: An Introduction

Language:PythonApache-2.01 20

dqn-multigpus

Language:Python040

async-rl

Replicating "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)

Language:PythonMIT020

atfm_bpr

Bayesian Personalized Ranking Model with Attribute-to-Feature Mappings for Cold-Start Recommendation

Language:Python020

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT020

cvpr2015

Language:Jupyter Notebook020

DeepMind-Atari-Deep-Q-Learner

The original code from the DeepMind article + my tweaks

Language:Lua020

dqn

This is a very basic DQN implementation, which uses OpenAI's gym environment and Keras/Theano neural networks.

Language:PythonMIT020

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonMIT030

MixMHC

020

neural-networks-and-deep-learning

Code samples for my book "Neural Networks and Deep Learning"

Language:Python020

paper-notes

Some notes of papers I have read

020

Readings

000

ShibiHe.github.io

Language:HTML020

stanford_dl_ex

Programming exercises for the Stanford Unsupervised Feature Learning and Deep Learning Tutorial

Language:MatlabMIT020

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++Apache-2.0020

webgl-lessons

https://github.com/tparisi/webgl-lessons is now the officially maintained fork for this project

Language:HTMLMIT020

WeDream-android

Language:Java030