Peter Henderson's repositories

RLSSContinuousControlTutorial

Tutorial on continuous control at Reinforcement Learning Summer School 2017.

Language:PythonStargazers:34Issues:6Issues:0

MultiStepBootstrappingInRL

Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.

SarsaVsExpectedSarsa

An a bias-variance tradeoff of Sarsa vs. Expected Sarsa with experiments.

Language:Jupyter NotebookStargazers:8Issues:3Issues:0
Language:PythonStargazers:7Issues:3Issues:0

ValuePolicyIterationVariations

Experiments testing variants of Value and Policy iterations.

Language:Jupyter NotebookStargazers:5Issues:3Issues:0
Language:PythonStargazers:4Issues:0Issues:0

TemporalYolo

Experiments on temporal YOLO

Language:PythonStargazers:2Issues:0Issues:0
Language:PythonStargazers:2Issues:0Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

Option-Critic-Turing-Machines

A development toybox and pitch for integrating the option-critic architecture with neural turing machines.

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

TemporalDeepQLearning

Experiments in temporal deep Q learning

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

BrachialNerveSegmentation

Experiments for Kaggle competition (got top 10% with one of the models): https://www.kaggle.com/c/ultrasound-nerve-segmentation

Language:PythonStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

DialogueSystemPresentation

Presentation for COMP-599 on a dialogue response task

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

DrQA

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

Language:PythonStargazers:0Issues:0Issues:0

EndToEndPresentation

Presentation for reading group on Levine et al. End-to-end Learning of Visuomotor Policies

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Imitation_Learning_RL

Imitation Learning in Deep RL

Language:PythonStargazers:0Issues:0Issues:0

modular_rl

Implementation of TRPO and related algorithms

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

parallel-trpo

A parallel version of Trust Region Policy Optimization

Language:PythonStargazers:0Issues:0Issues:0

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

SpotIt

See the Spotted events happening near you

Language:JavaScriptStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

yolo_tensorflow

Tensorflow implementation of YOLO, including training and test phase.

Language:PythonStargazers:0Issues:0Issues:0

Zennectome

A library for analysis of connectomes. Provides CLI's for community detection, unifying several external libraries to make analysis easy.

Language:PythonStargazers:0Issues:0Issues:0