Breakend

Peter Henderson's repositories

RLSSContinuousControlTutorial

Tutorial on continuous control at Reinforcement Learning Summer School 2017.

Language:Python34 60

MultiStepBootstrappingInRL

Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.

Language:Python13 3 2

SarsaVsExpectedSarsa

An a bias-variance tradeoff of Sarsa vs. Expected Sarsa with experiments.

Language:Jupyter Notebook8 30

CMACvTileCode

Language:Python7 30

ValuePolicyIterationVariations

Experiments testing variants of Value and Policy iterations.

Language:Jupyter Notebook5 30

ExperimentsInIRL

Language:Python400

TemporalYolo

Experiments on temporal YOLO

Language:Python4 4 1

DeepMultiObjectTracking

Language:Python200

drqawrapper

Language:Python200

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT1 20

Option-Critic-Turing-Machines

A development toybox and pitch for integrating the option-critic architecture with neural turing machines.

Language:Jupyter Notebook100

TemporalDeepQLearning

Experiments in temporal deep Q learning

Language:PythonMIT100

BayesianExploration

Language:Python000

BrachialNerveSegmentation

Experiments for Kaggle competition (got top 10% with one of the models): https://www.kaggle.com/c/ultrasound-nerve-segmentation

Language:Python020

Conv2Conv

Language:Python000

DialogDatasetStats

Language:Python000

DialogueSystemPresentation

Presentation for COMP-599 on a dialogue response task

Language:JavaScriptMIT000

DrQA

A pytorch implementation of Reading Wikipedia to Answer Open-Domain Questions.

Language:Python000

EndToEndPresentation

Presentation for reading group on Levine et al. End-to-end Learning of Visuomotor Policies

Language:JavaScriptMIT000

gym-navigation-2d

Language:Jupyter Notebook000

hate-speech-and-offensive-language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Language:Jupyter NotebookMIT000

Imitation_Learning_RL

Imitation Learning in Deep RL

Language:Python000

modular_rl

Implementation of TRPO and related algorithms

Language:PythonMIT000

parallel-trpo

A parallel version of Trust Region Policy Optimization

Language:Python000

pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization

Language:PythonMIT000

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language:PythonNOASSERTION000

SpotIt

See the Spotted events happening near you

Language:JavaScript000

UnifiedPolicyGradients

Language:Python000

yolo_tensorflow

Tensorflow implementation of YOLO, including training and test phase.

Language:Python000

Zennectome

A library for analysis of connectomes. Provides CLI's for community detection, unifying several external libraries to make analysis easy.

Language:Python000