Jack Parker-Holder's repositories

DvD_ES

Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is the Evolution Strategies implementation, but the method can also be applied to gradient-based RL algorithms (e.g. TD3).

Language: Python | License: Apache-2.0 | Stargazers: 44 | Issues: 1 | Issues: 1

PB2

Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.

Language: Python | License: MIT | Stargazers: 21 | Issues: 1 | Issues: 0

ASEBO

Code to run the ASEBO algorithm from the paper "From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization". Please get in touch if interested!

Language: Python | License: MIT | Stargazers: 16 | Issues: 1 | Issues: 4
Language: Python | Stargazers: 5 | Issues: 0 | Issues: 0

ES

Simple Evolution Strategies (ES) implementation using Ray and NumPy

Language: Python | License: MIT | Stargazers: 2 | Issues: 1 | Issues: 0

SAC-PyTorch

🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation

Language: Python | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0
Language: JavaScript | Stargazers: 1 | Issues: 0 | Issues: 0
Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

Data-Mining_Small-Caps

Using data mining techniques to classify small cap equity returns

Stargazers: 1 | Issues: 0 | Issues: 0

hanabi_SAD

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

Language: C++ | License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0
Language: CSS | Stargazers: 1 | Issues: 0 | Issues: 0

OffCon3

📴 OffCon^3: SOTA PyTorch SAC and TD3 implementations (arXiv: 2101.11331)

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

ReadyPolicyOne

🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arXiv: 2002.02693)

License: NOASSERTION | Stargazers: 1 | Issues: 0 | Issues: 0

bsuite

bsuite is a collection of carefully designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

dataviz-forum

This will serve as the public forum repository to which anyone can post. [Don't put your graded homework here!]

Language: HTML | Stargazers: 0 | Issues: 0 | Issues: 0

deep-neuroevolution

Deep Neuroevolution

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

dreamerv2

Mastering Atari with Discrete World Models

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch_notebooks

PyTorch tutorial notebooks

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

qmss2017

Materials for the 2017 QMSS Python Workshop

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

rl-generalization-paper

A list of papers regarding generalization in (deep) reinforcement learning

Stargazers: 0 | Issues: 0 | Issues: 0

Speeding-Up-CNNs-Using-Random-Feature-Maps

Final project for big data and machine learning.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0