jparkerholder

Jack Parker-Holder's repositories

DvD_ES

Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is the Evolution Strategies implementation, but of course the method can be used for gradient based RL algorithms (e.g. TD3).

Language:PythonApache-2.044 1 1

PB2

Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.

Language:PythonMIT21 10

ASEBO

Code to run the ASEBO algorithm from the paper: From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization... please get in touch if interested!!

Language:PythonMIT16 1 4

procgen_autorl

Language:Python500

ES

Simple ES implementation using ray and numpy

Language:PythonMIT2 10

SAC-PyTorch

🧶 Minimal PyTorch Soft Actor Critic (SAC) implementation

Language:PythonMIT200

autorl.github.io

Language:JavaScript100

brain-tokyo-workshop

🧠🗼

Language:PythonApache-2.0100

Data-Mining_Small-Caps

Using data mining techniques to classify small cap equity returns

100

hanabi_SAD

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

Language:C++NOASSERTION100

jparkerholder.github.io

Language:CSS100

OffCon3

📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)

Language:PythonMIT100

ray

An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Language:PythonApache-2.0100

ReadyPolicyOne

🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)

NOASSERTION100

bsuite

bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent

Apache-2.0000

continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Language:PythonMIT000