LARS12llt's repositories

TDEOC

Original Code for paper: Diversity Enriched Option-Critic

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

adversarial-surprise

Explore and Control with Adversarial Surprise

Language:PythonStargazers:0Issues:0Issues:0

brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CollaQ

A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

distributedRL

A framework for easy prototyping of distributed reinforcement learning algorithms

Language:PythonStargazers:0Issues:0Issues:0

dqn_zoo

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DRIML

Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)

Language:PythonStargazers:0Issues:0Issues:0

EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

google-research

Google Research

License:Apache-2.0Stargazers:0Issues:0Issues:0

h-baselines

A repository of high-performing hierarchical reinforcement learning models and algorithms.

License:MITStargazers:0Issues:0Issues:0

hopfield-layers

Hopfield Networks is All You Need

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

jax-rl

Jax (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

License:MITStargazers:0Issues:0Issues:0

level-replay

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

License:NOASSERTIONStargazers:0Issues:0Issues:0

mrcl

Code for the NeurIPS19 paper "Meta-Learning Representations for Continual Learning"

Stargazers:0Issues:0Issues:0

muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

procgen-competition

Sample efficiency and generalisation in reinforcement learning using procedural generation.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

License:MITStargazers:0Issues:0Issues:0

pytorch_sac_ae

PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

rad

RAD: Reinforcement Learning with Augmented Data

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

RE3

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Stargazers:0Issues:0Issues:0

rlpyt

Reinforcement Learning in PyTorch

License:MITStargazers:0Issues:0Issues:0

seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Testing

CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning

Language:PythonStargazers:0Issues:0Issues:0

VE-principle-for-model-based-RL

Repository for ML Reproducibility Challenge 2020 for the Neurips paper, "The Value Equivalence Principle for Model-Based Reinforcement Learning"

License:MITStargazers:0Issues:0Issues:0
License:BSD-3-ClauseStargazers:0Issues:0Issues:0

xagents

Train, tune, and use reinforcement learning agents within minutes directly or through command line.

License:MITStargazers:0Issues:0Issues:0