Beast code in Giters

LARS12llt's repositories

TDEOC

Original Code for paper: Diversity Enriched Option-Critic

Language:PythonMIT100

adversarial-surprise

Explore and Control with Adversarial Surprise

Language:Python000

brax

Massively parallel rigidbody physics simulation on accelerator hardware.

Language:Jupyter NotebookApache-2.0000

CollaQ

A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"

Language:PythonNOASSERTION000

distributedRL

A framework for easy prototyping of distributed reinforcement learning algorithms

Language:Python000

dqn_zoo

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

Language:PythonApache-2.0000

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonMIT000

DRIML

Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)

Language:Python000

EfficientZero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Language:PythonGPL-3.0000

h-baselines

A repository of high-performing hierarchical reinforcement learning models and algorithms.

MIT000

hopfield-layers

Hopfield Networks is All You Need

Language:PythonNOASSERTION000

jax-rl

Jax (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

MIT000

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

NOASSERTION000

mrcl

Code for the NeurIPS19 paper "Meta-Learning Representations for Continual Learning"

000

muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

MIT000

muzero-general

MuZero

MIT000

procgen-competition

Sample efficiency and generalisation in reinforcement learning using procedural generation.

Language:PythonApache-2.0000

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

MIT000

LARS12llt

LARS12llt's repositories

TDEOC

adversarial-surprise

brax

CollaQ

dice_rl

distributedRL

dqn_zoo

dreamerv2

DRIML

EfficientZero

google-research

h-baselines

hopfield-layers

ibsgd

jax-rl

level-replay

mrcl

muzero

muzero-general

procgen-competition

pytorch-a2c-ppo-acktr-gail

pytorch_sac_ae

rad

RE3

rlpyt

seed_rl

Testing

VE-principle-for-model-based-RL

warp-drive

xagents