CrazySssst's starred repositories

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:10432Issues:428Issues:165

sonnet

TensorFlow-based neural network library

Language:PythonLicense:Apache-2.0Stargazers:9736Issues:423Issues:193

tianshou

An elegant PyTorch deep reinforcement learning library.

Language:PythonLicense:MITStargazers:7664Issues:93Issues:727

Deep-Reinforcement-Learning-Algorithms-with-PyTorch

PyTorch implementations of deep reinforcement learning algorithms and environments

Language:PythonLicense:MITStargazers:5514Issues:107Issues:70

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:4105Issues:62Issues:946

football

Check out the new game server:

Language:PythonLicense:Apache-2.0Stargazers:3292Issues:94Issues:317

rlpyt

Reinforcement Learning in PyTorch

Language:PythonLicense:MITStargazers:2211Issues:52Issues:169

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonLicense:Apache-2.0Stargazers:1792Issues:29Issues:130

TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1273Issues:36Issues:66

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language:PythonLicense:NOASSERTIONStargazers:1189Issues:37Issues:103

procgen

Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments

Language:C++License:MITStargazers:991Issues:143Issues:75

awesome-game-ai

Awesome Game AI materials of Multi-Agent Reinforcement Learning

License:MITStargazers:733Issues:36Issues:0

deep-learning-uncertainty

Literature survey, paper reviews, experimental setups and a collection of implementations for baselines methods for predictive uncertainty estimation in deep learning models.

Language:Jupyter NotebookStargazers:593Issues:32Issues:2

minihack

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Language:PythonLicense:Apache-2.0Stargazers:465Issues:14Issues:40

atari-representation-learning

Code for "Unsupervised State Representation Learning in Atari"

Language:PythonLicense:MITStargazers:235Issues:17Issues:47

Awesome-Evolutionary-Reinforcement-Learning

Research Papers and Code Repository on the Integration of Evolutionary Algorithms and Reinforcement Learning

BEAR

Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction

CDS

[NeurIPS 2021] CDS achieves remarkable success in challenging benchmarks SMAC and GRF by balancing sharing and diversity.

Language:PythonLicense:Apache-2.0Stargazers:81Issues:1Issues:11

RE3

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Language:Jupyter NotebookStargazers:65Issues:3Issues:1

adversarially-motivated-intrinsic-goals

This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".

Language:PythonLicense:NOASSERTIONStargazers:62Issues:15Issues:4
Language:PythonLicense:MITStargazers:59Issues:4Issues:6

rapid

[ICLR 2021] Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments.

Language:PythonLicense:MITStargazers:54Issues:3Issues:3

adversarially-guided-actor-critic

AGAC: Adversarially Guided Actor-Critic

Language:PythonLicense:MITStargazers:47Issues:4Issues:3

DvD_ES

Code from the paper "Effective Diversity in Population Based Reinforcement Learning", presented as a spotlight at NeurIPS 2020. This is the Evolution Strategies implementation, but of course the method can be used for gradient based RL algorithms (e.g. TD3).

Language:PythonLicense:Apache-2.0Stargazers:44Issues:1Issues:1

EMC

The code for paper, "Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration", NeurIPS 2021.

mepol

Implementation of the MEPOL algorithm - A policy gradient method for task-agnostic exploration

Language:PythonStargazers:12Issues:4Issues:0

Anti-exploration-RL

Anti exploration in offline reinforcement learning

Language:Jupyter NotebookLicense:MITStargazers:8Issues:2Issues:0
Language:PythonLicense:MITStargazers:4Issues:2Issues:0