Beast code in Giters

PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF) and Extensions: N-step Bootstrapping, PER, Noisy Layer, Dueling Networks, and parallelization.

MIT000

RL-Causality

References at the Intersection of Causality and Reinforcement Learning

000

gym-domain

Reinforcement learning gyms for experimenting with domain generalization, domain adaptation, and robustness to domain shift

Language:PythonMIT000

gym-stochastic

Reinforcement learning gyms for experimenting with stochasticity

Language:Jupyter NotebookMIT300

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4

Language:PythonMIT000

roberts-creek-adventure

Simple text-only adventure game system for educational purposes, made at Roberts Creek Code Club

Language:Python100

dnd_battle_system

Simple text-only battle system for educational purposes, made at Roberts Creek Code Club

Language:Python000

deep-rl-tf2

🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2

Apache-2.0000

dist-rl-tf2

🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2. [C51, QR-DQN, IQN]

Apache-2.0000

SEPT

Single Episode Policy Transfer in Reinforcement Learning

BSD-3-Clause000

show-notes

Changelog episode show notes in Markdown format 📝

000

alphaxos

Deep Reinforcement Learning with Self-Play

Language:PythonMIT1100

BCQ

PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"

000

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

MIT000

machina

Deep Reinforcement Learning framework

Language:PythonMIT000

playground

PlayGround: AI Research into Multi-Agent Learning.

Language:PythonApache-2.0000

obstacle-tower-challenge

Starter Kit for the Unity Obstacle Tower challenge

Language:PythonApache-2.0000

gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

Language:PythonMIT000

dopamine

Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.

Language:Jupyter NotebookApache-2.0000

quantile-regression-dqn-pytorch

Quantile Regression DQN a Minimal Working Example

MIT000

probabilistic-modelling-notebooks

A collection of Jupyter notebooks on Probabilistic Models.

Language:Jupyter NotebookGPL-3.0100

pathway

Robin Ranjit Singh Chauhan's repositories

research_howto

simpletransformers

spy_game_newest

dcapy

DeepRLInTheWorld

rllib_tutorials

mimic_sepsis

storytime

crosslang_embed

FQF-and-Extensions

RL-Causality

gym-domain

gym-stochastic

alpha-zero-general

roberts-creek-adventure

dnd_battle_system

deep-rl-tf2

dist-rl-tf2

SEPT

show-notes

alphaxos

BCQ

stable-baselines

machina

playground

obstacle-tower-challenge

gpt-2

dopamine

quantile-regression-dqn-pytorch

probabilistic-modelling-notebooks