Robin Ranjit Singh Chauhan's repositories
gym-stochastic
Reinforcement learning gyms for experimenting with stochasticity
probabilistic-modelling-notebooks
A collection of Jupyter notebooks on Probabilistic Models.
roberts-creek-adventure
Simple text-only adventure game system for educational purposes, made at Roberts Creek Code Club
crosslang_embed
Process multilingual phrases using embeddings. Combines translation, phrase embedding, embedding search, and embedding visualization.
gym-domain
Reinforcement learning gyms for experimenting with domain generalization, domain adaptation, and robustness to domain shift
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
dcapy
Decision curve analysis library for Python
deep-rl-tf2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
DeepRLInTheWorld
From search engines, to science, to robotics, this repository is meant to showcase the use of reinforcement learning in the world.
dist-rl-tf2
🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2. [C51, QR-DQN, IQN]
dnd_battle_system
Simple text-only battle system for educational purposes, made at Roberts Creek Code Club
FQF-and-Extensions
PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF) and Extensions: N-step Bootstrapping, PER, Noisy Layer, Dueling Networks, and parallelization.
mimic_sepsis
Sepsis cohort from MIMIC dataset
obstacle-tower-challenge
Starter Kit for the Unity Obstacle Tower challenge
playground
PlayGround: AI Research into Multi-Agent Learning.
quantile-regression-dqn-pytorch
Quantile Regression DQN: a Minimal Working Example
RL-Causality
References at the Intersection of Causality and Reinforcement Learning
rllib_tutorials
Ray RLlib tutorial material
SEPT
Single Episode Policy Transfer in Reinforcement Learning
show-notes
Changelog episode show notes in Markdown format 📝
simpletransformers
Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms