Robin Ranjit Singh Chauhan's repositories
simpletransformers
Transformers for Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
dcapy
Decision curve analysis library for Python
DeepRLInTheWorld
From search engines, to science, to robotics, this reposity is meant to showcase the use of reinforcement learning in the world..
rllib_tutorials
Ray RLlib tutorial material
mimic_sepsis
Sepsis cohort from MIMIC dataset
storytime
Storytime Project for python class at IdeasSpace Grades 5-7
crosslang_embed
Process multilingual phrases using embeddings. Combines translation, phrase embedding, embedding search, and embedding visualization.
FQF-and-Extensions
PyTorch implementation of the state-of-the-art distributional reinforcement learning algorithm Fully Parameterized Quantile Function (FQF) and Extensions: N-step Bootstrapping, PER, Noisy Layer, Dueling Networks, and parallelization.
RL-Causality
References at the Intersection of Causality and Reinforcement Learning
gym-domain
Reinforcement learning gyms for experimenting with domain generalization, domain adaptation, and robustness to domain shift
gym-stochastic
Reinforcement learning gyms for experimenting with stochasticity
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
roberts-creek-adventure
Simple text-only adventure game system for educational purposes, made at Roberts Creek Code Club
dnd_battle_system
Simple text-only battle system for educational purposes, made at Roberts Creek Code Club
deep-rl-tf2
🐋 Simple implementations of various popular Deep Reinforcement Learning algorithms using TensorFlow2
dist-rl-tf2
🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2. [C51, QR-DQN, IQN]
SEPT
Single Episode Policy Transfer in Reinforcement Learning
show-notes
Changelog episode show notes in Markdown format 📝
BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
machina
Deep Reinforcement Learning framework
playground
PlayGround: AI Research into Multi-Agent Learning.
obstacle-tower-challenge
Starter Kit for the Unity Obstacle Tower challenge
gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
quantile-regression-dqn-pytorch
Quantile Regression DQN a Minimal Working Example
probabilistic-modelling-notebooks
A collection of Jupyter notebooks on Probabilistic Models.