SSubhnil

Shubham Subhnil's repositories

RacingCARLA

Learning Model Predictive Control (LMPC) for autonomous racing in CARLA 3D environment.

Language:PythonMIT22 20

BAC-DAC-gym

Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.

Language:PythonGPL-3.07 20

CoGen_Benchmarking

Benchmarking existing RL algorithms including model-free and model-based approaches on confounded versions of popular environments. Tests generalization and sample efficiency.

Language:PythonApache-2.0100

RacingLMPC

Language:PythonMIT1 20

Vehicle-Dynamics-Toolkit

Some advanced tools for race car design - Steady state and transient dynamics, Tyre Data synthesis

Language:MATLABApache-2.01 20

Causal-Gridworld

Testing the causal implications of the wind in the gridworld environment. The wind is the confounder.

Language:Python000

CausalBench

Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning

Language:PythonMIT000

CausalCuriosity-test

Causal Curiosity fork for testing in confounded environments.

Language:Python000

CDL-bench

Benchmarking CDL in confounded MDP and POMDP settings

Language:Python000

D4PG-bench

Benchmarking D4PG in confounded environements.

Language:PythonMIT000

dreamerv3-benchmod

Modifying DreamerV3 for benchmarking in confounded environments

Language:PythonMIT000

Lane-Lines-Detection-Python-OpenCV

Lane Lines Detection using Python and OpenCV for self-driving car

Language:Jupyter Notebook010

mamba-test

Meta-RL Model-Based Algorithm - Confounding tests

Language:PythonNOASSERTION000

dreamer-new

Updated version of DreamerV3 cloned from danijar/dreamerv3

Language:PythonMIT000

dv3-torch

Benchmarking DreamerV3 with Plan2Explore.

Language:PythonMIT000

FCD-bench

Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)

MIT000

GRADER-bench

Repository for benchmarking GRADER in confounded environments for zero and few-shot generalization.

Language:PythonMIT000

mocoda-b

Testing MoCoDA in DM Control Suite and confounded environments.

000

mpo-bench

Baseline tests on MPO with unobserved confounders

Language:PythonGPL-3.0000

MWM-bench

Benchmarking MWM in confounded environments

NOASSERTION000

P2P-bench

Code accompanying paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".

000

RIA-bench

Benchmarking RIA in confounded environments for zero and few-shot generalization. Now compatible with TF2.

Language:Python000

RIA_base

RIA base version. With new Walker environment similar to DM Control Suite physics and reward function.

Language:Python000

rl2-bench

Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'

000

sac-bench

PyTorch implementation of Soft Actor-Critic (SAC) for Unobserved Confounders

Language:Jupyter NotebookMIT000

slac-bench

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

MIT000

TMCL-b

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)

000