Shubham Subhnil (SSubhnil)

SSubhnil

Geek Repo

Company:Trinity College Dublin

Location:Dublin, Ireland

Github PK Tool:Github PK Tool

Shubham Subhnil's repositories

RacingCARLA

Learning Model Predictive Control (LMPC) for autonomous racing in CARLA 3D environment.

Language:PythonLicense:MITStargazers:22Issues:2Issues:0

BAC-DAC-gym

Bayesian Actor-Critic with Neural Networks. Developing an OpenAI Gym toolkit for Bayesian AC reinforcement learning.

Language:PythonLicense:GPL-3.0Stargazers:7Issues:2Issues:0

CoGen_Benchmarking

Benchmarking existing RL algorithms including model-free and model-based approaches on confounded versions of popular environments. Tests generalization and sample efficiency.

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0
Language:PythonLicense:MITStargazers:1Issues:2Issues:0

Vehicle-Dynamics-Toolkit

Some advanced tools for race car design - Steady state and transient dynamics, Tyre Data synthesis

Language:MATLABLicense:Apache-2.0Stargazers:1Issues:2Issues:0

Causal-Gridworld

Testing the causal implications of the wind in the gridworld environment. The wind is the confounder.

Language:PythonStargazers:0Issues:0Issues:0

CausalBench

Official data and code for our paper Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

CausalCuriosity-test

Causal Curiosity fork for testing in confounded environments.

Language:PythonStargazers:0Issues:0Issues:0

CDL-bench

Benchmarking CDL in confounded MDP and POMDP settings

Language:PythonStargazers:0Issues:0Issues:0

D4PG-bench

Benchmarking D4PG in confounded environements.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dreamerv3-benchmod

Modifying DreamerV3 for benchmarking in confounded environments

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Lane-Lines-Detection-Python-OpenCV

Lane Lines Detection using Python and OpenCV for self-driving car

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

mamba-test

Meta-RL Model-Based Algorithm - Confounding tests

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

dreamer-new

Updated version of DreamerV3 cloned from danijar/dreamerv3

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dv3-torch

Benchmarking DreamerV3 with Plan2Explore.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

FCD-bench

Fine-Grained Causal Dynamics Learning with Quantization for Improving Robustness in Reinforcement Learning (ICML 2024)

License:MITStargazers:0Issues:0Issues:0

GRADER-bench

Repository for benchmarking GRADER in confounded environments for zero and few-shot generalization.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

mocoda-b

Testing MoCoDA in DM Control Suite and confounded environments.

Stargazers:0Issues:0Issues:0

mpo-bench

Baseline tests on MPO with unobserved confounders

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0

MWM-bench

Benchmarking MWM in confounded environments

License:NOASSERTIONStargazers:0Issues:0Issues:0

P2P-bench

Code accompanying paper "Plan To Predict: Learning an Uncertainty-Foreseeing Model for Model-Based Reinforcement Learning".

Stargazers:0Issues:0Issues:0

RIA-bench

Benchmarking RIA in confounded environments for zero and few-shot generalization. Now compatible with TF2.

Language:PythonStargazers:0Issues:0Issues:0

RIA_base

RIA base version. With new Walker environment similar to DM Control Suite physics and reward function.

Language:PythonStargazers:0Issues:0Issues:0

rl2-bench

Implementation of 'RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning'

Stargazers:0Issues:0Issues:0

sac-bench

PyTorch implementation of Soft Actor-Critic (SAC) for Unobserved Confounders

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

slac-bench

Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model

License:MITStargazers:0Issues:0Issues:0

TMCL-b

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)

Stargazers:0Issues:0Issues:0