Shubham Subhnil (SSubhnil)

Company: Trinity College Dublin

Location: Dublin, Ireland

Shubham Subhnil's starred repositories

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language: Python | License: Apache-2.0 | Stargazers: 29613 | Issues: 325 | Issues: 5438

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language: Jupyter Notebook | License: MIT | Stargazers: 20358 | Issues: 860 | Issues: 155

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | Stargazers: 15584 | Issues: 648 | Issues: 850

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 13023 | Issues: 325 | Issues: 318

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language: C++ | License: NOASSERTION | Stargazers: 12366 | Issues: 407 | Issues: 1984

FinRL

FinRL: Financial Reinforcement Learning. 🔥

Language: Jupyter Notebook | License: MIT | Stargazers: 9568 | Issues: 199 | Issues: 711

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 8608 | Issues: 60 | Issues: 1450

ai-deadlines

⏰ AI conference deadline countdowns

Language: JavaScript | License: MIT | Stargazers: 5541 | Issues: 99 | Issues: 92

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language: Python | License: Apache-2.0 | Stargazers: 3692 | Issues: 130 | Issues: 407

DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language: Python | License: Apache-2.0 | Stargazers: 2910 | Issues: 22 | Issues: 199

HighwayEnv

A minimalist environment for decision-making in autonomous driving

Language: Python | License: MIT | Stargazers: 2538 | Issues: 29 | Issues: 455

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language: Python | License: NOASSERTION | Stargazers: 2517 | Issues: 20 | Issues: 368

zhusuan

A probabilistic programming library for Bayesian deep learning, generative models, based on Tensorflow

Language: Python | License: MIT | Stargazers: 2199 | Issues: 143 | Issues: 61

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language: Python | License: MIT | Stargazers: 2163 | Issues: 42 | Issues: 590

FidelityFX-FSR

FidelityFX Super Resolution

dreamerv3

Mastering Diverse Domains through World Models

Language: Python | License: MIT | Stargazers: 1237 | Issues: 26 | Issues: 127

robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Language: Python | License: NOASSERTION | Stargazers: 1214 | Issues: 24 | Issues: 358

DRL-code-pytorch

Concise PyTorch implementations of DRL algorithms, including REINFORCE, A2C, DQN, PPO (discrete and continuous), DDPG, TD3, SAC.

Language: Python | License: MIT | Stargazers: 997 | Issues: 5 | Issues: 15

xmanager

A platform for managing machine learning experiments

Language: Python | License: Apache-2.0 | Stargazers: 813 | Issues: 30 | Issues: 32

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Language: Python | License: GPL-3.0 | Stargazers: 785 | Issues: 23 | Issues: 23

dreamerv3-torch

Implementation of Dreamer v3 in pytorch.

Language: Python | License: MIT | Stargazers: 374 | Issues: 10 | Issues: 57

Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Language: Python | License: NOASSERTION | Stargazers: 253 | Issues: 11 | Issues: 51

CausalWorld

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

Language: Python | License: MIT | Stargazers: 205 | Issues: 20 | Issues: 40

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language: Python | License: MIT | Stargazers: 196 | Issues: 5 | Issues: 12

continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Language: Python | License: MIT | Stargazers: 99 | Issues: 6 | Issues: 10

DRL

Deconfounding Reinforcement Learning in Observational Settings

GRADER

This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"

Language: Python | License: MIT | Stargazers: 30 | Issues: 2 | Issues: 1

RIA

TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning" (ICLR 2022).

gym-windy-gridworlds

Windy GridWorlds environments compatible with OpenAI gym.

Language: Python | License: MIT | Stargazers: 13 | Issues: 2 | Issues: 0
Language: Python | License: MIT | Stargazers: 3 | Issues: 0 | Issues: 0