SSubhnil

Shubham Subhnil's starred repositories

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonApache-2.029613 325 5438

reinforcement-learning

Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.

Language:Jupyter NotebookMIT20358 860 155

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonMIT15584 648 850

deepmind-research

This repository contains implementations and illustrative code to accompany DeepMind publications

Language:Jupyter NotebookApache-2.013023 325 318

bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

Language:C++NOASSERTION12366 407 1984

FinRL

FinRL: Financial Reinforcement Learning. 🔥

Language:Jupyter NotebookMIT9568 199 711

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT8608 60 1450

ai-deadlines

:alarm_clock: AI conference deadline countdowns

Language:JavaScriptMIT5541 99 92

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonApache-2.03692 130 407

DI-engine

OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.

Language:PythonApache-2.02910 22 199

HighwayEnv

A minimalist environment for decision-making in autonomous driving

Language:PythonMIT2538 29 455

PettingZoo

An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities

Language:PythonNOASSERTION2517 20 368

zhusuan

A probabilistic programming library for Bayesian deep learning, generative models, based on Tensorflow

Language:PythonMIT2199 143 61

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonMIT2163 42 590

FidelityFX-FSR

FidelityFX Super Resolution

Language:CMIT2041 54 29

dreamerv3

Mastering Diverse Domains through World Models

Language:PythonMIT1237 26 127

robosuite

robosuite: A Modular Simulation Framework and Benchmark for Robot Learning

Language:PythonNOASSERTION1214 24 358

DRL-code-pytorch

Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.

Language:PythonMIT997 5 15

xmanager

A platform for managing machine learning experiments

Language:PythonApache-2.0813 30 32

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Language:PythonGPL-3.0785 23 23

dreamerv3-torch

Implementation of Dreamer v3 in pytorch.

Language:PythonMIT374 10 57

Minari

A standard format for offline reinforcement learning datasets, with popular reference datasets and related utilities

Language:PythonNOASSERTION253 11 51

CausalWorld

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

Language:PythonMIT205 20 40

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language:PythonMIT196 5 12

continual_rl

Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily extensible to new methods.

Language:PythonMIT99 6 10

DRL

Deconfounding Reinforcement Learning in Observational Settings

Language:Python48 2 3

GRADER

This is the official implementation of NeurIPS 2022 paper "Generalizing Goal-Conditioned Reinforcement Learning with Variational Causal Reasoning"

Language:PythonMIT30 2 1

RIA

TensorFlow implementation of "A Relational Intervention Approach for Unsupervised Dynamics Generalization in Model-Based Reinforcement Learning" (ICLR 2022).

Language:Python15 1 1

gym-windy-gridworlds

Windy GridWorlds environments compatible with OpenAI gym.

Language:PythonMIT13 20

IVOPEwithACME

Language:PythonMIT300