Dibya Ghosh (dibyaghosh)

dibyaghosh

Geek Repo

Home Page:https://dibyaghosh.com

Twitter:@its_dibya

Github PK Tool:Github PK Tool


Organizations
data-8
stat134

Dibya Ghosh's repositories

icvf_release

Public code for "Reinforcement Learning from Passive Data via Latent Intentions"

Language:PythonLicense:MITStargazers:77Issues:4Issues:2

gcsl

Code for "Learning to Reach Goals via Iterated Supervised Learning"

jaxrl_m

Skeleton for scalable and flexible Jax RL implementations

Language:PythonLicense:MITStargazers:56Issues:4Issues:0

autogit

Python library for creating periodic git backups of your codebase (e.g. right before launching an experiment).

Language:PythonStargazers:5Issues:3Issues:0
Language:PythonStargazers:4Issues:2Issues:0

codesave

Easy way to save codebases (for later loading!)

Language:PythonLicense:MITStargazers:2Issues:2Issues:0

dibyaghosh.github.io

A staging area for potential websites

Language:HTMLLicense:MITStargazers:2Issues:1Issues:0

epistemic_pomdp

Repository for the NeurIPS 2021 paper: "Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

config_spec

A small library to help create JSON-serializable configs for functions and classes

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

jaxamine

Basically the Dopamine RL library, but without any TF references

Language:PythonStargazers:1Issues:3Issues:0
Language:Jupyter NotebookLicense:MITStargazers:1Issues:3Issues:0

rlutil

Custom version of justinjfu/rlutil

Language:PythonLicense:Apache-2.0Stargazers:1Issues:4Issues:0
Language:HTMLStargazers:1Issues:2Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

blog

variationalbay.es

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0
Language:C++License:MITStargazers:0Issues:3Issues:0

d4rl

A benchmark for offline reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:2Issues:0

level-replay

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

octo-anonymous

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:3Issues:1

pretrained_vision

Utilities for converting pretrained checkpoints to jaxrl_m compatible format

Language:PythonStargazers:0Issues:2Issues:0

procgen

Procgen Benchmark: Procedurally Generated Game-Like Gym Environments

Language:C++License:MITStargazers:0Issues:2Issues:0
Stargazers:0Issues:3Issues:0
Stargazers:0Issues:3Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0