dibyaghosh

followers

following

stars

https://dibyaghosh.com

Organizations

data-8

stat134

Dibya Ghosh's repositories

icvf_release

Public code for "Reinforcement Learning from Passive Data via Latent Intentions"

Language:PythonMIT77 4 2

gcsl

Code for "Learning to Reach Goals via Iterated Supervised Learning"

Language:Python75 7 8

jaxrl_m

Skeleton for scalable and flexible Jax RL implementations

Language:PythonMIT56 40

autogit

Python library for creating periodic git backups of your codebase (e.g. right before launching an experiment).

Language:Python5 30

remote_gym

Language:Python4 20

codesave

Easy way to save codebases (for later loading!)

Language:PythonMIT2 20

dibyaghosh.github.io

A staging area for potential websites

Language:HTMLMIT2 10

epistemic_pomdp

Repository for the NeurIPS 2021 paper: "Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

Language:HTML2 4 1

config_spec

A small library to help create JSON-serializable configs for functions and classes

Language:PythonMIT1 20

jaxamine

Basically the Dopamine RL library, but without any TF references

Language:Python1 30

jaxrl

Language:Jupyter NotebookMIT1 30

offline_procgen

Language:Python1 30

offlinerl_adaptation

Language:HTML1 30

rlutil

Custom version of justinjfu/rlutil

Language:PythonApache-2.01 40

rtx_viz

Language:HTML1 20

bairblog.github.io

Language:JavaScriptMIT000

blog

variationalbay.es

Language:Jupyter NotebookMIT000

coinrun

Language:C++MIT030

d4rl

A benchmark for offline reinforcement learning.

Language:PythonApache-2.0020

doodad

Language:PythonMIT020

flax_model

Language:PythonMIT000

google-research

Google Research

Language:Jupyter NotebookApache-2.0020

level-replay

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

Language:PythonNOASSERTION020

octo-anonymous

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Language:PythonMIT010

ppo-ensemble-2

Language:PythonMIT03 1

pretrained_vision

Utilities for converting pretrained checkpoints to jaxrl_m compatible format

Language:Python020

procgen

Procgen Benchmark: Procedurally Generated Game-Like Gym Environments

Language:C++MIT020

testtt

030

testtt22

030

tpu_utils

Language:PythonMIT000