dhruvsreenivas

Dhruv Sreenivas's repositories

byol-offline

Bootstrap your own latent (BYOL) methods in offline reinforcement learning

Language: Python | 4 30

acme

A library of reinforcement learning components and agents.

Language: Python | Apache-2.0 | 000

amp_extensions

Extension of the AMP framework (https://github.com/xbpeng/DeepMimic) to include a Gym environment. Also adapts code from MILO (https://github.com/jdchang1/milo) for testing on the AMP framework.

Language: C++ | 000

amp_milo

Model-based offline imitation learning on AMP tasks.

Language: Python | 020

DeepMimic

Motion imitation with deep reinforcement learning.

Language: C++ | MIT | 000

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Apache-2.0 | 000

dots-and-boxes

Testing various AI methods for the game Dots & Boxes

Language: Python | 010

dqn_zoo

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

Language: Python | Apache-2.0 | 000

dril

Disagreement-Regularized Imitation Learning

Language: Python | 000

gym-simplifiedtetris

🟥 Simplified Tetris environments compliant with OpenAI Gym's API

Language: Python | MIT | 000

homework_fall2020

Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)

Language: Jupyter Notebook | 000

IQ-Learn

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

NOASSERTION | 000

IsaacGymEnvs

Isaac Gym Reinforcement Learning Environments

Language: Python | NOASSERTION | 000

lightATAC-rlhf

A lightweight reimplementation of Adversarially Trained Actor Critic

Language: Python | MIT | 000

OfflineRL-Kit

An elegant PyTorch offline reinforcement learning library for researchers.

Language: Python | MIT | 000

pillbox

Contains implementations of the AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 paper "Of Moments and Matching".

000

pytorch-a2c-trpo-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python | NOASSERTION | 000

byol-offline

cs330_stanford

jax_sandbox

acme

amp_extensions

amp_milo

DeepMimic

dhruvsreenivas.github.io

dm_control

dmcgym

dots-and-boxes

dqn_zoo

dril

gym-simplifiedtetris

homework_fall2020

humanoid-bench-exps

IQ-Learn

IsaacGymEnvs

jaxrl2

lightATAC-rlhf

mbrl_pretrain_finetune

OfflineRL-Kit

pillbox

pytorch-a2c-trpo-ppo-acktr-gail

pytorch_sac

rl-trained-agents

sqil-atari

TD3_BC

tril

v-d4rl