Gokul Swamy's repositories

fast_irl

Contains implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.

Language:Jupyter NotebookStargazers:43Issues:4Issues:1

pillbox

Contains implementation of AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 Paper Of Moments and Matching.

Language:Jupyter NotebookStargazers:21Issues:2Issues:0

dotfiles

some of my configs

Language:Emacs LispStargazers:9Issues:2Issues:0

causal_il

Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlated Noise.

Language:Jupyter NotebookStargazers:7Issues:2Issues:0

sequence_model_il

Contains sequence-model implementations of on and off-policy imitation learning algorithms for problems with unobserved contexts.

Language:Jupyter NotebookStargazers:5Issues:2Issues:0

valuedice

Fork of ValueDICE code that supports discrete action spaces, pybullet, and is truly off-policy.

Language:PythonStargazers:4Issues:2Issues:0

meta-rl-bci

meta learning + maxent deeprl for shared autonomy from eeg signals

Language:PythonStargazers:3Issues:3Issues:0

replay_est

Contains implementation of the replay estimation algorithm from "Minimax Optimal Online Imitation Learning via Replay Estimation."

Language:PythonStargazers:2Issues:3Issues:0

adversarial_rl

CS 294-131 Project, reimplementing https://arxiv.org/pdf/1702.02284.pdf.

Language:PythonStargazers:1Issues:4Issues:0

ViZDoom

Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. :godmode:

Language:C++Stargazers:1Issues:2Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

294-149-coding-proj

Code for https://www.overleaf.com/project/5bb426f847a81409e6f4fd86

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

294-149_final_proj

CS 294-149 Final Project: Information-Theoretic Selection of Classifiers

Language:Jupyter NotebookStargazers:0Issues:3Issues:0

autosort

Human assisted few-shot object sorting

Stargazers:0Issues:3Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:2Issues:0

D4RL

A collection of reference environments for offline reinforcement learning

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DQfD

An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:2Issues:0

jaco_learning

Control, planning, and learning system for human-robot interaction with a JACO2 7DOF robotic arm.

Language:OpenEdge ABLStargazers:0Issues:1Issues:0

mjrl

Reinforcement learning algorithms for MuJoCo tasks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

mmil

Website for ICML'21 paper.

Language:HTMLStargazers:0Issues:2Issues:0

mu4e-dashboard

A dashboard for mu4e (mu for emacs)

Language:Emacs LispLicense:GPL-3.0Stargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

serializable

Utilities for creating serializable classes

Language:PythonStargazers:0Issues:2Issues:0

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

valentinp.github.com

My public page.

Language:CSSStargazers:0Issues:1Issues:0