gkswamy98

http://gokul.dev

Gokul Swamy's repositories

fast_irl

Contains an implementation of the FILTER algorithm for exponentially faster inverse reinforcement learning.

Language: Jupyter Notebook 43 4 1

pillbox

Contains implementations of the AdVIL, AdRIL, and DAeQuIL algorithms from the ICML '21 paper "Of Moments and Matching."

Language: Jupyter Notebook 21 20

dotfiles

Some of my configs.

Language: Emacs Lisp 9 20

causal_il

Contains implementations of the DoubIL and ResiduIL algorithms from the ICML '22 paper "Causal Imitation Learning under Temporally Correlated Noise."

Language: Jupyter Notebook 7 20

sequence_model_il

Contains sequence-model implementations of on- and off-policy imitation learning algorithms for problems with unobserved contexts.

Language: Jupyter Notebook 5 20

valuedice

Fork of the ValueDICE code that supports discrete action spaces and PyBullet, and is truly off-policy.

Language: Python 4 20

meta-rl-bci

Meta-learning + MaxEnt deep RL for shared autonomy from EEG signals.

Language: Python 3 30

replay_est

Contains an implementation of the replay estimation algorithm from "Minimax Optimal Online Imitation Learning via Replay Estimation."

Language: Python 2 30

adversarial_rl

CS 294-131 Project, reimplementing https://arxiv.org/pdf/1702.02284.pdf.

Language: Python 1 40

ViZDoom

Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information.

Language: C++ 1 20

icl

Language: HTML 010

294-149-coding-proj

Code for https://www.overleaf.com/project/5bb426f847a81409e6f4fd86

Language: Jupyter Notebook 020

294-149_final_proj

CS 294-149 Final Project: Information-Theoretic Selection of Classifiers

Language: Jupyter Notebook 030

autosort

Human-assisted few-shot object sorting

030

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python MIT 020

causil

Language: HTML 020

D4RL

A collection of reference environments for offline reinforcement learning

Language: Python Apache-2.0 000

DQfD

An implementation of Deep Q-Learning from Demonstrations (DQfD) for playing Atari 2600 video games

Language: Python MIT 000

filter

Language: HTML 000

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language: Python BSD-3-Clause 020

il_envs

Language: Jupyter Notebook 020

jaco_learning

Control, planning, and learning system for human-robot interaction with a JACO2 7DOF robotic arm.

Language: OpenEdge ABL 010

mjrl

Reinforcement learning algorithms for MuJoCo tasks

Language: Python Apache-2.0 010

mmil

Website for the ICML '21 paper.

Language: HTML 020

mu4e-dashboard

A dashboard for mu4e (mu for Emacs)

Language: Emacs Lisp GPL-3.0 010

replay

Language: HTML 010

sequil

Language: HTML 010

serializable

Utilities for creating serializable classes

Language: Python 020

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language: Python NOASSERTION 020

valentinp.github.com

My public page.

Language: CSS 010