Beast code in Giters

Seokin Seo's starred repositories

dro

A package of distributionally robust optimization (DRO) methods. Implemented via cvxpy and PyTorch

Language:PythonNOASSERTION1900

MC-LAVE-RL

ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"

Language:PythonGPL-2.03000

Awesome-Realistic-Semi-Supervised-Learning

An awesome paper list of Semi-Supervised Learning under realistic settings.

Language:Shell9500

stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

MIT100

Resolving_copycat_problems_via_residual_prediction

This is an official code for paper Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction https://arxiv.org/abs/2207.09705.

Language:PythonMIT600

DSTC10-SIMMC

Repository (preliminary codes) for DSTC10 SIMMC track.

Language:PythonMIT1900

optidice

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

Language:Python1300

mbrl-lib

Library for Model Based RL

Language:PythonMIT95400

CQL

Code for conservative Q-learning

Language:Python39700

JEM

Project site for "Your Classifier is Secretly an Energy-Based Model and You Should Treat it Like One"

Language:PythonApache-2.041500

stable-baselines-tf2

Explainable & Easy-to-debug Deep Reinforcement Learning Framework

Language:PythonMIT1500

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonMIT356700

tzs930

Seokin Seo's starred repositories

DRBC

dro

MC-LAVE-RL

Awesome-Realistic-Semi-Supervised-Learning

palr

stable-baselines3-contrib

Resolving_copycat_problems_via_residual_prediction

MVTCAE

imitation-dice

DSTC10-SIMMC

imitation-dice

AutomaticWatchFaces

optidice

mbrl-lib

CQL

JEM

repb-sde

IIAE

probprog20

stable-baselines-tf2

pytorch-a2c-ppo-acktr-gail

minirts

sac

lime-experiments

BGAIL

tf-explain

DNDT

lime

RepBM

softlearning