Seokin Seo's starred repositories

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

dro

A package of distributionally robust optimization (DRO) methods. Implemented via cvxpy and PyTorch

Language:PythonLicense:NOASSERTIONStargazers:19Issues:0Issues:0

MC-LAVE-RL

ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"

Language:PythonLicense:GPL-2.0Stargazers:30Issues:0Issues:0

Awesome-Realistic-Semi-Supervised-Learning

An awesome paper list of Semi-Supervised Learning under realistic settings.

Language:ShellStargazers:95Issues:0Issues:0
Language:PythonLicense:MITStargazers:12Issues:0Issues:0

stable-baselines3-contrib

Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code

License:MITStargazers:1Issues:0Issues:0

Resolving_copycat_problems_via_residual_prediction

This is an official code for paper Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction https://arxiv.org/abs/2207.09705.

Language:PythonLicense:MITStargazers:6Issues:0Issues:0
Language:PythonStargazers:13Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:18Issues:0Issues:0

DSTC10-SIMMC

Repository (preliminary codes) for DSTC10 SIMMC track.

Language:PythonLicense:MITStargazers:19Issues:0Issues:0
License:Apache-2.0Stargazers:1Issues:0Issues:0
Language:SwiftStargazers:4Issues:0Issues:0

optidice

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

Language:PythonStargazers:13Issues:0Issues:0

mbrl-lib

Library for Model Based RL

Language:PythonLicense:MITStargazers:954Issues:0Issues:0

CQL

Code for conservative Q-learning

Language:PythonStargazers:397Issues:0Issues:0

JEM

Project site for "Your Classifier is Secretly an Energy-Based Model and You Should Treat it Like One"

Language:PythonLicense:Apache-2.0Stargazers:415Issues:0Issues:0
Language:PythonLicense:MITStargazers:7Issues:0Issues:0
Language:PythonStargazers:27Issues:0Issues:0
Language:ClojureStargazers:35Issues:0Issues:0

stable-baselines-tf2

Explainable & Easy-to-debug Deep Reinforcement Learning Framework

Language:PythonLicense:MITStargazers:15Issues:0Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:3567Issues:0Issues:0

minirts

We release dataset collected for our research, code that implement neural network models described in the paper, and scripts to reproduce all of our results, and visualization tool for visualize dataset.

Language:C++License:NOASSERTIONStargazers:160Issues:0Issues:0

sac

Soft Actor-Critic

Language:PythonLicense:NOASSERTIONStargazers:973Issues:0Issues:0

lime-experiments

Code for all experiments.

Language:PythonLicense:BSD-2-ClauseStargazers:306Issues:0Issues:0

BGAIL

Bayesian Approach to Generative Adversarial Imitation Learning

Language:PythonStargazers:8Issues:0Issues:0

tf-explain

Interpretability Methods for tf.keras models with Tensorflow 2.x

Language:PythonLicense:MITStargazers:1016Issues:0Issues:0

DNDT

Deep Neural Decision Trees

Language:Jupyter NotebookLicense:UnlicenseStargazers:156Issues:0Issues:0

lime

Lime: Explaining the predictions of any machine learning classifier

Language:JavaScriptLicense:BSD-2-ClauseStargazers:11539Issues:0Issues:0

RepBM

Representation Balancing MDPs for Off-Policy Policy Evaluation

Language:PythonLicense:MITStargazers:8Issues:0Issues:0

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Language:PythonLicense:NOASSERTIONStargazers:1200Issues:0Issues:0