Younggyo Seo's repositories

Ape-X

PyTorch Implementation of Distributed Prioritized Experience Replay(Ape-X)

vae-cf-pytorch

Variational Autoencoders for Collaborative Filtering - Implementation in PyTorch

MWM

Masked World Models for Visual Control

Language:PythonLicense:NOASSERTIONStargazers:102Issues:3Issues:5

RE3

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Language:Jupyter NotebookStargazers:62Issues:3Issues:1

CaDM

CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning

pytorch-nfsp

Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)

trajectory_mcl

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)

lasertag-v0

Implementation of Deepmind's LaserTag-v0 game in A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning(2017)

Language:PythonStargazers:18Issues:2Issues:0

pytorch-acer

PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay(ACER)

rnn-auxiliary-loss

Learning Longer-term Dependencies in RNNs with Auxiliary Losses - Implementation in PyTorch.

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

knk-c-programming

My Solutions to exercises and programming projects to K.N.King's C programming.

Language:CStargazers:1Issues:1Issues:0

TF_DeepNLP

TensorFlow Models for NLP

Language:PythonStargazers:1Issues:2Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

dqn-lasertag

Implementation of DQN for LaserTag-v0

Language:PythonStargazers:0Issues:1Issues:0

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

dreamerv2

Mastering Atari with Discrete World Models

License:MITStargazers:0Issues:0Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

InfoGAN-PyTorch

PyTorch Implementation of InfoGAN

Language:PythonStargazers:0Issues:2Issues:0

ISLR-python

An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

pytorch-a2c-ppo-acktr

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

RL-Adventure

Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

RL-Adventure-2

PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

RLBench

A large-scale benchmark and learning environment.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:0Issues:0