younggyoseo

Younggyo Seo's repositories

Ape-X

PyTorch implementation of Distributed Prioritized Experience Replay (Ape-X)

Language: Python; 148 10 3

vae-cf-pytorch

Variational Autoencoders for Collaborative Filtering - Implementation in PyTorch

Language: Python; 126 4 3

MWM

Masked World Models for Visual Control

Language: Python; License: NOASSERTION; 102 3 5

RE3

RE3: State Entropy Maximization with Random Encoders for Efficient Exploration

Language: Jupyter Notebook; 62 3 1

CaDM

CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning

Language: Python; 60 7 4

pytorch-nfsp

Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)

Language: Python; 42 3 2

trajectory_mcl

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)

Language: Python; 39 3 1

lasertag-v0

Implementation of DeepMind's LaserTag-v0 game from A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning (2017)

Language: Python; 18 20

pytorch-acer

PyTorch implementation of Sample Efficient Actor-Critic with Experience Replay (ACER)

Language: Python; 16 2 2

rnn-auxiliary-loss

Learning Longer-term Dependencies in RNNs with Auxiliary Losses - Implementation in PyTorch.

Language: Python; 13 4 1

knk-c-programming

My solutions to the exercises and programming projects in K. N. King's C Programming.

Language: C; 1 10

TF_DeepNLP

TensorFlow Models for NLP

Language: Python; 1 20

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python; License: MIT; 010

dqn-lasertag

Implementation of DQN for LaserTag-v0

Language: Python; 010

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Language: Python; License: MIT; 010

dreamerv2

Mastering Atari with Discrete World Models

License: MIT; 000

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language: Python; License: NOASSERTION; 020

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

Language: Python; License: MIT; 020

InfoGAN-PyTorch

PyTorch Implementation of InfoGAN

Language: Python; 020

ISLR-python

An Introduction to Statistical Learning (James, Witten, Hastie, Tibshirani, 2013): Python code

Language: Jupyter Notebook; License: MIT; 010

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).

Language: Python; License: MIT; 020

RL-Adventure

PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL

Language: Jupyter Notebook; 020

RL-Adventure-2

PyTorch 0.4 implementation of: actor critic / proximal policy optimization / ACER / DDPG / twin dueling DDPG / soft actor critic / generative adversarial imitation learning / hindsight experience replay