Dong-Ki Kim's repositories

meta-mapg

Source code for "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning" (ICML 2021)

Language:PythonLicense:MITStargazers:31Issues:2Issues:2

further

Source code for "Influencing Long-Term Behavior in Multiagent Reinforcement Learning" (NeurIPS 2022)

Language:PythonLicense:MITStargazers:18Issues:1Issues:0

mape-tutorial

Tutorial for multi-agent particle environment

Language:PythonLicense:MITStargazers:4Issues:3Issues:0

gumbel-rl-gridworld

The use of Gumbel-softmax for a single agent reinforcement learning in a simple gridworld

Language:PythonLicense:MITStargazers:3Issues:0Issues:0

gym-wolfpack

Implementation of wolfpack domain as in Leibo et al., AAMAS-17

Language:PythonLicense:MITStargazers:3Issues:2Issues:0
Language:CSSLicense:NOASSERTIONStargazers:2Issues:1Issues:0

dkkim93.github.io

Dong-Ki Kim's Academic Webpage

Language:JavaScriptLicense:MITStargazers:1Issues:1Issues:0

cavia

Code for "Fast Context Adaptation via Meta-Learning"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

gym

A toolkit for developing and comparing reinforcement learning algorithms.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

gym-craftenv-render

code for rendering the craft environment in "Modular Multitask Reinforcement Learning with Policy Sketches" (Andreas, Klein, Levine. ICML 2017)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

lola

Code release for Learning with Opponent-Learning Awareness and variations.

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0

LOLA_DiCE

Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

mctx

Monte Carlo tree search in JAX

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

MER

Fork of the GEM project (https://github.com/facebookresearch/GradientEpisodicMemory) including Meta-Experience Replay (MER) methods from the ICLR 2019 paper (https://openreview.net/pdf?id=B1gTShAct7)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

multiagent-competition

Code for the paper "Emergent Complexity via Multi-agent Competition"

Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0

PettingZoo

Gym for multi-agent reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pytorch-maml-rl

Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

safety-starter-agents

Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

spinningup

An educational resource to help anyone learn deep reinforcement learning.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

SuperSuit

Easy-to-use micro-wrappers for Gym and PettingZoo based RL Environments

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

trl

Train transformer language models with reinforcement learning.

License:Apache-2.0Stargazers:0Issues:0Issues:0

ubuntu-misc

Personal ubuntu misc files (e.g., zshrc, vimrc, flake8, terminator)

Language:Vim ScriptStargazers:0Issues:1Issues:0
Stargazers:0Issues:0Issues:0