seolhokim

seolhokim

Geek Repo

Company:KC-ML2

Location:Seoul, Korea

Home Page:seolhokim.github.io

Github PK Tool:Github PK Tool


Organizations
kc-ml2

seolhokim's repositories

Mujoco-Pytorch

PPO, DDPG, SAC implementation on mujoco environment

Language:PythonLicense:MITStargazers:81Issues:3Issues:3

InverseRL-Pytorch

Pytorch GAIL VAIL AIRL VAIRL EAIRL SQIL Implementation

Language:PythonLicense:MITStargazers:56Issues:2Issues:0

DistributedRL-Pytorch-Ray

Distributed RL Implementation using Pytorch and Ray (ApeX(Ape-X), A3C, Distributed-PPO(DPPO), Impala)

Language:PythonLicense:MITStargazers:29Issues:3Issues:5

Transportation-Routes-Optimization-by-RL

Application of reinforcement learning to Optimize transportation routes using reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:17Issues:3Issues:3

Deep-Multi-Agent-Reinforcement-Learning

deep multi agent reinforcement learning tutorial book for intermediate

BipedalWalker-BranchingDQN

The Easiest Pytorch Implementation of Branching-DQN

Language:PythonLicense:Apache-2.0Stargazers:8Issues:2Issues:0

ddpg-mountain-car-continuous

DDPG Algorithm is implemented using Pytorch

Language:Jupyter NotebookStargazers:5Issues:2Issues:0

SimpleDistributedRL

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Language:PythonLicense:MITStargazers:4Issues:1Issues:0

BYOL

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Language:PythonStargazers:2Issues:1Issues:0

Starcraft-II-Minigame-RL

Starcraft minigame single/multi agents

Language:PythonLicense:MITStargazers:2Issues:2Issues:1

supermariobros-random-network-distillation

Algorithm RND-PPO is implemented in super mario bros.

Language:Jupyter NotebookStargazers:2Issues:2Issues:0
Language:PythonLicense:MITStargazers:1Issues:2Issues:0
Language:HTMLLicense:MITStargazers:1Issues:2Issues:0

apt

Behavior From the Void: Unsupervised Active Pre-Training (pytorch version)

Language:PythonLicense:MITStargazers:0Issues:3Issues:0
Language:PythonLicense:MITStargazers:0Issues:2Issues:0

cictest

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

Language:PythonStargazers:0Issues:1Issues:0

convex-optimization-for-all.github.io

모두를 위한 컨백스 최적화

Language:CSSLicense:NOASSERTIONStargazers:0Issues:1Issues:0

dreamerv2

Mastering Atari with Discrete World Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

jekyll-theme-hamilton

A minimal and beautiful Jekyll theme best for writing and note-taking.

Language:SCSSLicense:MITStargazers:0Issues:1Issues:0

minimalRL

Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

multiagent-particle-envs

Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"

License:MITStargazers:0Issues:0Issues:0

pointer-networks-pytorch

Implementation of Pointer Networks using PyTorch

License:MITStargazers:0Issues:0Issues:0

proto

Proto-RL: Reinforcement Learning with Prototypical Representations

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

SimSiam

A pytorch implementation for paper 'Exploring Simple Siamese Representation Learning'

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

vacation_research

2018-08~2018-10 elementary research in ML

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

world-models

Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0