Jaesik Yoon (jsikyoon)

Company: SAP, @ahn-ml

Location: Seoul, Korea

Home Page: jaesikyoon.com

Twitter: @jaesikyoon_

Jaesik Yoon's repositories

dreamer-torch

PyTorch version of Dreamer, following the original TensorFlow 2 code.

Language: Python | License: MIT | Stargazers: 113 | Issues: 2 | Issues: 14

bmaml_rl

This repository contains an implementation of the paper "Bayesian Model-Agnostic Meta-Learning".

Language: Python | License: MIT | Stargazers: 19 | Issues: 4 | Issues: 4

V-MPO_torch

PyTorch version of V-MPO with DMLab30 and GTrXL

Language: Python | License: MIT | Stargazers: 12 | Issues: 3 | Issues: 3

OCRL

Object-Centric Representation Library (OCRL): this repo explores object-centric representations (OCR) on various downstream tasks, from supervised learning to RL.

Language: Python | License: MIT | Stargazers: 9 | Issues: 2 | Issues: 4
Language: Python | License: MIT | Stargazers: 8 | Issues: 3 | Issues: 0

ASNP-RMR

This is an official TensorFlow implementation of ASNP-RMR.

Language: Python | License: MIT | Stargazers: 7 | Issues: 1 | Issues: 0

rl-starter-files

RL starter files to immediately train, visualize, and evaluate an agent without writing a single line of code

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

scalable_agent

A TensorFlow implementation of Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures.

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 1 | Issues: 0

torch-ac

Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO

Language: Python | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

crafter

Benchmarking the Spectrum of Agent Capabilities

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

dreamer

Dream to Control: Learning Behaviors by Latent Imagination

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

dreamer-1

Dream to Control: Learning Behaviors by Latent Imagination

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

gym-minigrid

Minimalistic gridworld package for OpenAI Gym

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

iris

Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

jsikyoon

profile

Stargazers: 0 | Issues: 3 | Issues: 0

lab

A customisable 3D platform for agent-based AI research

Language: C | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

lightning-bolts

Toolbox of models, callbacks, and datasets for AI/ML researchers.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Miniworld

Simple and easily configurable 3D FPS-game-like environments for reinforcement learning

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pycolab

A highly-customisable gridworld game engine with some batteries included. Make your own gridworld games to test reinforcement learning agents!

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

pytorch-ntm

Neural Turing Machines (NTM) - PyTorch Implementation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

Recall2Imagine

To investigate the R2I framework on my side

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

snp

Sequential Neural Processes

Language: Python | Stargazers: 0 | Issues: 2 | Issues: 0

SPACE

Official PyTorch implementation of "SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition"

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 2 | Issues: 0

spriteworld

Spriteworld: a flexible, configurable python-based reinforcement learning environment

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: HTML | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0