Jong-Kook Heo (JongKook-Heo)

JongKook-Heo

Geek Repo

Company:Korea University

Location:Seoul

Github PK Tool:Github PK Tool

Jong-Kook Heo's repositories

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

causal-confusion

A toolkit for developing and comparing reinforcement learning algorithms.

License:NOASSERTIONStargazers:0Issues:0Issues:0

control-pcgrl

Train or evolve controllable and diverse level-generators.

License:MITStargazers:0Issues:0Issues:0

CQL

PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and continuous action spaces.

Stargazers:0Issues:0Issues:0

DeepRL_PyTorch

Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

DPPO

Official implementation of "Direct Preference-based Policy Optimization without Reward Modeling" (NeurIPS 2023)

License:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

epic

Implements the Equivalent-Policy Invariant Comparison (EPIC) distance for reward functions.

License:MITStargazers:0Issues:0Issues:0

gym-pcgrl

A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Kernel-based-Learning

Kernel SVM Implementation for Molecule Dataset

Language:PythonStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

mario-gpt

[Neurips 2023] Generating Mario Levels with GPT2. Code for the paper "MarioGPT: Open-Ended Text2Level Generation through Large Language Models" https://arxiv.org/abs/2302.05981

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

ms-level-gen

Start Small: Training Controllable Game Level Generators without Training Data by Learning at Multiple Sizes

License:MITStargazers:0Issues:0Issues:0

multi-sensor-FDII-health-forecasting-for-autonomous-vehicles

An Extension of A2D2 dataset by Audi with augmented sensor faults and different degradation paths

License:MITStargazers:0Issues:0Issues:0

oprl

Official Codebase for TMLR 2023, Benchmarks and Algorithms for Offline Preference-Based Reward Learning

License:MITStargazers:0Issues:0Issues:0

paired

PAIRED in PyTorch 🔥

License:Apache-2.0Stargazers:0Issues:0Issues:0

PALM-E

Implementation of "PaLM-E: An Embodied Multimodal Language Model"

License:Apache-2.0Stargazers:0Issues:0Issues:0

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

robotic-transformer-pytorch

Implementation of RT1 (Robotic Transformer) in Pytorch

License:MITStargazers:0Issues:0Issues:0

rune

Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learning

License:MITStargazers:0Issues:0Issues:0

Semi-SupervisedLearning

SemiSupervisedLearning

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

TorchSSL

A PyTorch-based library for semi-supervised learning (NeurIPS'21)

License:MITStargazers:0Issues:0Issues:0