Haanvid Lee (haanvid)

Company: KAIST

Location: Daejeon, Republic of Korea

Haanvid Lee's repositories

DSTC10-SIMMC

Repository (preliminary code) for the DSTC10 SIMMC track.

Language: Python, License: MIT, Stargazers: 1, Issues: 0

kmifqe

Kernel Metric learning for In-sample Fitted Q Evaluation (KMIFQE)

Language: Python, License: MIT, Stargazers: 1, Issues: 0

kmis

Local kernel metric learning for importance sampling (KMIS) in off-policy evaluation (OPE)

Language: Python, License: MIT, Stargazers: 1, Issues: 0

agents

TF-Agents is a library for Reinforcement Learning in TensorFlow

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

alberdice

Official PyTorch implementation of AlberDICE

Language: Python, Stargazers: 0, Issues: 0

BCQ

PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"

Language: Python, Stargazers: 0, Issues: 0

generative-models

Collection of generative models, e.g. GAN, VAE, in PyTorch and TensorFlow.

Language: Python, License: Unlicense, Stargazers: 0, Issues: 0

google-research

Google Research

License: Apache-2.0, Stargazers: 0, Issues: 0

GPT-Critic

GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems

Stargazers: 0, Issues: 0

haanvid.github.io

Personal website

Stargazers: 0, Issues: 0

LSPI

LSPI (Least-Squares Policy Iteration) with TensorFlow 1.5

Language: Python, Stargazers: 0, Issues: 0

MC-LAVE-RL

ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"

License: GPL-2.0, Stargazers: 0, Issues: 0

models

Models built with TensorFlow

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0

probability

Probabilistic reasoning and statistical analysis in TensorFlow

License: Apache-2.0, Stargazers: 0, Issues: 0

RepBM

Representation Balancing MDPs for Off-Policy Policy Evaluation

Language: Python, License: MIT, Stargazers: 0, Issues: 0

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

Language: Python, License: NOASSERTION, Stargazers: 0, Issues: 0

softlearning

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

License: NOASSERTION, Stargazers: 0, Issues: 0

SVGD

TensorFlow Implementation of Stein Variational Gradient Descent (SVGD)

Language: Python, Stargazers: 0, Issues: 0

tutorial-git

:blue_book: Let's quickly learn how to use Git.

License: BSD-3-Clause, Stargazers: 0, Issues: 0

zr-obp

Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation

License: Apache-2.0, Stargazers: 0, Issues: 0