Haanvid Lee's repositories
DSTC10-SIMMC
Repository (preliminary codes) for DSTC10 SIMMC track.
agents
TF-Agents is a library for Reinforcement Learning in TensorFlow
alberdice
Office PyTorch implementation of AlberDICE
BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
generative-models
Collection of generative models, e.g. GAN, VAE in Pytorch and Tensorflow.
google-research
Google Research
GPT-Critic
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
haanvid.github.io
Personal website
LSPI
LSPI(Least-Squares Policy Iteration) with TF1.5
MC-LAVE-RL
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
models
Models built with TensorFlow
probability
Probabilistic reasoning and statistical analysis in TensorFlow
RepBM
Representation Balancing MDPs for Off-Policy Policy Evaluation
rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
SVGD
TensorFlow Implementation of Stein Variational Gradient Descent (SVGD)
tutorial-git
:blue_book: 어떻게 깃을 사용하는지 빠르게 알아봅시다. (Quick learn How to use Git.)
zr-obp
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation