Youngsoo Jang's repositories
MC-LAVE-RL
ICLR 2021: "Monte-Carlo Planning and Learning with Language Action Value Estimates"
GPT-Critic
GPT-Critic: Offline Reinforcement Learning for End-to-End Task-Oriented Dialogue Systems
alberdice
Office PyTorch implementation of AlberDICE
Language:Python000
gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:PythonNOASSERTION000
Hierarchical-Attention
[TASLP2018] Cross-language Neural Dialog State Tracker for Large Ontologies using Hierarchical Attention
Language:Python000