aiueola

Haruka Kiyohara's repositories

(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"

Language:PythonApache-2.013 10

(KDD2023) "Off-Policy Evaluation of Ranking Policies under Diverse User Behavior"

Language:PythonApache-2.08 10

(NeurIPS2023) "Future-Dependent Value-Based Off-Policy Evaluation in POMDPs"

Language:PythonApache-2.04 10

(WebConf2024) "Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction"

Language:PythonApache-2.0100

An index of algorithms for offline reinforcement learning (offline-rl)

000

An offline deep reinforcement learning library

Language:PythonMIT000

Language:PythonMIT000

scikit-learn: machine learning in Python

Language:PythonBSD-3-Clause000

Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation

Language:PythonApache-2.0000