Haruka Kiyohara's repositories

wsdm2022-cascade-dr

(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 1 | Issues: 0

kdd2023-aips

(KDD2023) "Off-Policy Evaluation of Ranking Policies under Diverse User Behavior"

Language: Python | License: Apache-2.0 | Stargazers: 8 | Issues: 1 | Issues: 0

neurips2023-future-dependent-ope

(NeurIPS2023) "Future-Dependent Value-Based Off-Policy Evaluation in POMDPs"

Language: Python | License: Apache-2.0 | Stargazers: 4 | Issues: 1 | Issues: 0

webconf2024-slate-ope-via-abstraction

(WebConf2024) "Off-Policy Evaluation of Slate Bandit Policies via Optimizing Abstraction"

Language: Python | License: Apache-2.0 | Stargazers: 1 | Issues: 0 | Issues: 0

awesome-offline-rl

An index of algorithms for offline reinforcement learning (offline-rl)

Stargazers: 0 | Issues: 0 | Issues: 0

d3rlpy

An offline deep reinforcement learning library

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

scikit-learn

scikit-learn: machine learning in Python

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0

zr-obp

Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0