Question regarding using contextual bandits for Learning-To-Rank

Question

Question regarding using contextual bandits for Learning-To-Rank

NikhilKothari opened this issue 3 years ago · comments

Hey,

I want to use MAB in an LTR problem. Can you help me figure out which algorithm to use and a little about how to use it?
(a beginner in MAB I am)

I'm thinking of using the functionality for ranking top-N arms instead of always picking the single best one.
Any suggestion would help. Thanks in advance.

david-cortes · Answer 1 · Sun Jan 23 2022 23:55:52 GMT+0800 (China Standard Time)

That’s the part where you’re supposed to do a literature review, use your skills, do experiments, and get creative. This library models a specific problem which you can find in the documentation – how to use it for a different problem domain is up to you and that is an open-ended problem.