Yui010206 / SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Home Page: https://arxiv.org/abs/2305.06988

How to define answer_id?

junwenchen opened this issue:

Hi,

Thanks for releasing the code.
I found that line 101 of sevila.py defines answer_id = [71, 272, 205, 309, 262], which corresponds to the options A, B, C, D, and E.
Could you let me know how these answer_id values were obtained?

Hi, thanks for your interest. They are the ids of the letters A-E in the T5 model's tokenizer vocabulary.
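
For anyone who wants to check this themselves, here is a minimal sketch of how such ids could be looked up. It is not taken from the SeViLA code: the helper names `letter_token_ids` and `load_t5_tokenizer` are hypothetical, and the checkpoint name `google/flan-t5-xl` is an assumption, since the thread does not say which T5 checkpoint the repository loads. The sketch only relies on the Hugging Face tokenizer method `encode(text, add_special_tokens=False)`.

```python
from typing import List, Sequence


def letter_token_ids(
    tokenizer, letters: Sequence[str] = ("A", "B", "C", "D", "E")
) -> List[int]:
    """Return one vocabulary id per option letter (hypothetical helper)."""
    ids = []
    for letter in letters:
        # Skip special tokens such as </s>, so only the letter itself is encoded.
        pieces = tokenizer.encode(letter, add_special_tokens=False)
        if len(pieces) != 1:
            # A letter that splits into several pieces cannot be used as a
            # single answer token, so fail loudly instead of guessing.
            raise ValueError(f"{letter!r} maps to {len(pieces)} tokens: {pieces}")
        ids.append(pieces[0])
    return ids


def load_t5_tokenizer(name: str = "google/flan-t5-xl"):
    """Load a T5 tokenizer; the default checkpoint name is an assumption."""
    # Imported lazily so the lookup helper above has no hard dependency.
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(name)


# Example usage (needs the `transformers` package and network access):
# tokenizer = load_t5_tokenizer()
# print(letter_token_ids(tokenizer))
```

If the reply above applies to the checkpoint you load, the printed list should match the hard-coded answer_id in sevila.py. A different tokenizer or vocabulary would give different ids, which is a reason to look them up rather than copy the numbers.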