[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering
Home Page:https://arxiv.org/abs/2305.06988
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
zhl98 opened this issue a year ago · comments
Hello, I am very interested in your work. I want to know how to fine tune the model on the new dataset? Is it Localizer Self definition plus Answerer Fine tuning?