Yui010206 / SeViLA

[NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering

Home Page:https://arxiv.org/abs/2305.06988

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How to fintune on my own data?

zhl98 opened this issue · comments

commented

Hello, I am very interested in your work.
I want to know how to fine tune the model on the new dataset? Is it Localizer Self definition plus Answerer Fine tuning?
image