Vision-Language Pre-training for Image Captioning and Question Answering
xiaohu306 opened this issue 3 years ago
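Where in the dataset should the region geometry information be read from? The features provided seem to contain only the feature vectors and the label probabilities.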
Already solved.