📖Some sample code for multimodal transformer use CLIP, include ITC, ITM, VQA
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool