YuchenLiu98 / COMM

Pytorch code for paper From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

How much data in the first pretrain stage?

shipengai opened this issue · comments

about 100M?