FoundationVision / Groma

[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization

Home Page: https://groma-mllm.github.io/

FoundationVision/Groma Issues