(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
Home Page:https://arxiv.org/abs/2308.09936
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool