DirtyHarryLYL / LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Add these papers please

Johnx69 opened this issue · comments

  1. Lumos : Empowering Multimodal LLMs with Scene Text Recognition
  2. Dej´a Vu Memorization in Vision-Language Models
  3. Red Teaming Visual Language Models
  4. VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation