AILab-CVC / SEED

Official implementation of SEED-LLaMA (ICLR 2024).

Home Page:https://ailab-cvc.github.io/seed

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Does model has Chinese OCR ability?

luohao123 opened this issue · comments

Hi, have 2 questions wanna ask:

  1. Does the model has OCR ability, unlike llava, it limited on English OCR ability in vision encoder, does this has?
  2. If the model performance is not good, what's it's limitation? Is in the Tokenzier, or LLM?