Does the model have Chinese OCR ability?

Question

luohao123 opened this issue 2 months ago · comments

Hi, I have two questions:

Does the model have OCR ability? LLaVA is limited to English OCR because of its vision encoder; does this model support Chinese OCR?
If the model's performance is not good, what is its limitation? Is it in the tokenizer or the LLM?