AILab-CVC / SEED-Bench

(CVPR2024)A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Update for mPLUG-Owl

MAGAer13 opened this issue · comments

Thanks for the excellent evaluation work!
The results of mPLUG-Owl seems to be the initial release version. Now we have trained in more image-text pairs with the latest version, which shows promising on MMBench. Would you like to try to evaluate it?

We will inform the results to you as well as releasing the checkpoint of latest version.

Thank you for your attention to our SEED-Bench.

We have released SEED-Bench leaderboard in https://huggingface.co/spaces/AILab-CVC/SEED-Bench_Leaderboard and you can update the results of your models in the leaderboard by following our evaluation instructions.