ZhangErliCarl

Zhang Erli's starred repositories

Co-Instruct

④[ECCV 2024, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a benchmark.

MIT5600

Q-Align

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Language:PythonMIT21200

Q-Instruct

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Language:PythonMIT17900

Q-Bench

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Language:Jupyter Notebook21600

Chinese-Q-Bench

[WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese)，包含中文版【底层视觉问答】和【底层视觉描述】数据集，以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the future.

1800

ExplainableVQA

[ACMMM Oral, 2023] "Towards Explainable In-the-wild Video Quality Assessment: A Database and a Language-Prompted Approach"

Language:PythonMIT6200

FAST-VQA-and-FasterVQA

[ECCV2022, TPAMI2023] FAST-VQA, and its extended version FasterVQA.

Language:Jupyter NotebookMIT23900

DOVER

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Language:Jupyter NotebookMIT23800