Zhang Erli's starred repositories

Co-Instruct

④[ECCV 2024, Comparison among Multiple Images!] A study on open-ended multi-image quality comparison: a dataset, a model and a benchmark.

License:MITStargazers:56Issues:0Issues:0

Q-Align

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Language:PythonLicense:MITStargazers:212Issues:0Issues:0

Q-Instruct

②[CVPR 2024] Low-level visual instruction tuning, with a 200K dataset and a model zoo for fine-tuned checkpoints.

Language:PythonLicense:MITStargazers:179Issues:0Issues:0

Q-Bench

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Language:Jupyter NotebookStargazers:216Issues:0Issues:0

Chinese-Q-Bench

[WIP@Oct 13] 质衡-基准测试 (Q-Bench in Chinese),包含中文版【底层视觉问答】和【底层视觉描述】数据集,以及中文提示下的图片质量评价。 We will release Q-Bench in more languages in the future.

Stargazers:18Issues:0Issues:0

ExplainableVQA

[ACMMM Oral, 2023] "Towards Explainable In-the-wild Video Quality Assessment: A Database and a Language-Prompted Approach"

Language:PythonLicense:MITStargazers:62Issues:0Issues:0

FAST-VQA-and-FasterVQA

[ECCV2022, TPAMI2023] FAST-VQA, and its extended version FasterVQA.

Language:Jupyter NotebookLicense:MITStargazers:239Issues:0Issues:0

DOVER

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Language:Jupyter NotebookLicense:MITStargazers:238Issues:0Issues:0