OpenVQA is a general platform for visual question ansering (VQA) research, with implementing state-of-the-art approaches (e.g., BUTD, MFH, BAN and MCAN) on different benchmark datasets like VQA-v2, GQA and CLEVR.
This project is released under the Apache 2.0 license.
Supported methods and benchmark datasets are shown in the below table. Results and models are available in MODEL_ZOO.md.
VQA-v2 | GQA | CLEVR | |
---|---|---|---|
BUTD | |||
MLB | |||
MUTAN | |||
MFB | |||
MFH | |||
BAN | |||
MCAN | ✓ |
Please refer to INSTALL.md for installation and dataset preparation.
Please see GETTING_STARTED.md for the basic usage of OpenVQA.
We appreciate all contributions to improve OpenVQA. Please refer to CONTRIBUTING.md for the contributing guideline.