andyyang13 / openvqa

A lightweight, scalable, and general framework for visual question ansering (VQA) research

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

OpenVQA (In Construction)

OpenVQA is a general platform for visual question ansering (VQA) research, with implementing state-of-the-art approaches (e.g., BUTD, MFH, BAN and MCAN) on different benchmark datasets like VQA-v2, GQA and CLEVR.

News & Updates

License

This project is released under the Apache 2.0 license.

Benchmark and Model Zoo

Supported methods and benchmark datasets are shown in the below table. Results and models are available in MODEL_ZOO.md.

VQA-v2 GQA CLEVR
BUTD
MLB
MUTAN
MFB
MFH
BAN
MCAN

Installation

Please refer to INSTALL.md for installation and dataset preparation.

Get Started

Please see GETTING_STARTED.md for the basic usage of OpenVQA.

Contributing

We appreciate all contributions to improve OpenVQA. Please refer to CONTRIBUTING.md for the contributing guideline.

About

A lightweight, scalable, and general framework for visual question ansering (VQA) research

License:Apache License 2.0


Languages

Language:Python 100.0%