Repositories under the vqa-dataset topic:
A resource list and performance benchmark for blind video quality assessment (BVQA) models on user-generated content (UGC) datasets. [IEEE TIP'2021] "UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content", Zhengzhong Tu, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan C. Bovik
Visual Question Answering in the Medical Domain VQA-Med 2019
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
SciGraphQA
Visual Question Generation reading list
SSG-VQA is a Visual Question Answering (VQA) dataset on laparoscopic videos providing diverse, geometrically grounded, unbiased and surgical action-oriented queries generated using scene graphs.
Counterfactual Reasoning VQA Dataset
VQA-Med 2021
MAVERICS (Manually-vAlidated VQ²A Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering (VQA).
B.Sc. Final Project: LXMERT Model Compression for Visual Question Answering.
Multi-page document understanding and VQA using an OCR-free method
A lightweight deep learning model with a web application that answers image-based questions using a non-generative approach for the VizWiz Grand Challenge 2023, built by carefully curating the answer vocabulary and adding a linear layer on top of OpenAI's CLIP image and text encoders
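The non-generative approach described above (frozen CLIP encoders feeding a linear classification head over a fixed answer vocabulary) can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the embedding dimension, the tiny stand-in vocabulary, and the random weights are all placeholders, and in practice the embeddings would come from a real CLIP model.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed embedding size (CLIP ViT-B/32 produces 512-d image and text features).
EMB_DIM = 512
# Tiny stand-in for the curated answer vocabulary used in the VizWiz challenge.
VOCAB = ["yes", "no", "unanswerable", "white", "blue"]

# Placeholders for CLIP encoder outputs; real features would replace these.
image_emb = rng.standard_normal(EMB_DIM)
text_emb = rng.standard_normal(EMB_DIM)

# Linear classification head over the concatenated image+question embeddings.
W = rng.standard_normal((len(VOCAB), 2 * EMB_DIM)) * 0.01
b = np.zeros(len(VOCAB))

def answer(image_emb: np.ndarray, text_emb: np.ndarray) -> str:
    """Score every answer in the fixed vocabulary and return the best one."""
    features = np.concatenate([image_emb, text_emb])
    logits = W @ features + b
    return VOCAB[int(np.argmax(logits))]

print(answer(image_emb, text_emb))
```

Because the head only scores a closed vocabulary, the model can never hallucinate free-form text, which is the main appeal of a non-generative design for this challenge.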
This repository contains the data and code of the paper titled "IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models"
A real-time Visual Question Answering Framework
How well do the GPT-4V, Gemini Pro Vision, and Claude 3 Opus models perform zero-shot vision tasks on data structures?
Investigation of a VQA dataset. TensorFlow is used to implement a solution based on CNN and RNN architectures, augmented with ideas such as attention and positional features.
Egunean Behin Visual Question Answering Dataset
Grid features extraction for ICCV 2021 paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
Streamlit app demonstrating multi-modal (vision + language) modelling in PyTorch.
Part of our final-year project, involving complex NLP tasks along with experimentation on various datasets and different LLMs.
Visual Question Answering (VQA) software powered by Flask. The project combines images and questions to generate accurate responses, making interactive visual understanding easy to explore.
Deep Learning Web app that responds to any question about an image.
Visual Question Answering (VQA)