There are 0 repository under video-qa topic.
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding
Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
https://arxiv.org/abs/1707.00836
Unifying the Video and Question Attentions for Open-Ended Video Question Answering
Video Question Answering via Hierarchical Spatio-Temporal Attention Networks