qiantianwen / NuScenes-QA

[AAAI 2024] NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.

[AAAI 2024] NuScenes-QA

Official repository for the AAAI 2024 paper NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario.

DataConstruction

🔥 News

  • 2023.12.09 Our paper was accepted to AAAI 2024!
  • 2023.09.04 Our NuScenes-QA dataset v1.0 was released.

⏳ To Do

  • Release question & answer data
  • Release visual features
  • Release training and testing code

🏃 Getting Started

Data Preparation

We have released our question-answer annotations. Please download them from HERE.

For the visual data, you can download the original nuScenes dataset from HERE and prepare the data by referring to this LINK. Alternatively, you can download our object-level features, extracted using pre-trained detection models, from HERE (to be released soon).
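
Once the annotations are downloaded, a first sanity check is to load them and group the questions by the nuScenes sample they refer to. The snippet below is only a minimal sketch: it assumes the annotation file is JSON, containing either a list of records or a dict holding that list under a `questions` key, and that each record carries a `sample_token` field. These names are assumptions for illustration, not a documented schema, so adjust them to match the downloaded file.

```python
import json
from collections import defaultdict
from pathlib import Path


def load_qa_annotations(path):
    """Load a QA annotation file and return its list of question records.

    Assumption: the file is JSON and is either a list of records or a dict
    that stores the list under a "questions" key (hypothetical layout).
    """
    with Path(path).open("r", encoding="utf-8") as f:
        data = json.load(f)
    if isinstance(data, dict):
        return data.get("questions", [])
    return data


def group_by_sample(records, key="sample_token"):
    """Group question records by the sample identifier stored under `key`.

    Assumption: every record has a `key` field ("sample_token" by default).
    """
    grouped = defaultdict(list)
    for record in records:
        grouped[record[key]].append(record)
    return dict(grouped)
```

Calling `group_by_sample(load_qa_annotations(path))` on a downloaded file returns a dict mapping each sample identifier to its questions, which is convenient when pairing questions with the visual data of the same sample.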

Training & Testing

Todo.

⭐ Others

If you have any questions about the dataset, its generation, or the object-level feature extraction, feel free to contact me at twqian19@fudan.edu.cn.

📖 Citation

If you find our paper and project useful, please consider citing:

@article{qian2023nuscenes,
  title={NuScenes-QA: A Multi-modal Visual Question Answering Benchmark for Autonomous Driving Scenario},
  author={Qian, Tianwen and Chen, Jingjing and Zhuo, Linhai and Jiao, Yang and Jiang, Yu-Gang},
  journal={arXiv preprint arXiv:2305.14836},
  year={2023}
}

License: MIT License