Projects per year
Abstract
Visual Question Answering can be a functionally relevant task if purposed as such. In this paper, we aim to investigate and evaluate its efficacy in terms of localization-based question answering. We do this specifically in the context of autonomous driving where this functionality is important. To achieve our aim, we provide a new dataset, Auto-QA. Our new dataset is built over the Argoverse dataset and provides a truly multi-modal setting with seven views per frame and point-cloud LIDAR data being available for answering a localization-based question. We contribute localized attention adaptations of most popular VQA baselines and evaluate them on this task. We also provide joint point-cloud and image-based baselines that perform well on this task. An additional evaluation that we perform is to analyse whether the attention module is accurate or not for the image-based VQA baselines. To summarize, through this work we thoroughly analyze the localization abilities through visual question answering for autonomous driving and provide a new benchmark task for the same. Our best joint baseline model achieves a useful 74.8% accuracy on this task. We release our dataset and source code for our baseline modules in the following webpage: https: //delta-lab-iitk.github.io/AUTO-QA/
Original language | English |
---|---|
Title of host publication | 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW) |
Place of Publication | U. S. A. |
Publisher | IEEE |
Pages | 272-281 |
Number of pages | 10 |
Volume | 2022 |
ISBN (Electronic) | 9781665458245 |
DOIs | |
Publication status | Published - 15 Feb 2022 |
Event | 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2022 - Waikoloa, USA United States Duration: 4 Jan 2022 → 8 Jan 2022 |
Publication series
Name | Proceedings - 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2022 |
---|---|
ISSN (Electronic) | 2690-621X |
Conference
Conference | 2022 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops, WACVW 2022 |
---|---|
Country/Territory | USA United States |
City | Waikoloa |
Period | 4/01/22 → 8/01/22 |
ASJC Scopus subject areas
- Computer Science Applications
- Computer Vision and Pattern Recognition
Fingerprint
Dive into the research topics of 'Auto QA: The Question Is Not only What, but Also Where'. Together they form a unique fingerprint.-
Centre for the Analysis of Motion, Entertainment Research and Applications (CAMERA) - 2.0
Campbell, N. (PI), Cosker, D. (PI), Bilzon, J. (CoI), Campbell, N. (CoI), Cazzola, D. (CoI), Colyer, S. (CoI), Cosker, D. (CoI), Lutteroth, C. (CoI), McGuigan, P. (CoI), O'Neill, E. (CoI), Petrini, K. (CoI), Proulx, M. (CoI) & Yang, Y. (CoI)
Engineering and Physical Sciences Research Council
1/11/20 → 31/10/25
Project: Research council
-
Centre for the Analysis of Motion, Entertainment Research and Applications (CAMERA)
Cosker, D. (PI), Bilzon, J. (CoI), Campbell, N. (CoI), Cazzola, D. (CoI), Colyer, S. (CoI), Fincham Haines, T. (CoI), Hall, P. (CoI), Kim, K. I. (CoI), Lutteroth, C. (CoI), McGuigan, P. (CoI), O'Neill, E. (CoI), Richardt, C. (CoI), Salo, A. (CoI), Seminati, E. (CoI), Tabor, A. (CoI) & Yang, Y. (CoI)
Engineering and Physical Sciences Research Council
1/09/15 → 28/02/21
Project: Research council