On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances, and Million-AID

Yang Long, Gui Song Xia, Shengyang Li, Wen Yang, Michael Ying Yang, Xiao Xiang Zhu, Liangpei Zhang, Deren Li

Research output: Contribution to journalArticlepeer-review

146 Citations (SciVal)

Abstract

The past years have witnessed great progress on remote sensing (RS) image interpretation and its wide applications. With RS images becoming more accessible than ever before, there is an increasing demand for the automatic interpretation of these images. In this context, the benchmark datasets serve as an essential prerequisites for developing and testing intelligent interpretation algorithms. After reviewing existing benchmark datasets in the research community of RS image interpretation, this article discusses the problem of how to efficiently prepare a suitable benchmark dataset for RS image interpretation. Specifically, we first analyze the current challenges of developing intelligent algorithms for RS image interpretation with bibliometric investigations. We then present the general guidances on creating benchmark datasets in efficient manners. Following the presented guidances, we also provide an example on building RS image dataset, i.e., Million Aerial Image Dataset (Online. Available: https://captain-whu.github.io/DiRS/0), a new large-scale benchmark dataset containing a million instances for RS image scene classification. Several challenges and perspectives in RS image annotation are finally discussed to facilitate the research in benchmark dataset construction. We do hope this article will provide the RS community an overall perspective on constructing large-scale and practical image datasets for further research, especially data-driven ones.

Original languageEnglish
Article number9393553
Pages (from-to)4205-4230
Number of pages26
JournalIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Volume14
Early online date1 Apr 2021
DOIs
Publication statusPublished - 1 Apr 2021

Funding

Manuscript received January 23, 2021; revised March 12, 2021; accepted March 22, 2021. Date of publication April 1, 2021; date of current version April 26, 2021. This work was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 61922065, Grant 61771350, Grant 41820104006, Grant 61871299, and Grant 92038301, in part by the Science and Technology Major Project of Hubei Province (Next-Generation AI Technologies) under Grant 2019AEA170, and in part by the German Federal Ministry of Education, and Research (BMBF) in the framework of the International Future AI Lab “AI4EO – Artificial Intelligence for Earth Observation: Reasoning, Uncertainties, Ethics, and Beyond.” (Corresponding author: Gui-Song Xia.) Yang Long, Liangpei Zhang, and Deren Li are with the State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing (LIESMARS), Wuhan University, Wuhan 430079, China (e-mail: [email protected]; [email protected]; [email protected]).

FundersFunder number
National Natural Science Foundation of China41820104006, 61771350, 92038301, 61871299, 61922065
Bundesministerium für Bildung und Forschung
Science and Technology Program of Hubei Province2019AEA170

    Keywords

    • Annotation
    • benchmark datasets
    • Million Aerial Image Dataset (Million-AID)
    • remote sensing (RS) image interpretation
    • scene classification

    ASJC Scopus subject areas

    • Computers in Earth Sciences
    • Atmospheric Science

    Fingerprint

    Dive into the research topics of 'On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances, and Million-AID'. Together they form a unique fingerprint.

    Cite this