1National University of Defense Technology 2State Key Lab of CAD&CG, Zhejiang University 3Huazhong University of Science and Technology
We introduce AerialExtreMatch*, a
large-scale, high-fidelity dataset tailored for extreme-view image matching and UAV localization. It
consists of:
(a) Train Pair: a large-scale synthetic dataset containing 1.5
million RGB–depth image pairs rendered from high-quality 3D scenes, simulating diverse UAV and satellite
viewpoints to enable robust training of image matching models.
(b) Evaluation Pair: a 32-level benchmark of ~30,000 pairs graded by
overlap, scale, and pitch to enable fine-grained evaluation of image matching models.
(c) Localization: a real-world UAV localization set using DJI M300
RTK+H20T queries matched against both high-quality UAV-derived orthomosaic/DSM and lower-quality satellite
maps.
* All code and datasets are readily available at GitHub.
Pipeline for collecting RGB-depth pairs and estimating the co-visibility mask.
Evaluation Pair. We construct the Evaluation Pair by categorizing image pairs into discrete difficulty levels. For each image pair, we compute the corresponding metrics and discretize them into the following bins:
Localization.
WIP