大规模航空影像分类的跨分辨率视觉感知增强方法

Massive-Scale Aerial Photo Categorization by Cross-Resolution Visual Perception Enhancement.

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4017-4030. doi: 10.1109/TNNLS.2021.3055548. Epub 2022 Aug 3.

DOI:10.1109/TNNLS.2021.3055548

Abstract

Categorizing aerial photographs with varied weather/lighting conditions and sophisticated geomorphic factors is a key module in autonomous navigation, environmental evaluation, and so on. Previous image recognizers cannot fulfill this task due to three challenges: 1) localizing visually/semantically salient regions within each aerial photograph in a weakly annotated context due to the unaffordable human resources required for pixel-level annotation; 2) aerial photographs are generally with multiple informative attributes (e.g., clarity and reflectivity), and we have to encode them for better aerial photograph modeling; and 3) designing a cross-domain knowledge transferal module to enhance aerial photograph perception since multiresolution aerial photographs are taken asynchronistically and are mutually complementary. To handle the above problems, we propose to optimize aerial photograph's feature learning by leveraging the low-resolution spatial composition to enhance the deep learning of perceptual features with a high resolution. More specifically, we first extract many BING-based object patches (Cheng et al., 2014) from each aerial photograph. A weakly supervised ranking algorithm selects a few semantically salient ones by seamlessly incorporating multiple aerial photograph attributes. Toward an interpretable aerial photograph recognizer indicative to human visual perception, we construct a gaze shifting path (GSP) by linking the top-ranking object patches and, subsequently, derive the deep GSP feature. Finally, a cross-domain multilabel SVM is formulated to categorize each aerial photograph. It leverages the global feature from low-resolution counterparts to optimize the deep GSP feature from a high-resolution aerial photograph. Comparative results on our compiled million-scale aerial photograph set have demonstrated the competitiveness of our approach. Besides, the eye-tracking experiment has shown that our ranking-based GSPs are over 92% consistent with the real human gaze shifting sequences.

摘要

对具有不同天气/光照条件和复杂地貌因素的航空照片进行分类，是自主导航、环境评估等领域的关键模块。由于在弱标注环境下对每张航空照片进行像素级标注所需的人力成本过高，以前的图像识别器无法完成这项任务。主要存在以下三个挑战：1）在弱标注环境下，由于无法承受的人力成本，每个航空照片中视觉/语义显著区域的本地化；2）航空照片通常具有多个信息属性（例如清晰度和反射率），我们必须对其进行编码以更好地进行航空照片建模；3）设计跨领域知识迁移模块，以增强航空照片的感知能力，因为多分辨率航空照片是异步拍摄的，并且是相互补充的。为了解决上述问题，我们提出通过利用低分辨率空间构成来优化航空照片的特征学习，以增强对高分辨率感知特征的深度学习。更具体地说，我们首先从每张航空照片中提取许多基于 BING 的目标补丁（Cheng 等人，2014 年）。一个弱监督的排序算法通过无缝整合多个航空照片属性，选择少数语义上显著的目标补丁。为了构建一个对人类视觉感知具有指示意义的可解释的航空照片识别器，我们通过链接顶级目标补丁构建一个注视转移路径（GSP），并从中推导出深度 GSP 特征。最后，构建一个跨领域多标签 SVM 来对每张航空照片进行分类。它利用低分辨率对应物的全局特征来优化来自高分辨率航空照片的深度 GSP 特征。在我们编译的百万规模航空照片集上的比较结果表明了我们方法的竞争力。此外，眼动追踪实验表明，我们基于排序的 GSP 与真实的人类注视转移序列的一致性超过 92%。

相似文献

Massive-Scale Aerial Photo Categorization by Cross-Resolution Visual Perception Enhancement.大规模航空影像分类的跨分辨率视觉感知增强方法

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4017-4030. doi: 10.1109/TNNLS.2021.3055548. Epub 2022 Aug 3.

Semi-Supervised Perception Augmentation for Aerial Photo Topologies Understanding.半监督感知增强在航空影像拓扑理解中的应用。

IEEE Trans Image Process. 2021;30:7803-7814. doi: 10.1109/TIP.2021.3079820. Epub 2021 Sep 14.

Community-Aware Photo Quality Evaluation by Deeply Encoding Human Perception.基于深度学习的人类感知编码的社区感知图像质量评价

IEEE Trans Cybern. 2022 May;52(5):3136-3146. doi: 10.1109/TCYB.2019.2937319. Epub 2022 May 19.

LR Aerial Photo Categorization by Cross-Resolution Perceptual Knowledge Propagation.基于跨分辨率感知知识传播的航空照片分类

IEEE Trans Neural Netw Learn Syst. 2025 Feb;36(2):3384-3395. doi: 10.1109/TNNLS.2024.3349515. Epub 2025 Feb 6.

Perceptually Aware Image Retargeting for Mobile Devices.移动端有感知的图像重定向。

IEEE Trans Image Process. 2018 May;27(5):2301-2313. doi: 10.1109/TIP.2017.2779272.

Deep Active Learning with Contaminated Tags for Image Aesthetics Assessment.用于图像美学评估的带有污染标签的深度主动学习

IEEE Trans Image Process. 2018 Apr 18. doi: 10.1109/TIP.2018.2828326.

Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs.弱监督多模态核分类航拍图像。

IEEE Trans Image Process. 2017 Aug;26(8):3748-3758. doi: 10.1109/TIP.2016.2639438. Epub 2016 Dec 14.

Bioinspired Scene Classification by Deep Active Learning With Remote Sensing Applications.基于深度学习的主动学习在遥感场景分类中的应用

IEEE Trans Cybern. 2022 Jul;52(7):5682-5694. doi: 10.1109/TCYB.2020.2981480. Epub 2022 Jul 4.

Scene Categorization by Deeply Learning Gaze Behavior in a Semisupervised Context.在半监督环境下通过深度学习注视行为进行场景分类

IEEE Trans Cybern. 2021 Aug;51(8):4265-4276. doi: 10.1109/TCYB.2019.2913016. Epub 2021 Aug 4.

Scene Categorization Using Deeply Learned Gaze Shifting Kernel.基于深度学习的注视转移核的场景分类。

IEEE Trans Cybern. 2019 Jun;49(6):2156-2167. doi: 10.1109/TCYB.2018.2820731. Epub 2018 May 11.

大规模航空影像分类的跨分辨率视觉感知增强方法

Massive-Scale Aerial Photo Categorization by Cross-Resolution Visual Perception Enhancement.

出版信息

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4017-4030. doi: 10.1109/TNNLS.2021.3055548. Epub 2022 Aug 3.

DOI:10.1109/TNNLS.2021.3055548

PMID:33587709

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

大规模航空影像分类的跨分辨率视觉感知增强方法

Massive-Scale Aerial Photo Categorization by Cross-Resolution Visual Perception Enhancement.

出版信息

相似文献

大规模航空影像分类的跨分辨率视觉感知增强方法

Massive-Scale Aerial Photo Categorization by Cross-Resolution Visual Perception Enhancement.

出版信息

相似文献