基于多任务拓扑码本的大规模航空图像分类。

Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.

出版信息

IEEE Trans Cybern. 2016 Feb;46(2):535-45. doi: 10.1109/TCYB.2015.2408592. Epub 2015 Mar 16.

DOI:10.1109/TCYB.2015.2408592

Abstract

Fast and accurately categorizing the millions of aerial images on Google Maps is a useful technique in pattern recognition. Existing methods cannot handle this task successfully due to two reasons: 1) the aerial images' topologies are the key feature to distinguish their categories, but they cannot be effectively encoded by a conventional visual codebook and 2) it is challenging to build a realtime image categorization system, as some geo-aware Apps update over 20 aerial images per second. To solve these problems, we propose an efficient aerial image categorization algorithm. It focuses on learning a discriminative topological codebook of aerial images under a multitask learning framework. The pipeline can be summarized as follows. We first construct a region adjacency graph (RAG) that describes the topology of each aerial image. Naturally, aerial image categorization can be formulated as RAG-to-RAG matching. According to graph theory, RAG-to-RAG matching is conducted by enumeratively comparing all their respective graphlets (i.e., small subgraphs). To alleviate the high time consumption, we propose to learn a codebook containing topologies jointly discriminative to multiple categories. The learned topological codebook guides the extraction of the discriminative graphlets. Finally, these graphlets are integrated into an AdaBoost model for predicting aerial image categories. Experimental results show that our approach is competitive to several existing recognition models. Furthermore, over 24 aerial images are processed per second, demonstrating that our approach is ready for real-world applications.

摘要

在模式识别中，快速准确地对谷歌地图上的数百万张航拍图像进行分类是一项非常有用的技术。现有的方法由于以下两个原因无法成功处理这项任务：1）航拍图像的拓扑结构是区分其类别的关键特征，但它们无法被传统的视觉代码本有效地编码；2）构建实时图像分类系统具有挑战性，因为一些地理感知应用程序每秒更新超过 20 张航拍图像。为了解决这些问题，我们提出了一种高效的航拍图像分类算法。它专注于在多任务学习框架下学习具有判别力的航拍图像拓扑代码本。该流水线可以概括为：首先构建描述每个航拍图像拓扑结构的区域邻接图（RAG）。自然地，航拍图像分类可以被公式化为 RAG 到 RAG 的匹配。根据图论，RAG 到 RAG 的匹配是通过枚举比较所有各自的图元（即小子图）来进行的。为了减轻高时间消耗，我们提出学习一个包含对多个类别具有共同判别力的拓扑代码本。学习到的拓扑代码本指导提取具有判别力的图元。最后，这些图元被集成到 AdaBoost 模型中，用于预测航拍图像类别。实验结果表明，我们的方法与几个现有的识别模型具有竞争力。此外，每秒处理超过 24 张航拍图像，表明我们的方法已经准备好用于实际应用。

相似文献

Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.基于多任务拓扑码本的大规模航空图像分类。

IEEE Trans Cybern. 2016 Feb;46(2):535-45. doi: 10.1109/TCYB.2015.2408592. Epub 2015 Mar 16.

Discovering discriminative graphlets for aerial image categories recognition.发现有判别力的图元以识别航空图像类别。

IEEE Trans Image Process. 2013 Dec;22(12):5071-84. doi: 10.1109/TIP.2013.2278465. Epub 2013 Aug 14.

Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs.弱监督多模态核分类航拍图像。

IEEE Trans Image Process. 2017 Aug;26(8):3748-3758. doi: 10.1109/TIP.2016.2639438. Epub 2016 Dec 14.

Semi-Supervised Perception Augmentation for Aerial Photo Topologies Understanding.半监督感知增强在航空影像拓扑理解中的应用。

IEEE Trans Image Process. 2021;30:7803-7814. doi: 10.1109/TIP.2021.3079820. Epub 2021 Sep 14.

Structurally enhanced incremental neural learning for image classification with subgraph extraction.用于图像分类的具有子图提取功能的结构增强增量神经学习

Int J Neural Syst. 2014 Nov;24(7):1450024. doi: 10.1142/S0129065714500245. Epub 2014 Aug 12.

Learning a Probabilistic Topology Discovering Model for Scene Categorization.学习用于场景分类的概率拓扑发现模型。

IEEE Trans Neural Netw Learn Syst. 2015 Aug;26(8):1622-34. doi: 10.1109/TNNLS.2014.2347398. Epub 2014 Sep 4.

Detecting Densely Distributed Graph Patterns for Fine-Grained Image Categorization.检测密集分布的图模式进行细粒度图像分类。

IEEE Trans Image Process. 2016 Feb;25(2):553-65. doi: 10.1109/TIP.2015.2502147. Epub 2015 Nov 19.

Task-dependent visual-codebook compression.任务相关的视觉码本压缩。

IEEE Trans Image Process. 2012 Apr;21(4):2282-93. doi: 10.1109/TIP.2011.2176950. Epub 2011 Nov 22.

Image Categorization by Learning a Propagated Graphlet Path.基于传播图元路径学习的图像分类。

IEEE Trans Neural Netw Learn Syst. 2016 Mar;27(3):674-85. doi: 10.1109/TNNLS.2015.2444417. Epub 2015 Nov 23.

Beyond Explicit Codebook Generation: Visual Representation Using Implicitly Transferred Codebooks.超越显式代码本生成：使用隐式转移代码本进行视觉表示。

IEEE Trans Image Process. 2015 Dec;24(12):5777-88. doi: 10.1109/TIP.2015.2485783. Epub 2015 Oct 1.

基于多任务拓扑码本的大规模航空图像分类。

Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.

出版信息

IEEE Trans Cybern. 2016 Feb;46(2):535-45. doi: 10.1109/TCYB.2015.2408592. Epub 2015 Mar 16.

DOI:10.1109/TCYB.2015.2408592

PMID:25794407

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

基于多任务拓扑码本的大规模航空图像分类。

Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.

出版信息

相似文献

基于多任务拓扑码本的大规模航空图像分类。

Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.

出版信息

相似文献