• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

弱监督多模态核分类航拍图像。

Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs.

出版信息

IEEE Trans Image Process. 2017 Aug;26(8):3748-3758. doi: 10.1109/TIP.2016.2639438. Epub 2016 Dec 14.

DOI:10.1109/TIP.2016.2639438
PMID:28113314
Abstract

Accurately distinguishing aerial photographs from different categories is a promising technique in computer vision. It can facilitate a series of applications, such as video surveillance and vehicle navigation. In this paper, a new image kernel is proposed for effectively recognizing aerial photographs. The key is to encode high-level semantic cues into local image patches in a weakly supervised way, and integrate multimodal visual features using a newly developed hashing algorithm. The flowchart can be elaborated as follows. Given an aerial photo, we first extract a number of graphlets to describe its topological structure. For each graphlet, we utilize color and texture to capture its appearance, and a weakly supervised algorithm to capture its semantics. Thereafter, aerial photo categorization can be naturally formulated as graphlet-to-graphlet matching. As the number of graphlets from each aerial photo is huge, to accelerate matching, we present a hashing algorithm to seamlessly fuze the multiple visual features into binary codes. Finally, an image kernel is calculated by fast matching the binary codes corresponding to each graphlet. And a multi-class SVM is learned for aerial photo categorization. We demonstrate the advantage of our proposed model by comparing it with state-of-the-art image descriptors. Moreover, an in-depth study of the descriptiveness of the hash-based graphlet is presented.

摘要

准确区分不同类别的航空照片是计算机视觉中的一项很有前途的技术。它可以促进一系列应用,如视频监控和车辆导航。在本文中,提出了一种新的图像核方法,用于有效地识别航空照片。关键是采用弱监督的方式将高层语义线索编码到局部图像补丁中,并使用新开发的哈希算法集成多模态视觉特征。流程图可以详细说明如下。给定一张航空照片,我们首先提取一些图元来描述其拓扑结构。对于每个图元,我们利用颜色和纹理来捕捉其外观,并利用弱监督算法来捕捉其语义。此后,航空照片分类可以自然地表述为图元到图元的匹配。由于每张航空照片的图元数量巨大,为了加速匹配,我们提出了一种哈希算法,将多个视觉特征无缝融合到二进制代码中。最后,通过快速匹配每个图元对应的二进制代码来计算图像核。并学习多类 SVM 进行航空照片分类。通过与最先进的图像描述符进行比较,我们展示了我们提出的模型的优势。此外,还对基于哈希的图元的描述能力进行了深入研究。

相似文献

1
Weakly Supervised Multimodal Kernel for Categorizing Aerial Photographs.弱监督多模态核分类航拍图像。
IEEE Trans Image Process. 2017 Aug;26(8):3748-3758. doi: 10.1109/TIP.2016.2639438. Epub 2016 Dec 14.
2
Image Categorization by Learning a Propagated Graphlet Path.基于传播图元路径学习的图像分类。
IEEE Trans Neural Netw Learn Syst. 2016 Mar;27(3):674-85. doi: 10.1109/TNNLS.2015.2444417. Epub 2015 Nov 23.
3
Semi-Supervised Perception Augmentation for Aerial Photo Topologies Understanding.半监督感知增强在航空影像拓扑理解中的应用。
IEEE Trans Image Process. 2021;30:7803-7814. doi: 10.1109/TIP.2021.3079820. Epub 2021 Sep 14.
4
Discovering discriminative graphlets for aerial image categories recognition.发现有判别力的图元以识别航空图像类别。
IEEE Trans Image Process. 2013 Dec;22(12):5071-84. doi: 10.1109/TIP.2013.2278465. Epub 2013 Aug 14.
5
Large-Scale Aerial Image Categorization Using a Multitask Topological Codebook.基于多任务拓扑码本的大规模航空图像分类。
IEEE Trans Cybern. 2016 Feb;46(2):535-45. doi: 10.1109/TCYB.2015.2408592. Epub 2015 Mar 16.
6
A Probabilistic Associative Model for Segmenting Weakly-Supervised Images.一种用于分割弱监督图像的概率关联模型。
IEEE Trans Image Process. 2014 Sep;23(9):4150-4159. doi: 10.1109/TIP.2014.2344433. Epub 2014 Jul 30.
7
Hierarchical Recurrent Neural Hashing for Image Retrieval With Hierarchical Convolutional Features.基于层次卷积特征的层次递归神经网络哈希图像检索
IEEE Trans Image Process. 2018;27(1):106-120. doi: 10.1109/TIP.2017.2755766.
8
Massive-Scale Aerial Photo Categorization by Cross-Resolution Visual Perception Enhancement.大规模航空影像分类的跨分辨率视觉感知增强方法
IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4017-4030. doi: 10.1109/TNNLS.2021.3055548. Epub 2022 Aug 3.
9
Learning a Probabilistic Topology Discovering Model for Scene Categorization.学习用于场景分类的概率拓扑发现模型。
IEEE Trans Neural Netw Learn Syst. 2015 Aug;26(8):1622-34. doi: 10.1109/TNNLS.2014.2347398. Epub 2014 Sep 4.
10
Semantic Neighbor Graph Hashing for Multimodal Retrieval.基于语义邻居图的哈希的多模态检索。
IEEE Trans Image Process. 2018 Mar;27(3):1405-1417. doi: 10.1109/TIP.2017.2776745. Epub 2017 Nov 22.

引用本文的文献

1
Research Trends of Human-Computer Interaction Studies in Construction Hazard Recognition: A Bibliometric Review.建筑危险源识别中的人机交互研究趋势:文献计量学综述。
Sensors (Basel). 2021 Sep 15;21(18):6172. doi: 10.3390/s21186172.