• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于部分引导的关系 Transformer 模型在细粒度视觉识别中的应用

Part-Guided Relational Transformers for Fine-Grained Visual Recognition.

出版信息

IEEE Trans Image Process. 2021;30:9470-9481. doi: 10.1109/TIP.2021.3126490. Epub 2021 Nov 18.

DOI:10.1109/TIP.2021.3126490
PMID:34780327
Abstract

Fine-grained visual recognition is to classify objects with visually similar appearances into subcategories, which has made great progress with the development of deep CNNs. However, handling subtle differences between different subcategories still remains a challenge. In this paper, we propose to solve this issue in one unified framework from two aspects, i.e., constructing feature-level interrelationships, and capturing part-level discriminative features. This framework, namely PArt-guided Relational Transformers (PART), is proposed to learn the discriminative part features with an automatic part discovery module, and to explore the intrinsic correlations with a feature transformation module by adapting the Transformer models from the field of natural language processing. The part discovery module efficiently discovers the discriminative regions which are highly-corresponded to the gradient descent procedure. Then the second feature transformation module builds correlations within the global embedding and multiple part embedding, enhancing spatial interactions among semantic pixels. Moreover, our proposed approach does not rely on additional part branches in the inference time and reaches state-of-the-art performance on 3 widely-used fine-grained object recognition benchmarks. Experimental results and explainable visualizations demonstrate the effectiveness of our proposed approach.

摘要

细粒度视觉识别是将具有相似外观的物体分为子类,随着深度卷积神经网络的发展,细粒度视觉识别取得了巨大的进展。然而,处理不同子类之间的细微差异仍然是一个挑战。在本文中,我们从两个方面提出了一种统一的框架来解决这个问题,即构建特征级别的相互关系和捕捉部分级别的鉴别特征。这个名为 PArt-guided Relational Transformers (PART) 的框架,旨在通过自适应自然语言处理领域的 Transformer 模型,利用自动部分发现模块学习具有鉴别力的部分特征,并利用特征转换模块探索内在相关性。部分发现模块通过与梯度下降过程高度对应的方式,有效地发现具有鉴别力的区域。然后,第二个特征转换模块在全局嵌入和多个部分嵌入之间建立相关性,增强语义像素之间的空间相互作用。此外,我们的方法在推理时不依赖于额外的部分分支,在三个广泛使用的细粒度对象识别基准上取得了最先进的性能。实验结果和可解释性可视化表明了我们方法的有效性。

相似文献

1
Part-Guided Relational Transformers for Fine-Grained Visual Recognition.基于部分引导的关系 Transformer 模型在细粒度视觉识别中的应用
IEEE Trans Image Process. 2021;30:9470-9481. doi: 10.1109/TIP.2021.3126490. Epub 2021 Nov 18.
2
SIM-OFE: Structure Information Mining and Object-Aware Feature Enhancement for Fine-Grained Visual Categorization.SIM-OFE:用于细粒度视觉分类的结构信息挖掘与目标感知特征增强
IEEE Trans Image Process. 2024;33:5312-5326. doi: 10.1109/TIP.2024.3459788. Epub 2024 Sep 27.
3
Multi-Granularity Part Sampling Attention for Fine-Grained Visual Classification.用于细粒度视觉分类的多粒度部分采样注意力机制
IEEE Trans Image Process. 2024;33:4529-4542. doi: 10.1109/TIP.2024.3441813. Epub 2024 Aug 23.
4
Feature relocation network for fine-grained image classification.用于细粒度图像分类的特征重定位网络。
Neural Netw. 2023 Apr;161:306-317. doi: 10.1016/j.neunet.2023.01.050. Epub 2023 Feb 4.
5
Fine-Grained 3D Shape Classification With Hierarchical Part-View Attention.基于层次化局部视图注意力的细粒度三维形状分类
IEEE Trans Image Process. 2021;30:1744-1758. doi: 10.1109/TIP.2020.3048623. Epub 2021 Jan 14.
6
Dual-Dependency Attention Transformer for Fine-Grained Visual Classification.用于细粒度视觉分类的双依赖注意力变换器
Sensors (Basel). 2024 Apr 6;24(7):2337. doi: 10.3390/s24072337.
7
Fine-Grained Recognition With Learnable Semantic Data Augmentation.基于可学习语义数据增强的细粒度识别
IEEE Trans Image Process. 2024;33:3130-3144. doi: 10.1109/TIP.2024.3364500. Epub 2024 Apr 30.
8
Discriminative Suprasphere Embedding for Fine-Grained Visual Categorization.用于细粒度视觉分类的判别性超球面嵌入
IEEE Trans Neural Netw Learn Syst. 2024 Apr;35(4):5092-5102. doi: 10.1109/TNNLS.2022.3202534. Epub 2024 Apr 4.
9
Content-Aware Rectified Activation for Zero-Shot Fine-Grained Image Retrieval.用于零样本细粒度图像检索的内容感知整流激活
IEEE Trans Pattern Anal Mach Intell. 2024 Jun;46(6):4366-4380. doi: 10.1109/TPAMI.2024.3355461. Epub 2024 May 7.
10
Object-Part Attention Model for Fine-Grained Image Classification.目标-部件注意力模型用于细粒度图像分类。
IEEE Trans Image Process. 2018 Mar;27(3):1487-1500. doi: 10.1109/TIP.2017.2774041. Epub 2017 Nov 15.