空间上下文感知目标注意网络的多标签图像分类。

Spatial Context-Aware Object-Attentional Network for Multi-Label Image Classification.

出版信息

IEEE Trans Image Process. 2023;32:3000-3012. doi: 10.1109/TIP.2023.3266161. Epub 2023 May 26.

DOI:10.1109/TIP.2023.3266161

Abstract

Multi-label image classification is a fundamental but challenging task in computer vision. To tackle the problem, the label-related semantic information is often exploited, but the background context and spatial semantic information of related objects are not fully utilized. To address these issues, a multi-branch deep neural network is proposed in this paper. The first branch is designed to extract the discriminant information from regions of interest to detect target objects. In the second branch, a spatial context-aware approach is proposed to better capture the contextual information of an object in its surroundings by using an adaptive patch expansion mechanism. It helps the detection of small objects that are easily lost without the support of context information. The third one, the object-attentional branch, exploits the spatial semantic relations between the target object and its related objects, to better detect partially occluded, small or dim objects with the support of those easily detectable objects. To better encode such relations, an attention mechanism jointly considering the spatial and semantic relations between objects is developed. Two widely used benchmark datasets for multi-labeling classification, MS COCO and PASCAL VOC, are used to evaluate the proposed framework. The experimental results demonstrate that the proposed method outperforms the state-of-the-art methods for multi-label image classification.

摘要

多标签图像分类是计算机视觉中的一项基本但具有挑战性的任务。为了解决这个问题，通常会利用与标签相关的语义信息，但相关对象的背景上下文和空间语义信息并未得到充分利用。针对这些问题，本文提出了一种多分支深度神经网络。第一分支旨在从感兴趣区域中提取判别信息，以检测目标对象。第二分支提出了一种空间上下文感知方法，通过自适应补丁扩展机制更好地捕获对象周围的上下文信息。这有助于检测到没有上下文信息支持很容易丢失的小目标。第三分支，即目标注意分支，利用目标对象与其相关对象之间的空间语义关系，在那些容易检测到的对象的支持下更好地检测部分遮挡、小或暗淡的对象。为了更好地编码这些关系，开发了一种同时考虑对象之间空间和语义关系的注意力机制。使用两个广泛用于多标签分类的基准数据集，即 MS COCO 和 PASCAL VOC，来评估所提出的框架。实验结果表明，所提出的方法在多标签图像分类方面优于最新方法。

相似文献

Spatial Context-Aware Object-Attentional Network for Multi-Label Image Classification.空间上下文感知目标注意网络的多标签图像分类。

IEEE Trans Image Process. 2023;32:3000-3012. doi: 10.1109/TIP.2023.3266161. Epub 2023 May 26.

Multi-Label Hashing for Dependency Relations Among Multiple Objectives.多目标间依赖关系的多标签哈希

IEEE Trans Image Process. 2023;32:1759-1773. doi: 10.1109/TIP.2023.3251028. Epub 2023 Mar 14.

MBAN: multi-branch attention network for small object detection.MBAN：用于小目标检测的多分支注意力网络。

PeerJ Comput Sci. 2024 Mar 29;10:e1965. doi: 10.7717/peerj-cs.1965. eCollection 2024.

ADR-Net: Context extraction network based on M-Net for medical image segmentation.ADR-Net：基于M-Net的医学图像分割上下文提取网络。

Med Phys. 2020 Sep;47(9):4254-4264. doi: 10.1002/mp.14364. Epub 2020 Aug 2.

Video Captioning with Object-Aware Spatio-Temporal Correlation and Aggregation.具有目标感知时空相关性与聚合的视频字幕

IEEE Trans Image Process. 2020 Apr 27. doi: 10.1109/TIP.2020.2988435.

Weakly Supervised Object Detection Using Proposal- and Semantic-Level Relationships.利用提议级和语义级关系的弱监督目标检测

IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3349-3363. doi: 10.1109/TPAMI.2020.3046647. Epub 2022 May 5.

Attentional feature pyramid network for small object detection.注意特征金字塔网络用于小目标检测。

Neural Netw. 2022 Nov;155:439-450. doi: 10.1016/j.neunet.2022.08.029. Epub 2022 Sep 5.

A Novel Upsampling and Context Convolution for Image Semantic Segmentation.一种用于图像语义分割的新型上采样与上下文卷积

Sensors (Basel). 2021 Mar 20;21(6):2170. doi: 10.3390/s21062170.

Learning From Pixel-Level Label Noise: A New Perspective for Semi-Supervised Semantic Segmentation.从像素级标签噪声中学习：半监督语义分割的新视角

IEEE Trans Image Process. 2022;31:623-635. doi: 10.1109/TIP.2021.3134142. Epub 2021 Dec 22.

Coarse-to-Fine Semantic Segmentation From Image-Level Labels.从图像级标签进行粗到细的语义分割。

IEEE Trans Image Process. 2020;29:225-236. doi: 10.1109/TIP.2019.2926748. Epub 2019 Jul 12.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

空间上下文感知目标注意网络的多标签图像分类。

Spatial Context-Aware Object-Attentional Network for Multi-Label Image Classification.

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献