学习图卷积网络进行多标签识别及应用。

Learning Graph Convolutional Networks for Multi-Label Recognition and Applications.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6969-6983. doi: 10.1109/TPAMI.2021.3063496. Epub 2023 May 5.

DOI:10.1109/TPAMI.2021.3063496

Abstract

The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve the recognition performance. To capture and explore such important information, we propose graph convolutional networks (GCNs) based models for multi-label image recognition, where directed graphs are constructed over classes and information is propagated between classes to learn inter-dependent class-level representations. Following this idea, we design two particular models that approach multi-label classification from different views. In our first model, the prior knowledge about the class dependencies is integrated into classifier learning. Specifically, we propose Classifier Learning GCN (C-GCN) to map class-level semantic representations (e.g., word embeddings) into classifiers that maintain the inter-class topology. In our second model, we decompose the visual representation of an image into a set of label-aware features and propose prediction learning GCN (P-GCN) to encode such features into inter-dependent image-level prediction scores. Furthermore, we also present an effective correlation matrix construction approach to capture inter-class relationships and consequently guide information propagation among classes. Empirical results on generic multi-label image recognition demonstrate that both of the proposed models can obviously outperform other existing state-of-the-arts. Moreover, the proposed methods also show advantages in some other multi-label classification related applications.

摘要

多标签图像识别的任务是预测图像中存在的一组对象标签。由于对象通常在图像中共同出现，因此希望建模标签依赖性以提高识别性能。为了捕获和探索这种重要信息，我们提出了基于图卷积网络（GCN）的模型用于多标签图像识别，其中在类别上构建有向图，并在类别之间传播信息以学习相互依赖的类别级表示。基于此思想，我们从不同角度设计了两个特殊的模型来进行多标签分类。在我们的第一个模型中，将类别的先验知识集成到分类器学习中。具体来说，我们提出了分类器学习 GCN（C-GCN），将类别级语义表示（例如，词嵌入）映射到保持类间拓扑的分类器中。在我们的第二个模型中，我们将图像的视觉表示分解为一组标签感知特征，并提出预测学习 GCN（P-GCN）将这些特征编码为相互依赖的图像级预测分数。此外，我们还提出了一种有效的相关矩阵构建方法来捕获类间关系，并因此指导类之间的信息传播。在通用多标签图像识别上的实验结果表明，所提出的两种模型都可以明显优于其他现有的最先进的方法。此外，所提出的方法在其他一些多标签分类相关应用中也显示出优势。

相似文献

Learning Graph Convolutional Networks for Multi-Label Recognition and Applications.学习图卷积网络进行多标签识别及应用。

IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6969-6983. doi: 10.1109/TPAMI.2021.3063496. Epub 2023 May 5.

Multi-label zero-shot learning with graph convolutional networks.基于图卷积网络的多标签零样本学习。

Neural Netw. 2020 Dec;132:333-341. doi: 10.1016/j.neunet.2020.09.010. Epub 2020 Sep 21.

MAMF-GCN: Multi-scale adaptive multi-channel fusion deep graph convolutional network for predicting mental disorder.MAMF-GCN：用于预测精神障碍的多尺度自适应多通道融合深度图卷积网络。

Comput Biol Med. 2022 Sep;148:105823. doi: 10.1016/j.compbiomed.2022.105823. Epub 2022 Jul 6.

Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition.基于知识引导的通用图像识别的多标签少样本学习。

IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1371-1384. doi: 10.1109/TPAMI.2020.3025814. Epub 2022 Feb 3.

Label Co-Occurrence Learning With Graph Convolutional Networks for Multi-Label Chest X-Ray Image Classification.基于图卷积网络的标签共现学习在多标签胸部 X 射线图像分类中的应用。

IEEE J Biomed Health Inform. 2020 Aug;24(8):2292-2302. doi: 10.1109/JBHI.2020.2967084. Epub 2020 Jan 16.

Multi-graph Fusion Graph Convolutional Networks with pseudo-label supervision.具有伪标签监督的多图融合图卷积网络

Neural Netw. 2023 Jan;158:305-317. doi: 10.1016/j.neunet.2022.11.027. Epub 2022 Nov 28.

MVS-GCN: A prior brain structure learning-guided multi-view graph convolution network for autism spectrum disorder diagnosis.MVS-GCN：一种基于先验脑结构学习的多视图图卷积网络自闭症谱系障碍诊断方法。

Comput Biol Med. 2022 Mar;142:105239. doi: 10.1016/j.compbiomed.2022.105239. Epub 2022 Jan 19.

Label-Aware Dual Graph Neural Networks for Multi-Label Fundus Image Classification.用于多标签眼底图像分类的标签感知双图神经网络

IEEE J Biomed Health Inform. 2025 Apr;29(4):2731-2743. doi: 10.1109/JBHI.2024.3457232. Epub 2025 Apr 4.

Unsupervised domain selective graph convolutional network for preoperative prediction of lymph node metastasis in gastric cancer.无监督域选择图卷积网络用于胃癌术前淋巴结转移预测。

Med Image Anal. 2022 Jul;79:102467. doi: 10.1016/j.media.2022.102467. Epub 2022 Apr 28.

Locality preserving dense graph convolutional networks with graph context-aware node representations.具有图上下文感知节点表示的局部保持密集图卷积网络

Neural Netw. 2021 Nov;143:108-120. doi: 10.1016/j.neunet.2021.05.031. Epub 2021 Jun 2.

引用本文的文献

Multi-Label Classification in Anime Illustrations Based on Hierarchical Attribute Relationships.基于层次属性关系的动漫插图多标签分类。

Sensors (Basel). 2023 May 16;23(10):4798. doi: 10.3390/s23104798.

Improving QSAR Modeling for Predictive Toxicology using Publicly Aggregated Semantic Graph Data and Graph Neural Networks.利用公共聚合语义图数据和图神经网络提高预测毒理学中的 QSAR 建模。

Pac Symp Biocomput. 2022;27:187-198.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

学习图卷积网络进行多标签识别及应用。

Learning Graph Convolutional Networks for Multi-Label Recognition and Applications.

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献