• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多视图矩阵补全在多标签图像分类中的应用。

Multiview matrix completion for multilabel image classification.

出版信息

IEEE Trans Image Process. 2015 Aug;24(8):2355-68. doi: 10.1109/TIP.2015.2421309. Epub 2015 Apr 9.

DOI:10.1109/TIP.2015.2421309
PMID:25872208
Abstract

There is growing interest in multilabel image classification due to its critical role in web-based image analytics-based applications, such as large-scale image retrieval and browsing. Matrix completion (MC) has recently been introduced as a method for transductive (semisupervised) multilabel classification, and has several distinct advantages, including robustness to missing data and background noise in both feature and label space. However, it is limited by only considering data represented by a single-view feature, which cannot precisely characterize images containing several semantic concepts. To utilize multiple features taken from different views, we have to concatenate the different features as a long vector. However, this concatenation is prone to over-fitting and often leads to very high time complexity in MC-based image classification. Therefore, we propose to weightedly combine the MC outputs of different views, and present the multiview MC (MVMC) framework for transductive multilabel image classification. To learn the view combination weights effectively, we apply a cross-validation strategy on the labeled set. In particular, MVMC splits the labeled set into two parts, and predicts the labels of one part using the known labels of the other part. The predicted labels are then used to learn the view combination coefficients. In the learning process, we adopt the average precision (AP) loss, which is particular suitable for multilabel image classification, since the ranking-based criteria are critical for evaluating a multilabel classification system. A least squares loss formulation is also presented for the sake of efficiency, and the robustness of the algorithm based on the AP loss compared with the other losses is investigated. Experimental evaluation on two real-world data sets (PASCAL VOC' 07 and MIR Flickr) demonstrate the effectiveness of MVMC for transductive (semisupervised) multilabel image classification, and show that MVMC can exploit complementary properties of different features and output-consistent labels for improved multilabel image classification.

摘要

多标签图像分类由于在基于网络的图像分析应用中(如大规模图像检索和浏览)起着至关重要的作用,因此引起了越来越多的关注。矩阵补全 (MC) 最近被引入作为一种转导(半监督)多标签分类方法,它具有几个明显的优势,包括对特征空间和标签空间中的缺失数据和背景噪声的鲁棒性。然而,它仅限于考虑由单个视图特征表示的数据,因此无法精确地描述包含多个语义概念的图像。为了利用来自不同视图的多个特征,我们必须将不同的特征连接成一个长向量。然而,这种连接容易产生过拟合,并且通常会导致基于 MC 的图像分类中的时间复杂度非常高。因此,我们提出加权组合不同视图的 MC 输出,并提出用于转导多标签图像分类的多视图 MC (MVMC) 框架。为了有效地学习视图组合权重,我们在标记集上应用了交叉验证策略。特别是,MVMC 将标记集分为两部分,并使用另一部分的已知标记来预测一部分的标签。然后使用预测的标签来学习视图组合系数。在学习过程中,我们采用平均精度 (AP) 损失,这对于多标签图像分类特别适合,因为基于排序的标准对于评估多标签分类系统至关重要。还提出了一种最小二乘损失公式,以提高效率,并且还研究了基于 AP 损失的算法与其他损失相比的稳健性。在两个真实数据集(PASCAL VOC'07 和 MIR Flickr)上的实验评估表明,MVMC 对于转导(半监督)多标签图像分类是有效的,并且表明 MVMC 可以利用不同特征的互补属性和输出一致的标签来提高多标签图像分类的性能。

相似文献

1
Multiview matrix completion for multilabel image classification.多视图矩阵补全在多标签图像分类中的应用。
IEEE Trans Image Process. 2015 Aug;24(8):2355-68. doi: 10.1109/TIP.2015.2421309. Epub 2015 Apr 9.
2
Multiview vector-valued manifold regularization for multilabel image classification.多视图向量值流形正则化的多标签图像分类。
IEEE Trans Neural Netw Learn Syst. 2013 May;24(5):709-22. doi: 10.1109/TNNLS.2013.2238682.
3
High-order distance-based multiview stochastic learning in image classification.基于高阶距离的图像分类多视图随机学习。
IEEE Trans Cybern. 2014 Dec;44(12):2431-42. doi: 10.1109/TCYB.2014.2307862.
4
Semisupervised multiview distance metric learning for cartoon synthesis.半监督多视图距离度量学习在卡通合成中的应用。
IEEE Trans Image Process. 2012 Nov;21(11):4636-48. doi: 10.1109/TIP.2012.2207395. Epub 2012 Jul 10.
5
Training DCNN by Combining Max-Margin, Max-Correlation Objectives, and Correntropy Loss for Multilabel Image Classification.通过结合最大间隔、最大相关目标和核相关损失训练深度卷积神经网络用于多标签图像分类
IEEE Trans Neural Netw Learn Syst. 2018 Jul;29(7):2896-2908. doi: 10.1109/TNNLS.2017.2705222. Epub 2017 Jun 13.
6
Correlated Logistic Model With Elastic Net Regularization for Multilabel Image Classification.基于弹性网络正则化的相关逻辑模型在多标签图像分类中的应用。
IEEE Trans Image Process. 2016 Aug;25(8):3801-13. doi: 10.1109/TIP.2016.2577382. Epub 2016 Jun 6.
7
Two-dimensional multilabel active learning with an efficient online adaptation model for image classification.用于图像分类的具有高效在线自适应模型的二维多标签主动学习
IEEE Trans Pattern Anal Mach Intell. 2009 Oct;31(10):1880-97. doi: 10.1109/TPAMI.2008.218.
8
Block-Row Sparse Multiview Multilabel Learning for Image Classification.基于块-行稀疏多视图多标签学习的图像分类方法。
IEEE Trans Cybern. 2016 Feb;46(2):450-61. doi: 10.1109/TCYB.2015.2403356. Epub 2015 Feb 27.
9
Manifold regularized multitask learning for semi-supervised multilabel image classification.多流正则化多任务学习在半监督多标签图像分类中的应用。
IEEE Trans Image Process. 2013 Feb;22(2):523-36. doi: 10.1109/TIP.2012.2218825. Epub 2012 Sep 13.
10
Align While Fusion: A Generalized Nonaligned Multiview Multilabel Classification Method.融合时对齐:一种广义的非对齐多视图多标签分类方法。
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7627-7636. doi: 10.1109/TNNLS.2024.3387577. Epub 2025 Apr 4.

引用本文的文献

1
Deep Matrix Factorization Based on Convolutional Neural Networks for Image Inpainting.基于卷积神经网络的深度矩阵分解用于图像修复
Entropy (Basel). 2022 Oct 20;24(10):1500. doi: 10.3390/e24101500.
2
Label recovery and label correlation co-learning for multi-view multi-label classification with incomplete labels.用于处理带有不完整标签的多视图多标签分类的标签恢复与标签相关性协同学习
Appl Intell (Dordr). 2023;53(8):9444-9462. doi: 10.1007/s10489-022-03945-y. Epub 2022 Aug 9.
3
VIGAN: Missing View Imputation with Generative Adversarial Networks.
VIGAN:使用生成对抗网络进行缺失视图插补
Proc IEEE Int Conf Big Data. 2017;2017:766-775. doi: 10.1109/BigData.2017.8257992. Epub 2018 Jan 15.