Multiview Hessian regularization for image annotation.

Author information

College of Information and Control Engineering, China University of Petroleum (East China), Qingdao 266580, China.

Publication information

IEEE Trans Image Process. 2013 Jul;22(7):2676-87. doi: 10.1109/TIP.2013.2255302. Epub 2013 Mar 28.

DOI:10.1109/TIP.2013.2255302

Abstract

The rapid development of computer hardware and Internet technology makes large-scale, data-dependent models computationally tractable and opens a promising avenue for annotating images through innovative machine learning algorithms. Semisupervised learning (SSL) has therefore received intensive attention in recent years and has been successfully deployed in image annotation. One representative work in SSL is Laplacian regularization (LR), which smooths the conditional distribution for classification along the manifold encoded in the graph Laplacian. However, it has been observed that LR biases the classification function toward a constant function, which can result in poor generalization. In addition, LR was developed to handle uniformly distributed data (or single-view data), although instances or objects, such as images and videos, are usually represented by multiview features, such as color, shape, and texture. In this paper, we present multiview Hessian regularization (mHR) to address these two problems in LR-based image annotation. In particular, mHR optimally combines multiple Hessian regularizations (HR), each of which is obtained from a particular view of the instances, and steers the classification function to vary linearly along the data manifold. We apply mHR to kernel least squares and support vector machines as two examples for image annotation. Extensive experiments on the PASCAL VOC'07 dataset validate the effectiveness of mHR by comparing it with baseline algorithms, including LR and HR.
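The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the per-view Hessian regularization matrices have already been computed by some upstream step (not shown), that the view weights are fixed by hand rather than learned as in the paper, and that the solver follows the standard closed form for manifold-regularized kernel least squares, with the graph Laplacian replaced by the weighted sum of per-view matrices. The function names `combine_view_regularizers` and `fit_regularized_kls`, and the parameters `gamma_a` and `gamma_i`, are hypothetical.

```python
import numpy as np


def combine_view_regularizers(regs, betas):
    """Weighted sum of per-view regularization matrices.

    regs  : list of (n, n) symmetric PSD matrices, one per view
            (assumed precomputed, e.g. Hessian energy matrices).
    betas : non-negative view weights that sum to 1 (fixed here by
            assumption; the paper learns the combination).
    """
    betas = np.asarray(betas, dtype=float)
    if np.any(betas < 0) or not np.isclose(betas.sum(), 1.0):
        raise ValueError("betas must be non-negative and sum to 1")
    return sum(b * R for b, R in zip(betas, regs))


def fit_regularized_kls(K, y_labeled, R, gamma_a=1e-2, gamma_i=1e-2):
    """Closed-form manifold-regularized kernel least squares.

    K         : (n, n) kernel matrix over labeled + unlabeled points,
                with the labeled points listed first.
    y_labeled : (l,) targets for the first l points.
    R         : (n, n) PSD regularizer (here, the combined matrix).

    Minimizes (1/l) * sum_i (y_i - f(x_i))^2
              + gamma_a * ||f||_K^2 + (gamma_i / n^2) * f^T R f
    over f = K @ alpha, and returns alpha.
    """
    n = K.shape[0]
    l = len(y_labeled)
    J = np.zeros((n, n))
    J[:l, :l] = np.eye(l)            # selects the labeled points
    y = np.zeros(n)
    y[:l] = y_labeled                # unlabeled targets padded with zeros
    A = J @ K + gamma_a * l * np.eye(n) + (gamma_i * l / n**2) * (R @ K)
    return np.linalg.solve(A, y)
```

Predictions on all points are then `K @ alpha`. Fixing the weights keeps the sketch short; learning them jointly with the classifier, as the abstract indicates, would require an additional optimization step that is not reproduced here.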

Similar articles

p-Laplacian Regularization for Scene Recognition.

IEEE Trans Cybern. 2019 Aug;49(8):2927-2940. doi: 10.1109/TCYB.2018.2833843. Epub 2018 May 22.

Manifold regularized multitask learning for semi-supervised multilabel image classification.

IEEE Trans Image Process. 2013 Feb;22(2):523-36. doi: 10.1109/TIP.2012.2218825. Epub 2012 Sep 13.

Laplacian embedded regression for scalable manifold regularization.

IEEE Trans Neural Netw Learn Syst. 2012 Jun;23(6):902-15. doi: 10.1109/TNNLS.2012.2190420.

Multiview vector-valued manifold regularization for multilabel image classification.

IEEE Trans Neural Netw Learn Syst. 2013 May;24(5):709-22. doi: 10.1109/TNNLS.2013.2238682.

Deformed graph Laplacian for semisupervised learning.

IEEE Trans Neural Netw Learn Syst. 2015 Oct;26(10):2261-74. doi: 10.1109/TNNLS.2014.2376936. Epub 2015 Jan 15.

Semisupervised Support Vector Machines With Tangent Space Intrinsic Manifold Regularization.

IEEE Trans Neural Netw Learn Syst. 2016 Sep;27(9):1827-39. doi: 10.1109/TNNLS.2015.2461009. Epub 2015 Aug 10.

Grassmannian regularized structured multi-view embedding for image classification.

IEEE Trans Image Process. 2013 Jul;22(7):2646-60. doi: 10.1109/TIP.2013.2255300. Epub 2013 Mar 28.

Successive overrelaxation for Laplacian support vector machine.

IEEE Trans Neural Netw Learn Syst. 2015 Apr;26(4):674-683. doi: 10.1109/TNNLS.2014.2320738.

Multiview matrix completion for multilabel image classification.

IEEE Trans Image Process. 2015 Aug;24(8):2355-68. doi: 10.1109/TIP.2015.2421309. Epub 2015 Apr 9.

Cited by

Multisource working condition recognition via nonlinear kernel learning and -Laplacian manifold learning.

Heliyon. 2024 Feb 20;10(5):e26436. doi: 10.1016/j.heliyon.2024.e26436. eCollection 2024 Mar 15.

Screening obstructive sleep apnea patients via deep learning of knowledge distillation in the lateral cephalogram.

Sci Rep. 2023 Oct 18;13(1):17788. doi: 10.1038/s41598-023-42880-x.

HTRPCA: Hypergraph Regularized Tensor Robust Principal Component Analysis for Sample Clustering in Tumor Omics Data.

Interdiscip Sci. 2022 Mar;14(1):22-33. doi: 10.1007/s12539-021-00441-8. Epub 2021 Jun 11.

MHSNMF: multi-view Hessian regularization based symmetric nonnegative matrix factorization for microbiome data analysis.

BMC Bioinformatics. 2020 Nov 18;21(Suppl 6):234. doi: 10.1186/s12859-020-03555-w.

Visual attention mechanism and support vector machine based automatic image annotation.

PLoS One. 2018 Nov 6;13(11):e0206971. doi: 10.1371/journal.pone.0206971. eCollection 2018.

A systematic evaluation of the scale invariance of texture recognition methods.

Pattern Anal Appl. 2015;18(4):945-969. doi: 10.1007/s10044-014-0435-1. Epub 2014 Dec 9.

Hessian-regularized co-training for social activity recognition.

PLoS One. 2014 Sep 26;9(9):e108474. doi: 10.1371/journal.pone.0108474. eCollection 2014.

Biview learning for human posture segmentation from 3D points cloud.

PLoS One. 2014 Jan 20;9(1):e85811. doi: 10.1371/journal.pone.0085811. eCollection 2014.

Dual-force ISOMAP: a new relevance feedback method for medical image retrieval.

PLoS One. 2013 Dec 31;8(12):e84096. doi: 10.1371/journal.pone.0084096. eCollection 2013.

Multiview locally linear embedding for effective medical image retrieval.

PLoS One. 2013 Dec 13;8(12):e82409. doi: 10.1371/journal.pone.0082409. eCollection 2013.
