College of Information and Control Engineering, China University of Petroleum (East China), Qingdao 266580, China.
IEEE Trans Image Process. 2013 Jul;22(7):2676-87. doi: 10.1109/TIP.2013.2255302. Epub 2013 Mar 28.
The rapid development of computer hardware and Internet technology makes large scale data dependent models computationally tractable, and opens a bright avenue for annotating images through innovative machine learning algorithms. Semisupervised learning (SSL) therefore received intensive attention in recent years and was successfully deployed in image annotation. One representative work in SSL is Laplacian regularization (LR), which smoothes the conditional distribution for classification along the manifold encoded in the graph Laplacian, however, it is observed that LR biases the classification function toward a constant function that possibly results in poor generalization. In addition, LR is developed to handle uniformly distributed data (or single-view data), although instances or objects, such as images and videos, are usually represented by multiview features, such as color, shape, and texture. In this paper, we present multiview Hessian regularization (mHR) to address the above two problems in LR-based image annotation. In particular, mHR optimally combines multiple HR, each of which is obtained from a particular view of instances, and steers the classification function that varies linearly along the data manifold. We apply mHR to kernel least squares and support vector machines as two examples for image annotation. Extensive experiments on the PASCAL VOC'07 dataset validate the effectiveness of mHR by comparing it with baseline algorithms, including LR and HR.
计算机硬件和互联网技术的飞速发展使得大规模数据依赖模型在计算上变得可行,并为通过创新的机器学习算法对图像进行注释开辟了一条光明的道路。因此,半监督学习(SSL)近年来受到了广泛关注,并成功应用于图像注释。SSL 的一个代表性工作是拉普拉斯正则化(LR),它沿着图拉普拉斯编码的流形平滑分类的条件分布,然而,观察到 LR 使分类函数偏向于可能导致较差泛化的常数函数。此外,LR 是为处理均匀分布的数据(或单视图数据)而开发的,尽管实例或对象,如图像和视频,通常由多视图特征表示,如颜色、形状和纹理。在本文中,我们提出了多视图 Hessian 正则化(mHR)来解决基于 LR 的图像注释中的上述两个问题。特别是,mHR 最优地组合了多个 HR,每个 HR 都是从实例的特定视图获得的,并引导沿着数据流形线性变化的分类函数。我们将 mHR 应用于核最小二乘法和支持向量机作为图像注释的两个示例。在 PASCAL VOC'07 数据集上的广泛实验通过与基线算法(包括 LR 和 HR)进行比较,验证了 mHR 的有效性。