Zhang Chunjie, Cheng Jian, Tian Qi
IEEE Trans Cybern. 2019 Nov;49(11):3834-3843. doi: 10.1109/TCYB.2018.2845912. Epub 2018 Jun 27.
The categorization accuracies of objects have been greatly improved in recent years. However, large quantities of labeled images are needed. many methods fail when only few labeled images are available. To tackle the few-labeled object categorization problem, we need to represent and classify them from multiple views. In this paper, we propose a novel multiview, few-labeled object categorization algorithm by predicting the labels of images with view consistency (MVFL-VC). We use labeled images along with other unlabeled images in a unified framework. A mapping function is learned to model the correlations of images with their labels. Since there are no labeling information for unlabeled images, we simultaneously learn the mapping function and image labels by classification error minimization. We make use of multiview information for joint object categorization. Although different views represent different aspects of images, for one image, the predicted categories of multiple views should be consistent with each other. We learn the mapping function by minimizing the summed classification losses along with the discrepancy of predicted labels between different views in an alternative way. We conduct object categorization experiments on five public image datasets and compare with other semi-supervised methods. Experimental results well demonstrate the effectiveness of the proposed MVFL-VC method.
近年来,物体的分类准确率有了很大提高。然而,这需要大量的标注图像。当只有少量标注图像可用时,许多方法都会失效。为了解决少标注物体分类问题,我们需要从多个视角对其进行表示和分类。在本文中,我们提出了一种新颖的多视角、少标注物体分类算法,即通过预测具有视角一致性的图像标签(MVFL-VC)。我们在一个统一的框架中使用标注图像和其他未标注图像。学习一个映射函数来建模图像与其标签之间的相关性。由于未标注图像没有标注信息,我们通过最小化分类误差同时学习映射函数和图像标签。我们利用多视角信息进行联合物体分类。尽管不同视角代表了图像的不同方面,但对于一幅图像,多个视角预测的类别应该彼此一致。我们通过以交替方式最小化总的分类损失以及不同视角之间预测标签的差异来学习映射函数。我们在五个公开图像数据集上进行物体分类实验,并与其他半监督方法进行比较。实验结果充分证明了所提出的MVFL-VC方法的有效性。