IEEE Trans Image Process. 2018 Jan;27(1):281-292. doi: 10.1109/TIP.2017.2760512. Epub 2017 Oct 6.
Image set classification has attracted much attention because of its broad applications. Despite the success made so far, the problems of intra-class diversity and inter-class similarity still remain two major challenges. To explore a possible solution to these challenges, this paper proposes a novel approach, termed duplex metric learning (DML), for image set classification. The proposed DML consists of two progressive metric learning stages with different objectives used for feature learning and image classification, respectively. The metric learning regularization is not only used to learn powerful feature representations but also well explored to train an effective classifier. At the first stage, we first train a discriminative stacked autoencoder (DSAE) by layer-wisely imposing a metric learning regularization term on the neurons in the hidden layers and meanwhile minimizing the reconstruction error to obtain new feature mappings in which similar samples are mapped closely to each other and dissimilar samples are mapped farther apart. At the second stage, we discriminatively train a classifier and simultaneously fine-tune the DSAE by optimizing a new objective function, which consists of a classification error term and a metric learning regularization term. Finally, two simple voting strategies are devised for image set classification based on the learnt classifier. In the experiments, we extensively evaluate the proposed framework for the tasks of face recognition, object recognition, and face verification on several commonly-used data sets and state-of-the-art results are achieved in comparison with existing methods.
图像集分类由于其广泛的应用而受到了广泛关注。尽管迄今为止已经取得了成功,但类内多样性和类间相似性的问题仍然是两个主要挑战。为了探索解决这些挑战的一种可能方法,本文提出了一种新的方法,称为双度量学习(DML),用于图像集分类。所提出的 DML 由两个具有不同目标的渐进式度量学习阶段组成,分别用于特征学习和图像分类。度量学习正则化不仅用于学习强大的特征表示,而且还被很好地探索用于训练有效的分类器。在第一阶段,我们首先通过在隐藏层的神经元上逐层施加度量学习正则化项并同时最小化重构误差来训练有判别力的堆叠自动编码器(DSAE),以获得新的特征映射,其中相似样本彼此靠近映射,不相似样本彼此远离映射。在第二阶段,我们通过优化由分类误差项和度量学习正则化项组成的新目标函数来有判别地训练分类器,并同时微调 DSAE。最后,根据学习到的分类器,设计了两种简单的投票策略用于图像集分类。在实验中,我们在几个常用的数据集上广泛评估了所提出的框架在人脸识别、目标识别和人脸验证任务中的性能,与现有方法相比取得了优异的结果。