IEEE Trans Image Process. 2017 Sep;26(9):4297-4310. doi: 10.1109/TIP.2017.2698918. Epub 2017 Apr 27.
Relation learning is a fundamental problem in many vision tasks. Recently, high-order Boltzmann machine and its variants have shown their great potentials in learning various types of data relation in a range of tasks. But most of these models are learned in an unsupervised way, i.e., without using relation class labels, which are not very discriminative for some challenging tasks, e.g., face verification. In this paper, with the goal to perform supervised relation learning, we introduce relation class labels into conventional high-order multiplicative interactions with pairwise input samples, and propose a conditional high-order Boltzmann Machine (CHBM), which can learn to classify the data relation in a binary classification way. To be able to deal with more complex data relation, we develop two improved variants of CHBM: 1) latent CHBM, which jointly performs relation feature learning and classification, by using a set of latent variables to block the pathway from pairwise input samples to output relation labels and 2) gated CHBM, which untangles factors of variation in data relation, by exploiting a set of latent variables to multiplicatively gate the classification of CHBM. To reduce the large number of model parameters generated by the multiplicative interactions, we approximately factorize high-order parameter tensors into multiple matrices. Then, we develop efficient supervised learning algorithms, by first pretraining the models using joint likelihood to provide good parameter initialization, and then finetuning them using conditional likelihood to enhance the discriminant ability. We apply the proposed models to a series of tasks including invariant recognition, face verification, and action similarity labeling. Experimental results demonstrate that by exploiting supervised relation labels, our models can greatly improve the performance.
关系学习是许多视觉任务中的一个基本问题。最近,高阶玻尔兹曼机及其变体在学习各种类型的数据关系方面表现出了巨大的潜力,应用于许多任务。但是,这些模型中的大多数都是以无监督的方式学习的,即不使用关系类别标签,而对于一些具有挑战性的任务(例如人脸识别)来说,这些标签的区分能力不是很强。在本文中,为了进行有监督的关系学习,我们将关系类别标签引入到常规的高次乘法交互作用中,并且使用成对的输入样本,提出了一种条件高阶玻尔兹曼机(CHBM),它可以通过二元分类的方式学习对数据关系进行分类。为了能够处理更复杂的数据关系,我们开发了 CHBM 的两种改进变体:1)潜在 CHBM,通过使用一组潜在变量来阻塞从成对输入样本到输出关系标签的通路,从而共同执行关系特征学习和分类;2)门控 CHBM,通过利用一组潜在变量对 CHBM 的分类进行乘法门控,从而解开数据关系中的变化因素。为了减少由乘法交互作用产生的大量模型参数,我们将高阶参数张量近似分解为多个矩阵。然后,我们通过首先使用联合似然来预训练模型以提供良好的参数初始化,然后使用条件似然来微调它们以增强判别能力,从而开发了有效的监督学习算法。我们将所提出的模型应用于一系列任务,包括不变性识别、人脸识别和动作相似性标记。实验结果表明,通过利用有监督的关系标签,我们的模型可以大大提高性能。