Zhang Lei, Zhang David
IEEE Trans Image Process. 2016 Oct;25(10):4959-4973. doi: 10.1109/TIP.2016.2598679. Epub 2016 Aug 10.
We address the problem of visual knowledge adaptation by leveraging labeled patterns from source domain and a very limited number of labeled instances in target domain to learn a robust classifier for visual categorization. This paper proposes a new extreme learning machine based cross-domain network learning framework, that is called Extreme Learning Machine (ELM) based Domain Adaptation (EDA). It allows us to learn a category transformation and an ELM classifier with random projection by minimizing the -norm of the network output weights and the learning error simultaneously. The unlabeled target data, as useful knowledge, is also integrated as a fidelity term to guarantee the stability during cross domain learning. It minimizes the matching error between the learned classifier and a base classifier, such that many existing classifiers can be readily incorporated as base classifiers. The network output weights cannot only be analytically determined, but also transferrable. Additionally, a manifold regularization with Laplacian graph is incorporated, such that it is beneficial to semi-supervised learning. Extensively, we also propose a model of multiple views, referred as MvEDA. Experiments on benchmark visual datasets for video event recognition and object recognition, demonstrate that our EDA methods outperform existing cross-domain learning methods.
我们通过利用源域的标记模式和目标域中非常有限数量的标记实例来解决视觉知识适应问题,以学习用于视觉分类的强大分类器。本文提出了一种基于极端学习机的跨域网络学习框架,即基于极端学习机(ELM)的域适应(EDA)。它允许我们通过同时最小化网络输出权重的 -范数和学习误差,来学习具有随机投影的类别变换和ELM分类器。未标记的目标数据作为有用知识,也被整合为一个保真项,以保证跨域学习期间的稳定性。它最小化了学习到的分类器与基础分类器之间的匹配误差,使得许多现有的分类器可以很容易地被用作基础分类器。网络输出权重不仅可以通过解析确定,而且具有可转移性。此外,还引入了带有拉普拉斯图的流形正则化,这有利于半监督学习。广泛地,我们还提出了一种多视图模型,称为MvEDA。在用于视频事件识别和目标识别的基准视觉数据集上的实验表明,我们的EDA方法优于现有的跨域学习方法。