IEEE Trans Neural Netw Learn Syst. 2018 Feb;29(2):310-323. doi: 10.1109/TNNLS.2016.2618765. Epub 2016 Nov 14.
Transfer learning is generally exploited to adapt well-established source knowledge for learning tasks in weakly labeled or unlabeled target domain. Nowadays, it is common to see multiple sources available for knowledge transfer, each of which, however, may not include complete classes information of the target domain. Naively merging multiple sources together would lead to inferior results due to the large divergence among multiple sources. In this paper, we attempt to utilize incomplete multiple sources for effective knowledge transfer to facilitate the learning task in target domain. To this end, we propose an incomplete multisource transfer learning through two directional knowledge transfer, i.e., cross-domain transfer from each source to target, and cross-source transfer. In particular, in cross-domain direction, we deploy latent low-rank transfer learning guided by iterative structure learning to transfer knowledge from each single source to target domain. This practice reinforces to compensate for any missing data in each source by the complete target data. While in cross-source direction, unsupervised manifold regularizer and effective multisource alignment are explored to jointly compensate for missing data from one portion of source to another. In this way, both marginal and conditional distribution discrepancy in two directions would be mitigated. Experimental results on standard cross-domain benchmarks and synthetic data sets demonstrate the effectiveness of our proposed model in knowledge transfer from incomplete multiple sources.
迁移学习通常被用来利用已建立的源知识来适应弱标注或无标注的目标域中的学习任务。如今,常见的情况是有多个源可供知识转移,但是每个源可能都不包含目标域的完整类信息。由于多个源之间存在较大的差异,因此简单地将多个源合并在一起会导致结果不佳。在本文中,我们尝试利用不完整的多个源进行有效的知识转移,以促进目标域中的学习任务。为此,我们提出了一种通过双向知识转移的不完整多源迁移学习,即从每个源到目标的跨域转移,以及跨源转移。具体来说,在跨域方向上,我们通过迭代结构学习指导的潜在低秩迁移学习来从每个源向目标域转移知识。这种做法通过完整的目标数据来强化对每个源中任何缺失数据的补偿。而在跨源方向上,我们探索了无监督流形正则化和有效的多源对齐,以共同补偿来自源的一部分到另一部分的缺失数据。通过这种方式,两个方向上的边缘和条件分布差异都将得到缓解。在标准的跨域基准和合成数据集上的实验结果表明,我们提出的模型在从不完整的多个源进行知识转移方面是有效的。