Wei Pengfei, Ke Yiping, Goh Chi Keong
IEEE Trans Neural Netw Learn Syst. 2019 May;30(5):1321-1334. doi: 10.1109/TNNLS.2018.2868709. Epub 2018 Sep 27.
Marginalized stacked denoising autoencoder (mSDA), has recently emerged with demonstrated effectiveness in domain adaptation. In this paper, we investigate the rationale for why mSDA benefits domain adaptation tasks from the perspective of adaptive regularization. Our investigations focus on two types of feature corruption noise: Gaussian noise (mSDA ) and Bernoulli dropout noise (mSDA ). Both theoretical and empirical results demonstrate that mSDA successfully boosts the adaptation performance but mSDA fails to do so. We then propose a new mSDA with data-dependent multinomial dropout noise (mSDA ) that overcomes the limitations of mSDA and further improves the adaptation performance. Our mSDA is based on a more realistic assumption: different features are correlated and, thus, should be corrupted with different probabilities. Experimental results demonstrate the superiority of mSDA to mSDA on the adaptation performance and the convergence speed. Finally, we propose a deep transferable feature coding (DTFC) framework for unsupervised domain adaptation. The motivation of DTFC is that mSDA fails to consider the distribution discrepancy across different domains in the feature learning process. We introduce a new element to mSDA: domain divergence minimization by maximum mean discrepancy. This element is essential for domain adaptation as it ensures the extracted deep features to have a small distribution discrepancy. The effectiveness of DTFC is verified by extensive experiments on three benchmark data sets for both Bernoulli dropout noise and multinomial dropout noise.
边缘化堆叠去噪自编码器(mSDA)最近已出现,并在域适应方面展现出有效性。在本文中,我们从自适应正则化的角度研究mSDA有利于域适应任务的基本原理。我们的研究聚焦于两种类型的特征损坏噪声:高斯噪声(mSDA )和伯努利随机失活噪声(mSDA )。理论和实证结果均表明,mSDA 成功提升了适应性能,但mSDA 却未能做到。然后,我们提出了一种带有数据依赖多项随机失活噪声的新型mSDA(mSDA ),它克服了mSDA 的局限性,并进一步提高了适应性能。我们的mSDA 基于一个更现实的假设:不同特征是相关的,因此应以不同概率被损坏。实验结果证明了mSDA 在适应性能和收敛速度方面优于mSDA 。最后,我们提出了一种用于无监督域适应的深度可迁移特征编码(DTFC)框架。DTFC的动机在于mSDA在特征学习过程中未能考虑不同域之间的分布差异。我们在mSDA中引入了一个新元素:通过最大均值差异最小化域差异。该元素对于域适应至关重要,因为它确保所提取的深度特征具有较小的分布差异。通过在三个基准数据集上针对伯努利随机失活噪声和多项随机失活噪声进行的广泛实验,验证了DTFC的有效性。