Kouw Wouter M, Loog Marco
IEEE Trans Pattern Anal Mach Intell. 2021 Mar;43(3):766-785. doi: 10.1109/TPAMI.2019.2945942. Epub 2021 Feb 4.
Domain adaptation has become a prominent problem setting in machine learning and related fields. This review asks the question: How can a classifier learn from a source domain and generalize to a target domain? We present a categorization of approaches, divided into, what we refer to as, sample-based, feature-based, and inference-based methods. Sample-based methods focus on weighting individual observations during training based on their importance to the target domain. Feature-based methods revolve around on mapping, projecting, and representing features such that a source classifier performs well on the target domain and inference-based methods incorporate adaptation into the parameter estimation procedure, for instance through constraints on the optimization procedure. Additionally, we review a number of conditions that allow for formulating bounds on the cross-domain generalization error. Our categorization highlights recurring ideas and raises questions important to further research.
域适应已成为机器学习及相关领域中一个突出的问题设定。本综述提出了这样一个问题:分类器如何从源域学习并推广到目标域?我们对方法进行了分类,分为我们所谓的基于样本、基于特征和基于推理的方法。基于样本的方法侧重于在训练期间根据各个观测值对目标域的重要性对其进行加权。基于特征的方法围绕特征的映射、投影和表示展开,以使源分类器在目标域上表现良好,而基于推理的方法则将适应纳入参数估计过程,例如通过对优化过程施加约束。此外,我们还综述了一些能够对跨域泛化误差制定边界的条件。我们的分类突出了反复出现的观点,并提出了对进一步研究很重要的问题。