Qin Tiexin, Wang Shiqi, Li Haoliang
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14514-14527. doi: 10.1109/TPAMI.2023.3319984. Epub 2023 Nov 3.
Domain generalization (DG) refers to the problem of generalizing machine learning systems to out-of-distribution (OOD) data with knowledge learned from several provided source domains. Most prior works confine themselves to stationary and discrete environments to tackle such generalization issue arising from OOD data. However, in practice, many tasks in non-stationary environments (e.g., autonomous-driving car system, sensor measurement) involve more complex and continuously evolving domain drift, emerging new challenges for model deployment. In this paper, we first formulate this setting as the problem of evolving domain generalization. To deal with the continuously changing domains, we propose MMD-LSAE, a novel framework that learns to capture the evolving patterns among domains for better generalization. Specifically, MMD-LSAE characterizes OOD data in non-stationary environments with two types of distribution shifts: covariate shift and concept shift, and employs deep autoencoder modules to infer their dynamics in latent space separately. In these modules, the inferred posterior distributions of latent codes are optimized to align with their corresponding prior distributions via minimizing maximum mean discrepancy (MMD). We theoretically verify that MMD-LSAE has the inherent capability to implicitly facilitate mutual information maximization, which can promote superior representation learning and improved generalization of the model. Furthermore, the experimental results on both synthetic and real-world datasets show that our proposed approach can consistently achieve favorable performance based on the evolving domain generalization setting.
域泛化(DG)指的是利用从几个给定源域学到的知识,将机器学习系统泛化到分布外(OOD)数据的问题。大多数先前的工作将自身局限于静态和离散环境,以解决由OOD数据引发的此类泛化问题。然而,在实际中,非静态环境中的许多任务(例如,自动驾驶汽车系统、传感器测量)涉及更复杂且不断演变的域漂移,给模型部署带来了新的挑战。在本文中,我们首先将此设置表述为演化域泛化问题。为了应对不断变化的域,我们提出了MMD-LSAE,这是一个新颖的框架,它学习捕捉域之间的演化模式以实现更好的泛化。具体而言,MMD-LSAE用两种类型的分布偏移来表征非静态环境中的OOD数据:协变量偏移和概念偏移,并采用深度自动编码器模块分别推断它们在潜在空间中的动态。在这些模块中,通过最小化最大均值差异(MMD),潜在代码的推断后验分布被优化以与它们相应的先验分布对齐。我们从理论上验证了MMD-LSAE具有隐式促进互信息最大化的内在能力,这可以促进卓越的表示学习并改善模型的泛化。此外,在合成数据集和真实世界数据集上的实验结果表明,我们提出的方法在演化域泛化设置下能够始终如一地实现良好的性能。