Wilson Garrett, Doppa Janardhan Rao, Cook Diane J
IEEE Trans Pattern Anal Mach Intell. 2023 Dec;45(12):14208-14221. doi: 10.1109/TPAMI.2023.3298346. Epub 2023 Nov 6.
Unsupervised domain adaptation (UDA) provides a strategy for improving machine learning performance in data-rich (target) domains where ground truth labels are inaccessible but can be found in related (source) domains. In cases where meta-domain information such as label distributions is available, weak supervision can further boost performance. We propose a novel framework, CALDA, to tackle these two problems. CALDA synergistically combines the principles of contrastive learning and adversarial learning to robustly support multi-source UDA (MS-UDA) for time series data. Similar to prior methods, CALDA utilizes adversarial learning to align source and target feature representations. Unlike prior approaches, CALDA additionally leverages cross-source label information across domains. CALDA pulls examples with the same label close to each other, while pushing apart examples with different labels, reshaping the space through contrastive learning. Unlike prior contrastive adaptation methods, CALDA requires neither data augmentation nor pseudo labeling, which may be more challenging for time series. We empirically validate our proposed approach. Based on results from human activity recognition, electromyography, and synthetic datasets, we find utilizing cross-source information improves performance over prior time series and contrastive methods. Weak supervision further improves performance, even in the presence of noise, allowing CALDA to offer generalizable strategies for MS-UDA.
无监督域适应(UDA)提供了一种策略,用于在数据丰富的(目标)域中提高机器学习性能,在这些域中无法获取真实标签,但可以在相关的(源)域中找到。在诸如标签分布等元域信息可用的情况下,弱监督可以进一步提高性能。我们提出了一种新颖的框架CALDA来解决这两个问题。CALDA将对比学习和对抗学习的原理协同结合起来,以强有力地支持时间序列数据的多源UDA(MS-UDA)。与先前的方法类似,CALDA利用对抗学习来对齐源域和目标域的特征表示。与先前的方法不同,CALDA还利用跨域的跨源标签信息。CALDA将具有相同标签的示例拉近,同时将具有不同标签的示例推开,通过对比学习重塑空间。与先前的对比适应方法不同,CALDA既不需要数据增强也不需要伪标签,这对于时间序列来说可能更具挑战性。我们通过实验验证了我们提出的方法。基于人类活动识别、肌电图和合成数据集的结果,我们发现利用跨源信息比先前的时间序列和对比方法性能更优。即使存在噪声,弱监督也能进一步提高性能,这使得CALDA能够为MS-UDA提供可推广的策略。