Li Keqiuyin, Lu Jie, Zuo Hua, Zhang Guangquan
IEEE Trans Cybern. 2024 Apr;54(4):2193-2205. doi: 10.1109/TCYB.2023.3236008. Epub 2024 Mar 18.
Unsupervised multidomain adaptation attracts increasing attention as it delivers richer information when tackling a target task from an unlabeled target domain by leveraging the knowledge attained from labeled source domains. However, it is the quality of training samples, not just the quantity, that influences transfer performance. In this article, we propose a multidomain adaptation method with sample and source distillation (SSD), which develops a two-step selective strategy to distill source samples and define the importance of source domains. To distill samples, the pseudo-labeled target domain is constructed to learn a series of category classifiers to identify transfer and inefficient source samples. To rank domains, the agreements of accepting a target sample as the insider of source domains are estimated by constructing a domain discriminator based on selected transfer source samples. Using the selected samples and ranked domains, transfer from source domains to the target domain is achieved by adapting multilevel distributions in a latent feature space. Furthermore, to explore more usable target information which is expected to enhance the performance across domains of source predictors, an enhancement mechanism is built by matching selected pseudo-labeled and unlabeled target samples. The degrees of acceptance learned by the domain discriminator are finally employed as source merging weights to predict the target task. Superiority of the proposed SSD is validated on real-world visual classification tasks.
无监督多域适应越来越受到关注,因为它通过利用从有标签的源域获得的知识,在处理来自无标签目标域的目标任务时能提供更丰富的信息。然而,影响迁移性能的是训练样本的质量,而非数量。在本文中,我们提出了一种带有样本和源蒸馏的多域适应方法(SSD),该方法制定了一种两步选择策略来蒸馏源样本并定义源域的重要性。为了蒸馏样本,构建伪标签目标域以学习一系列类别分类器,从而识别迁移和低效的源样本。为了对域进行排序,通过基于选定的迁移源样本构建域判别器,估计将目标样本接受为源域内部样本的一致性。利用选定的样本和排序后的域,通过在潜在特征空间中适配多级分布,实现从源域到目标域的迁移。此外,为了探索更多可用的目标信息,期望这些信息能提高源预测器跨域的性能,通过匹配选定的伪标签和未标记的目标样本构建了一种增强机制。域判别器学习到的接受度最终被用作源合并权重来预测目标任务。所提出的SSD的优越性在实际视觉分类任务中得到了验证。