Wang Hongsong, Liao Shengcai, Shao Ling
IEEE Trans Image Process. 2021;30:4046-4056. doi: 10.1109/TIP.2021.3066046. Epub 2021 Apr 7.
Unsupervised domain adaptation for object detection is a challenging problem with many real-world applications. Unfortunately, it has received much less attention than supervised object detection. Models that try to address this task tend to suffer from a shortage of annotated training samples. Moreover, existing methods of feature alignments are not sufficient to learn domain-invariant representations. To address these limitations, we propose a novel augmented feature alignment network (AFAN) which integrates intermediate domain image generation and domain-adversarial training into a unified framework. An intermediate domain image generator is proposed to enhance feature alignments by domain-adversarial training with automatically generated soft domain labels. The synthetic intermediate domain images progressively bridge the domain divergence and augment the annotated source domain training data. A feature pyramid alignment is designed and the corresponding feature discriminator is used to align multi-scale convolutional features of different semantic levels. Last but not least, we introduce a region feature alignment and an instance discriminator to learn domain-invariant features for object proposals. Our approach significantly outperforms the state-of-the-art methods on standard benchmarks for both similar and dissimilar domain adaptations. Further extensive experiments verify the effectiveness of each component and demonstrate that the proposed network can learn domain-invariant representations.
用于目标检测的无监督域适应是一个具有许多实际应用的挑战性问题。不幸的是,它受到的关注远少于有监督的目标检测。试图解决此任务的模型往往缺乏带注释的训练样本。此外,现有的特征对齐方法不足以学习域不变表示。为了解决这些限制,我们提出了一种新颖的增强特征对齐网络(AFAN),它将中间域图像生成和域对抗训练集成到一个统一框架中。提出了一种中间域图像生成器,通过使用自动生成的软域标签进行域对抗训练来增强特征对齐。合成的中间域图像逐步弥合域差异并扩充带注释的源域训练数据。设计了一种特征金字塔对齐,并使用相应的特征判别器来对齐不同语义级别的多尺度卷积特征。最后但同样重要的是,我们引入了区域特征对齐和实例判别器,以学习目标提议的域不变特征。在相似和不相似域适应的标准基准上,我们的方法显著优于当前的先进方法。进一步的广泛实验验证了每个组件的有效性,并表明所提出的网络可以学习域不变表示。