• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于感知一致性匹配的视频域自适应语义分割。

Video domain adaptation for semantic segmentation using perceptual consistency matching.

机构信息

Department of Robotics and Mechatronics Engineering, Daegu Gyeongbuk Institute of Science and Technology (DGIST), Daegu, South Korea; Division of Intelligent Robotics, Daegu Gyeongbuk Institute of Science and Technology (DGIST), Daegu, South Korea.

Department of Robotics and Mechatronics Engineering, Daegu Gyeongbuk Institute of Science and Technology (DGIST), Daegu, South Korea.

出版信息

Neural Netw. 2024 Nov;179:106505. doi: 10.1016/j.neunet.2024.106505. Epub 2024 Jul 3.

DOI:10.1016/j.neunet.2024.106505
PMID:39002205
Abstract

Unsupervised domain adaptation (UDA) aims to transfer knowledge in previous and related labeled datasets (sources) to a new unlabeled dataset (target). Despite the impressive performance, existing approaches have largely focused on image-based UDA only, and video-based UDA has been relatively understudied and received less attention due to the difficulty of adapting diverse modal video features and modeling temporal associations efficiently. To address this, existing studies use optical flow to capture motion cues between in-domain consecutive frames, but is limited by heavy compute requirements and modeling flow patterns across diverse domains is equally challenging. In this work, we propose an adversarial domain adaptation approach for video semantic segmentation that aims to align temporally associated pixels in successive source and target domain frames without relying on optical flow. Specifically, we introduce a Perceptual Consistency Matching (PCM) strategy that leverages perceptual similarity to identify pixels with high correlation across consecutive frames, and infer that such pixels should correspond to the same class. Therefore, we can enhance prediction accuracy for video-UDA by enforcing consistency not only between in-domain frames, but across domains using PCM objectives during model training. Extensive experiments on public datasets show the benefit of our approach over existing state-of-the-art UDA methods. Our approach not only addresses a crucial task in video domain adaptation but also offers notable improvements in performance with faster inference times.

摘要

无监督领域自适应 (UDA) 的目标是将先前和相关的带标签数据集 (源) 中的知识转移到新的未标记数据集 (目标)。尽管现有方法取得了令人印象深刻的性能,但它们主要集中在基于图像的 UDA 上,而基于视频的 UDA 则相对研究较少,受到的关注也较少,因为难以有效地适应不同模态的视频特征和建模时间关联。为了解决这个问题,现有的研究使用光流来捕捉域内连续帧之间的运动线索,但受到计算要求高的限制,跨不同领域建模流模式也同样具有挑战性。在这项工作中,我们提出了一种用于视频语义分割的对抗性领域自适应方法,旨在在不依赖光流的情况下对齐源域和目标域连续帧中的时间关联像素。具体来说,我们引入了一种感知一致性匹配 (PCM) 策略,该策略利用感知相似性来识别连续帧之间具有高相关性的像素,并推断出这些像素应该对应于同一类。因此,我们可以通过在模型训练期间使用 PCM 目标在域内和跨域之间不仅强制一致性,从而提高视频-UDA 的预测准确性。在公共数据集上的广泛实验表明,我们的方法优于现有的最先进的 UDA 方法。我们的方法不仅解决了视频领域自适应中的关键任务,而且还提高了性能,同时具有更快的推理时间。

相似文献

1
Video domain adaptation for semantic segmentation using perceptual consistency matching.基于感知一致性匹配的视频域自适应语义分割。
Neural Netw. 2024 Nov;179:106505. doi: 10.1016/j.neunet.2024.106505. Epub 2024 Jul 3.
2
Dual domain distribution disruption with semantics preservation: Unsupervised domain adaptation for medical image segmentation.双域分布破坏与语义保持:医学图像分割的无监督域自适应。
Med Image Anal. 2024 Oct;97:103275. doi: 10.1016/j.media.2024.103275. Epub 2024 Jul 14.
3
Unsupervised domain adaptive building semantic segmentation network by edge-enhanced contrastive learning.基于边缘增强对比学习的无监督领域自适应建筑语义分割网络。
Neural Netw. 2024 Nov;179:106581. doi: 10.1016/j.neunet.2024.106581. Epub 2024 Jul 30.
4
Unsupervised domain adaptive segmentation algorithm based on two-level category alignment.基于两级类别对齐的无监督领域自适应分割算法。
Neural Netw. 2024 Sep;177:106399. doi: 10.1016/j.neunet.2024.106399. Epub 2024 May 20.
5
A bidirectional multilayer contrastive adaptation network with anatomical structure preservation for unpaired cross-modality medical image segmentation.一种具有解剖结构保持的双向多层对比适应网络,用于非配对跨模态医学图像分割。
Comput Biol Med. 2022 Oct;149:105964. doi: 10.1016/j.compbiomed.2022.105964. Epub 2022 Aug 19.
6
IAS-NET: Joint intraclassly adaptive GAN and segmentation network for unsupervised cross-domain in neonatal brain MRI segmentation.IAS-NET:用于新生儿脑 MRI 分割的无监督跨领域的联合类内自适应 GAN 和分割网络。
Med Phys. 2021 Nov;48(11):6962-6975. doi: 10.1002/mp.15212. Epub 2021 Sep 25.
7
Emotional Semantics-Preserved and Feature-Aligned CycleGAN for Visual Emotion Adaptation.情感语义保留和特征对齐的循环生成对抗网络用于视觉情感适应。
IEEE Trans Cybern. 2022 Oct;52(10):10000-10013. doi: 10.1109/TCYB.2021.3062750. Epub 2022 Sep 19.
8
C -GAN: Content-consistent generative adversarial networks for unsupervised domain adaptation in medical image segmentation.C-GAN:用于医学图像分割中无监督域自适应的内容一致生成对抗网络。
Med Phys. 2022 Oct;49(10):6491-6504. doi: 10.1002/mp.15944. Epub 2022 Aug 27.
9
Birds of a Feather Flock Together: Category-Divergence Guidance for Domain Adaptive Segmentation.物以类聚:用于领域自适应分割的类别分歧引导。
IEEE Trans Image Process. 2022;31:2878-2892. doi: 10.1109/TIP.2022.3162471. Epub 2022 Apr 8.
10
Unsupervised model adaptation for source-free segmentation of medical images.用于医学图像无源分割的无监督模型自适应
Med Image Anal. 2024 Jul;95:103179. doi: 10.1016/j.media.2024.103179. Epub 2024 Apr 14.