
Learning Shape-Invariant Representation for Generalizable Semantic Segmentation.

Authors

Zhang Yuhang, Tian Shishun, Liao Muxin, Hua Guoguang, Zou Wenbin, Xu Chen

Publication information

IEEE Trans Image Process. 2023;32:5031-5045. doi: 10.1109/TIP.2023.3287506. Epub 2023 Sep 8.

DOI:10.1109/TIP.2023.3287506
PMID:37347635
Abstract

Semantic segmentation assigns a category to each pixel and has achieved great success in the supervised setting. However, it fails to generalize well to new domains because of the domain gap. Domain adaptation is a popular way to address this issue, but it needs target data and cannot handle unavailable domains. In domain generalization (DG), the model is trained without target data and aims to generalize well to new, unavailable domains. Recent works reveal that shape recognition is beneficial for generalization, but it remains underexplored in semantic segmentation. Meanwhile, object shapes also differ across domains, which existing works often ignore. Thus, we propose a Shape-Invariant Learning (SIL) framework that focuses on learning shape-invariant representations for better generalization. Specifically, we first define the structural edge, which considers both the object boundary and the inner structure of the object to provide more discriminative cues. Then, a shape perception learning strategy, comprising a texture feature discrepancy reduction loss and a structural feature discrepancy enlargement loss, is proposed to enhance the shape perception ability of the model by embedding the structural edge as a shape prior. Finally, we use shape deformation augmentation to generate samples with the same content but different shapes. Essentially, our SIL framework performs implicit shape distribution alignment at the domain level to learn shape-invariant representations. Extensive experiments show that our SIL framework achieves state-of-the-art performance.
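
The shape deformation augmentation step in the abstract (samples with the same content but different shapes) can be illustrated with a minimal sketch. The abstract does not say which deformation the authors use, so everything below is an assumption for illustration only: the function name `random_shape_deformation`, the parameters `max_shift` and `grid`, and the choice of a smooth random displacement field with nearest-neighbour resampling. The one idea taken from the abstract is that the image and its label map are warped together, so the content stays paired while the object shapes change.

```python
import numpy as np


def random_shape_deformation(image, label, max_shift=4.0, grid=4, seed=None):
    """Warp an image and its label map with one shared smooth random field.

    Hypothetical helper, not the paper's implementation.
    image: (H, W) or (H, W, C) array; label: (H, W) integer array.
    grid must be at least 2.
    """
    rng = np.random.default_rng(seed)
    h, w = label.shape

    # Coarse random displacements (in pixels) on a grid x grid lattice,
    # one lattice for the vertical and one for the horizontal direction.
    coarse = rng.uniform(-max_shift, max_shift, size=(2, grid, grid))

    # Bilinear upsampling of the lattice to full resolution, so that
    # neighbouring pixels move by similar amounts (a smooth deformation).
    ys = np.linspace(0, grid - 1, h)
    xs = np.linspace(0, grid - 1, w)
    y0 = np.clip(np.floor(ys).astype(int), 0, grid - 2)
    x0 = np.clip(np.floor(xs).astype(int), 0, grid - 2)
    fy = (ys - y0)[:, None]
    fx = (xs - x0)[None, :]

    def upsample(c):
        top = c[y0][:, x0] * (1 - fx) + c[y0][:, x0 + 1] * fx
        bottom = c[y0 + 1][:, x0] * (1 - fx) + c[y0 + 1][:, x0 + 1] * fx
        return top * (1 - fy) + bottom * fy

    dy, dx = upsample(coarse[0]), upsample(coarse[1])

    # Nearest-neighbour lookup keeps label ids valid (no blended classes).
    yy, xx = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_y = np.clip(np.rint(yy + dy).astype(int), 0, h - 1)
    src_x = np.clip(np.rint(xx + dx).astype(int), 0, w - 1)

    # The same source coordinates are used for both arrays.
    return image[src_y, src_x], label[src_y, src_x]


# Usage: a toy 32 x 32 scene with one square object.
label = np.zeros((32, 32), dtype=np.int64)
label[8:24, 8:24] = 1
image = np.stack([label * 50] * 3, axis=-1).astype(np.uint8)
warped_image, warped_label = random_shape_deformation(image, label, seed=0)
```

Nearest-neighbour sampling is used here for both arrays to keep the sketch short; a practical pipeline would more likely interpolate the image bilinearly and reserve nearest-neighbour sampling for the labels.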


Similar articles

1
Learning Shape-Invariant Representation for Generalizable Semantic Segmentation.
IEEE Trans Image Process. 2023;32:5031-5045. doi: 10.1109/TIP.2023.3287506. Epub 2023 Sep 8.
2
Invariant Content Representation for Generalizable Medical Image Segmentation.
J Imaging Inform Med. 2024 Dec;37(6):3193-3207. doi: 10.1007/s10278-024-01088-9. Epub 2024 May 17.
3
IAS-NET: Joint intraclassly adaptive GAN and segmentation network for unsupervised cross-domain in neonatal brain MRI segmentation.
Med Phys. 2021 Nov;48(11):6962-6975. doi: 10.1002/mp.15212. Epub 2021 Sep 25.
4
SETA: Semantic-Aware Edge-Guided Token Augmentation for Domain Generalization.
IEEE Trans Image Process. 2024;33:5622-5636. doi: 10.1109/TIP.2024.3470517. Epub 2024 Oct 9.
5
Affinity Space Adaptation for Semantic Segmentation Across Domains.
IEEE Trans Image Process. 2021;30:2549-2561. doi: 10.1109/TIP.2020.3018221. Epub 2021 Feb 5.
6
A Patch Diversity Transformer for Domain Generalized Semantic Segmentation.
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14138-14150. doi: 10.1109/TNNLS.2023.3274760. Epub 2024 Oct 7.
7
Style Uncertainty Based Self-Paced Meta Learning for Generalizable Person Re-Identification.
IEEE Trans Image Process. 2023;32:2107-2119. doi: 10.1109/TIP.2023.3263112.
8
Global and Local Texture Randomization for Synthetic-to-Real Semantic Segmentation.
IEEE Trans Image Process. 2021;30:6594-6608. doi: 10.1109/TIP.2021.3096334. Epub 2021 Jul 21.
9
Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation.
IEEE Trans Image Process. 2024;33:4654-4669. doi: 10.1109/TIP.2024.3444190. Epub 2024 Aug 28.
10
Category-Level Adversarial Adaptation for Semantic Segmentation Using Purified Features.
IEEE Trans Pattern Anal Mach Intell. 2022 Aug;44(8):3940-3956. doi: 10.1109/TPAMI.2021.3064379. Epub 2022 Jul 1.