• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于扩展语义分割数据集的无监督类别生成

Unsupervised Class Generation to Expand Semantic Segmentation Datasets.

作者信息

Montalvo Javier, García-Martín Álvaro, Carballeira Pablo, SanMiguel Juan C

机构信息

Video Processing and Understanding Lab, Escuela Politécnica Superior, Universidad Autónoma de Madrid, 28049 Madrid, Spain.

出版信息

J Imaging. 2025 May 22;11(6):172. doi: 10.3390/jimaging11060172.

DOI:10.3390/jimaging11060172
PMID:40558771
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12194140/
Abstract

Semantic segmentation is a computer vision task where classification is performed at the pixel level. Due to this, the process of labeling images for semantic segmentation is time-consuming and expensive. To mitigate this cost there has been a surge in the use of synthetically generated data-usually created using simulators or videogames-which, in combination with domain adaptation methods, can effectively learn how to segment real data. Still, these datasets have a particular limitation: due to their closed-set nature, it is not possible to include novel classes without modifying the tool used to generate them, which is often not public. Concurrently, generative models have made remarkable progress, particularly with the introduction of diffusion models, enabling the creation of high-quality images from text prompts without additional supervision. In this work, we propose an unsupervised pipeline that leverages Stable Diffusion and Segment Anything Module to generate class examples with an associated segmentation mask, and a method to integrate generated cutouts for novel classes in semantic segmentation datasets, all with minimal user input. Our approach aims to improve the performance of unsupervised domain adaptation methods by introducing novel samples into the training data without modifications to the underlying algorithms. With our methods, we show how models can not only effectively learn how to segment novel classes, with an average performance of 51% intersection over union for novel classes, but also reduce errors for other, already existing classes, reaching a higher performance level overall.

摘要

语义分割是一种计算机视觉任务,在像素级别上进行分类。因此,为语义分割标注图像的过程既耗时又昂贵。为了降低这种成本,合成生成数据的使用激增,这些数据通常是使用模拟器或视频游戏创建的,结合域适应方法,可以有效地学习如何分割真实数据。然而,这些数据集有一个特殊的局限性:由于它们的封闭集性质,如果不修改用于生成它们的工具(而这些工具通常不是公开的),就不可能包含新的类别。与此同时,生成模型取得了显著进展,特别是随着扩散模型的引入,能够在没有额外监督的情况下根据文本提示创建高质量图像。在这项工作中,我们提出了一种无监督的流程,利用Stable Diffusion和分割一切模型(Segment Anything Module)来生成带有相关分割掩码的类别示例,以及一种将生成的新类别抠图集成到语义分割数据集中的方法,所有这些都只需最少的用户输入。我们的方法旨在通过在不修改底层算法的情况下将新样本引入训练数据来提高无监督域适应方法的性能。通过我们的方法,我们展示了模型不仅可以有效地学习如何分割新类别,新类别的平均交并比性能达到51%,而且还可以减少其他现有类别的错误,总体上达到更高的性能水平。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/aec88ee834ec/jimaging-11-00172-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/8078f44755ec/jimaging-11-00172-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/4be22517bf1a/jimaging-11-00172-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/b9edc3d0c55d/jimaging-11-00172-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/551a5614dd2d/jimaging-11-00172-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/c1a4c47cfa2b/jimaging-11-00172-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/aec88ee834ec/jimaging-11-00172-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/8078f44755ec/jimaging-11-00172-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/4be22517bf1a/jimaging-11-00172-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/b9edc3d0c55d/jimaging-11-00172-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/551a5614dd2d/jimaging-11-00172-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/c1a4c47cfa2b/jimaging-11-00172-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d47f/12194140/aec88ee834ec/jimaging-11-00172-g006.jpg

相似文献

1
Unsupervised Class Generation to Expand Semantic Segmentation Datasets.用于扩展语义分割数据集的无监督类别生成
J Imaging. 2025 May 22;11(6):172. doi: 10.3390/jimaging11060172.
2
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of paclitaxel, docetaxel, gemcitabine and vinorelbine in non-small-cell lung cancer.对紫杉醇、多西他赛、吉西他滨和长春瑞滨在非小细胞肺癌中的临床疗效和成本效益进行的快速系统评价。
Health Technol Assess. 2001;5(32):1-195. doi: 10.3310/hta5320.
3
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
4
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.
5
Stigma Management Strategies of Autistic Social Media Users.自闭症社交媒体用户的污名管理策略
Autism Adulthood. 2025 May 28;7(3):273-282. doi: 10.1089/aut.2023.0095. eCollection 2025 Jun.
6
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
7
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
8
Direct composite resin fillings versus amalgam fillings for permanent posterior teeth.直接复合树脂充填与银汞合金充填用于永久性后牙。
Cochrane Database Syst Rev. 2021 Aug 13;8(8):CD005620. doi: 10.1002/14651858.CD005620.pub3.
9
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.
10
Electronic cigarettes for smoking cessation.电子烟戒烟。
Cochrane Database Syst Rev. 2022 Nov 17;11(11):CD010216. doi: 10.1002/14651858.CD010216.pub7.

引用本文的文献

1
A Review on Deep Learning Methods for Glioma Segmentation, Limitations, and Future Perspectives.胶质瘤分割的深度学习方法、局限性及未来展望综述
J Imaging. 2025 Aug 11;11(8):269. doi: 10.3390/jimaging11080269.