• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SinGAN-Seg:用于医学图像分割的合成训练数据生成。

SinGAN-Seg: Synthetic training data generation for medical image segmentation.

机构信息

SimulaMet, Oslo, Norway.

Oslo Metropolitan University, Oslo, Norway.

出版信息

PLoS One. 2022 May 2;17(5):e0267976. doi: 10.1371/journal.pone.0267976. eCollection 2022.

DOI:10.1371/journal.pone.0267976
PMID:35500005
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9060378/
Abstract

Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Therefore, artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the data used to train them. Large amounts of data can be difficult to obtain in medicine due to privacy reasons, expensive and time-consuming annotations, and a general lack of data samples for infrequent lesions. In this study, we present a novel synthetic data generation pipeline, called SinGAN-Seg, to produce synthetic medical images with corresponding masks using a single training image. Our method is different from the traditional generative adversarial networks (GANs) because our model needs only a single image and the corresponding ground truth to train. We also show that the synthetic data generation pipeline can be used to produce alternative artificial segmentation datasets with corresponding ground truth masks when real datasets are not allowed to share. The pipeline is evaluated using qualitative and quantitative comparisons between real data and synthetic data to show that the style transfer technique used in our pipeline significantly improves the quality of the generated data and our method is better than other state-of-the-art GANs to prepare synthetic images when the size of training datasets are limited. By training UNet++ using both real data and the synthetic data generated from the SinGAN-Seg pipeline, we show that the models trained on synthetic data have very close performances to those trained on real data when both datasets have a considerable amount of training data. In contrast, we show that synthetic data generated from the SinGAN-Seg pipeline improves the performance of segmentation models when training datasets do not have a considerable amount of data. All experiments were performed using an open dataset and the code is publicly available on GitHub.

摘要

分析医学数据以发现异常是一项耗时且昂贵的任务,尤其是对于罕见的异常情况,需要医学专家付出巨大的努力。因此,人工智能已成为医学数据自动处理的流行工具,可作为医生的辅助工具。但是,用于构建这些工具的机器学习模型高度依赖于用于训练它们的数据。由于隐私原因、昂贵且耗时的注释以及罕见病变的一般缺乏数据样本,医学中大量数据难以获取。在这项研究中,我们提出了一种新颖的合成数据生成管道,称为 SinGAN-Seg,可使用单个训练图像生成具有相应掩模的合成医学图像。我们的方法与传统的生成对抗网络(GAN)不同,因为我们的模型仅需要单个图像和相应的地面实况即可进行训练。我们还表明,当不允许共享真实数据集时,合成数据生成管道可用于生成具有相应地面实况掩模的替代人工分割数据集。通过在真实数据和合成数据之间进行定性和定量比较来评估该管道,以表明我们的管道中使用的样式转移技术可显著提高生成数据的质量,并且当训练数据集较小时,我们的方法比其他最先进的 GAN 更适合准备合成图像。通过在真实数据和从 SinGAN-Seg 管道生成的合成数据上同时训练 UNet++,我们表明,当两个数据集都具有相当数量的训练数据时,在合成数据上训练的模型的性能与在真实数据上训练的模型非常接近。相比之下,我们表明,当训练数据集没有相当数量的数据时,从 SinGAN-Seg 管道生成的合成数据可以提高分割模型的性能。所有实验均使用公开数据集进行,代码可在 GitHub 上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/d8a636c71397/pone.0267976.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/64086d5fb81a/pone.0267976.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/0a97c9dcebef/pone.0267976.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/29797fe0ebc6/pone.0267976.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/1d3bde01c1e3/pone.0267976.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/5942c87a7609/pone.0267976.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/f91e59d124bd/pone.0267976.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/9c5586fd513f/pone.0267976.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/47c741c96dc6/pone.0267976.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/73363fe56b1c/pone.0267976.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/0634a6222e4b/pone.0267976.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/62aee20ea6c0/pone.0267976.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/7bd0d776b21d/pone.0267976.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/d8a636c71397/pone.0267976.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/64086d5fb81a/pone.0267976.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/0a97c9dcebef/pone.0267976.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/29797fe0ebc6/pone.0267976.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/1d3bde01c1e3/pone.0267976.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/5942c87a7609/pone.0267976.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/f91e59d124bd/pone.0267976.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/9c5586fd513f/pone.0267976.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/47c741c96dc6/pone.0267976.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/73363fe56b1c/pone.0267976.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/0634a6222e4b/pone.0267976.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/62aee20ea6c0/pone.0267976.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/7bd0d776b21d/pone.0267976.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/95db/9060378/d8a636c71397/pone.0267976.g013.jpg

相似文献

1
SinGAN-Seg: Synthetic training data generation for medical image segmentation.SinGAN-Seg:用于医学图像分割的合成训练数据生成。
PLoS One. 2022 May 2;17(5):e0267976. doi: 10.1371/journal.pone.0267976. eCollection 2022.
2
Image generation by GAN and style transfer for agar plate image segmentation.基于 GAN 和风格迁移的琼脂平板图像分割的图像生成。
Comput Methods Programs Biomed. 2020 Feb;184:105268. doi: 10.1016/j.cmpb.2019.105268. Epub 2019 Dec 17.
3
Generative Adversarial Network for Medical Images (MI-GAN).生成对抗网络在医学图像上的应用(MI-GAN)。
J Med Syst. 2018 Oct 12;42(11):231. doi: 10.1007/s10916-018-1072-9.
4
Generating Synthetic Labeled Data From Existing Anatomical Models: An Example With Echocardiography Segmentation.从现有解剖模型生成合成标记数据:以心脏超声分割为例。
IEEE Trans Med Imaging. 2021 Oct;40(10):2783-2794. doi: 10.1109/TMI.2021.3051806. Epub 2021 Sep 30.
5
SpeckleGAN: a generative adversarial network with an adaptive speckle layer to augment limited training data for ultrasound image processing.SpeckleGAN:一种具有自适应散斑层的生成对抗网络,用于扩充有限的超声图像处理训练数据。
Int J Comput Assist Radiol Surg. 2020 Sep;15(9):1427-1436. doi: 10.1007/s11548-020-02203-1. Epub 2020 Jun 18.
6
Improved automatic detection of herpesvirus secondary envelopment stages in electron microscopy by augmenting training data with synthetic labelled images generated by a generative adversarial network.通过使用生成对抗网络生成的合成标记图像来扩充训练数据,提高电子显微镜中疱疹病毒二次包膜阶段的自动检测。
Cell Microbiol. 2021 Feb;23(2):e13280. doi: 10.1111/cmi.13280. Epub 2020 Nov 16.
7
Patient-specific placental vessel segmentation with limited data.基于有限数据的个体化胎盘血管分割。
J Robot Surg. 2024 Jun 4;18(1):237. doi: 10.1007/s11701-024-01981-z.
8
On the objectivity, reliability, and validity of deep learning enabled bioimage analyses.深度学习赋能的生物影像分析的客观性、可靠性和有效性。
Elife. 2020 Oct 19;9:e59780. doi: 10.7554/eLife.59780.
9
Creating High Fidelity Synthetic Pelvis Radiographs Using Generative Adversarial Networks: Unlocking the Potential of Deep Learning Models Without Patient Privacy Concerns.利用生成对抗网络生成高保真骨盆 X 射线:在不涉及患者隐私问题的情况下挖掘深度学习模型的潜力。
J Arthroplasty. 2023 Oct;38(10):2037-2043.e1. doi: 10.1016/j.arth.2022.12.013. Epub 2022 Dec 17.
10
Using Synthetic Training Data for Deep Learning-Based GBM Segmentation.使用合成训练数据进行基于深度学习的胶质母细胞瘤分割
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:6724-6729. doi: 10.1109/EMBC.2019.8856297.

引用本文的文献

1
MeVGAN: GAN-based plugin model for video generation with applications in colonoscopy.MeVGAN:基于生成对抗网络的视频生成插件模型及其在结肠镜检查中的应用
PLoS One. 2025 May 27;20(5):e0312038. doi: 10.1371/journal.pone.0312038. eCollection 2025.
2
Guided synthesis of annotated lung CT images with pathologies using a multi-conditioned denoising diffusion probabilistic model (mDDPM).使用多条件去噪扩散概率模型(mDDPM)对带有病变的标注肺部CT图像进行引导合成。
Phys Med Biol. 2025 Mar 6;70(6). doi: 10.1088/1361-6560/adb9b3.
3
Is synthetic data generation effective in maintaining clinical biomarkers? Investigating diffusion models across diverse imaging modalities.

本文引用的文献

1
Differential privacy in health research: A scoping review.健康研究中的差分隐私:范围综述。
J Am Med Inform Assoc. 2021 Sep 18;28(10):2269-2276. doi: 10.1093/jamia/ocab135.
2
Real-Time Polyp Detection, Localization and Segmentation in Colonoscopy Using Deep Learning.使用深度学习在结肠镜检查中进行实时息肉检测、定位和分割
IEEE Access. 2021 Mar 4;9:40496-40510. doi: 10.1109/ACCESS.2021.3063716. eCollection 2021.
3
HyperKvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy.HyperKvasir,一个用于胃肠道内镜的全面多类图像和视频数据集。
合成数据生成在维持临床生物标志物方面是否有效?跨多种成像模态研究扩散模型。
Front Artif Intell. 2025 Jan 31;7:1454441. doi: 10.3389/frai.2024.1454441. eCollection 2024.
4
Synthetic data in generalizable, learning-based neuroimaging.可推广的基于学习的神经影像学中的合成数据。
Imaging Neurosci (Camb). 2024 Nov 19;2:1-22. doi: 10.1162/imag_a_00337. eCollection 2024 Nov 1.
5
Bibliometric analysis of research on the application of deep learning to ophthalmology.深度学习在眼科应用研究的文献计量分析
Quant Imaging Med Surg. 2025 Jan 2;15(1):852-866. doi: 10.21037/qims-24-1340. Epub 2024 Dec 30.
6
Enhancing Amyloid PET Quantification: MRI-Guided Super-Resolution Using Latent Diffusion Models.增强淀粉样蛋白PET定量:使用潜在扩散模型的MRI引导超分辨率
Life (Basel). 2024 Dec 1;14(12):1580. doi: 10.3390/life14121580.
7
Synthetic Data Generation via Generative Adversarial Networks in Healthcare: A Systematic Review of Image- and Signal-Based Studies.医疗保健领域中通过生成对抗网络生成合成数据:基于图像和信号研究的系统综述。
IEEE Open J Eng Med Biol. 2024 Nov 28;6:183-192. doi: 10.1109/OJEMB.2024.3508472. eCollection 2025.
8
Self-improving generative foundation model for synthetic medical image generation and clinical applications.用于合成医学图像生成和临床应用的自我改进生成基础模型。
Nat Med. 2025 Feb;31(2):609-617. doi: 10.1038/s41591-024-03359-y. Epub 2024 Dec 11.
9
Privacy enhancing and generalizable deep learning with synthetic data for mediastinal neoplasm diagnosis.利用合成数据进行纵隔肿瘤诊断的隐私增强与通用深度学习。
NPJ Digit Med. 2024 Oct 20;7(1):293. doi: 10.1038/s41746-024-01290-7.
10
Recommendations for the creation of benchmark datasets for reproducible artificial intelligence in radiology.关于创建用于放射学中可重复人工智能的基准数据集的建议。
Insights Imaging. 2024 Oct 14;15(1):248. doi: 10.1186/s13244-024-01833-2.
Sci Data. 2020 Aug 28;7(1):283. doi: 10.1038/s41597-020-00622-y.
4
Variability and reproducibility in deep learning for medical image segmentation.深度学习在医学图像分割中的可变性和可重复性。
Sci Rep. 2020 Aug 13;10(1):13724. doi: 10.1038/s41598-020-69920-0.
5
UNet++: A Nested U-Net Architecture for Medical Image Segmentation.U-Net++:一种用于医学图像分割的嵌套U-Net架构。
Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.
6
Robustness study of noisy annotation in deep learning based medical image segmentation.基于深度学习的医学图像分割中噪声标注的稳健性研究。
Phys Med Biol. 2020 Aug 27;65(17):175007. doi: 10.1088/1361-6560/ab99e5.
7
Preparing Medical Imaging Data for Machine Learning.医学影像数据的机器学习准备
Radiology. 2020 Apr;295(1):4-15. doi: 10.1148/radiol.2020192224. Epub 2020 Feb 18.
8
Using Synthetic Training Data for Deep Learning-Based GBM Segmentation.使用合成训练数据进行基于深度学习的胶质母细胞瘤分割
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:6724-6729. doi: 10.1109/EMBC.2019.8856297.
9
DeephESC 2.0: Deep Generative Multi Adversarial Networks for improving the classification of hESC.DeephESC 2.0:用于改善 hESC 分类的深度生成式多对抗网络。
PLoS One. 2019 Mar 6;14(3):e0212849. doi: 10.1371/journal.pone.0212849. eCollection 2019.
10
Artificial intelligence in healthcare: past, present and future.人工智能在医疗保健中的应用:过去、现在和未来。
Stroke Vasc Neurol. 2017 Jun 21;2(4):230-243. doi: 10.1136/svn-2017-000101. eCollection 2017 Dec.