• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

克服医学图像生成中数据共享的障碍:一项综合评估。

Overcoming barriers to data sharing with medical image generation: a comprehensive evaluation.

作者信息

DuMont Schütte August, Hetzel Jürgen, Gatidis Sergios, Hepp Tobias, Dietz Benedikt, Bauer Stefan, Schwab Patrick

机构信息

ETH Zurich, Zurich, Switzerland.

Max Planck Institute for Intelligent Systems, Tübingen, Germany.

出版信息

NPJ Digit Med. 2021 Sep 24;4(1):141. doi: 10.1038/s41746-021-00507-3.

DOI:10.1038/s41746-021-00507-3
PMID:34561528
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8463544/
Abstract

Privacy concerns around sharing personally identifiable information are a major barrier to data sharing in medical research. In many cases, researchers have no interest in a particular individual's information but rather aim to derive insights at the level of cohorts. Here, we utilise generative adversarial networks (GANs) to create medical imaging datasets consisting entirely of synthetic patient data. The synthetic images ideally have, in aggregate, similar statistical properties to those of a source dataset but do not contain sensitive personal information. We assess the quality of synthetic data generated by two GAN models for chest radiographs with 14 radiology findings and brain computed tomography (CT) scans with six types of intracranial haemorrhages. We measure the synthetic image quality by the performance difference of predictive models trained on either the synthetic or the real dataset. We find that synthetic data performance disproportionately benefits from a reduced number of classes. Our benchmark also indicates that at low numbers of samples per class, label overfitting effects start to dominate GAN training. We conducted a reader study in which trained radiologists discriminate between synthetic and real images. In accordance with our benchmark results, the classification accuracy of radiologists improves with an increasing resolution. Our study offers valuable guidelines and outlines practical conditions under which insights derived from synthetic images are similar to those that would have been derived from real data. Our results indicate that synthetic data sharing may be an attractive alternative to sharing real patient-level data in the right setting.

摘要

围绕共享个人身份信息的隐私担忧是医学研究数据共享的一大障碍。在许多情况下,研究人员对特定个人的信息并不感兴趣,而是旨在从队列层面得出见解。在此,我们利用生成对抗网络(GAN)来创建完全由合成患者数据组成的医学影像数据集。理想情况下,合成图像总体上具有与源数据集相似的统计特性,但不包含敏感的个人信息。我们评估了两种GAN模型生成的合成数据的质量,一种用于有14种放射学发现的胸部X光片,另一种用于有六种颅内出血类型的脑部计算机断层扫描(CT)。我们通过在合成数据集或真实数据集上训练的预测模型的性能差异来衡量合成图像质量。我们发现合成数据性能从减少的类别数量中受益过多。我们的基准测试还表明,在每个类别样本数量较少时,标签过拟合效应开始主导GAN训练。我们进行了一项读者研究,让训练有素的放射科医生区分合成图像和真实图像。根据我们的基准测试结果,放射科医生的分类准确率随着分辨率的提高而提高。我们的研究提供了有价值的指导方针,并概述了实际条件,在这些条件下,从合成图像得出的见解与从真实数据得出的见解相似。我们的结果表明,在合适的环境下,合成数据共享可能是共享真实患者层面数据的一个有吸引力的替代方案。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/5d5bd174ce7b/41746_2021_507_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/e3735d5af093/41746_2021_507_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/d48e0eb0abed/41746_2021_507_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/9d2417990ad8/41746_2021_507_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/e52c5db701a0/41746_2021_507_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/5d5bd174ce7b/41746_2021_507_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/e3735d5af093/41746_2021_507_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/d48e0eb0abed/41746_2021_507_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/9d2417990ad8/41746_2021_507_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/e52c5db701a0/41746_2021_507_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eee2/8463544/5d5bd174ce7b/41746_2021_507_Fig5_HTML.jpg

相似文献

1
Overcoming barriers to data sharing with medical image generation: a comprehensive evaluation.克服医学图像生成中数据共享的障碍:一项综合评估。
NPJ Digit Med. 2021 Sep 24;4(1):141. doi: 10.1038/s41746-021-00507-3.
2
Creating High Fidelity Synthetic Pelvis Radiographs Using Generative Adversarial Networks: Unlocking the Potential of Deep Learning Models Without Patient Privacy Concerns.利用生成对抗网络生成高保真骨盆 X 射线:在不涉及患者隐私问题的情况下挖掘深度学习模型的潜力。
J Arthroplasty. 2023 Oct;38(10):2037-2043.e1. doi: 10.1016/j.arth.2022.12.013. Epub 2022 Dec 17.
3
Synthesizing anonymized and labeled TOF-MRA patches for brain vessel segmentation using generative adversarial networks.使用生成对抗网络对 TOF-MRA 斑块进行匿名和标记,以进行脑部血管分割。
Comput Biol Med. 2021 Apr;131:104254. doi: 10.1016/j.compbiomed.2021.104254. Epub 2021 Feb 15.
4
Brain tumor segmentation using synthetic MR images - A comparison of GANs and diffusion models.基于合成磁共振图像的脑肿瘤分割——GANs 和扩散模型的比较。
Sci Data. 2024 Feb 29;11(1):259. doi: 10.1038/s41597-024-03073-x.
5
Demonstrating the successful application of synthetic learning in spine surgery for training multi-center models with increased patient privacy.展示了合成学习在脊柱外科中的成功应用,该方法用于训练具有更高患者隐私保护的多中心模型。
Sci Rep. 2023 Aug 1;13(1):12481. doi: 10.1038/s41598-023-39458-y.
6
SynthEye: Investigating the Impact of Synthetic Data on Artificial Intelligence-assisted Gene Diagnosis of Inherited Retinal Disease.SynthEye:研究合成数据对遗传性视网膜疾病人工智能辅助基因诊断的影响。
Ophthalmol Sci. 2022 Nov 22;3(2):100258. doi: 10.1016/j.xops.2022.100258. eCollection 2023 Jun.
7
Deepfakes in Ophthalmology: Applications and Realism of Synthetic Retinal Images from Generative Adversarial Networks.眼科中的深度伪造技术:生成对抗网络合成视网膜图像的应用与逼真度
Ophthalmol Sci. 2021 Nov 16;1(4):100079. doi: 10.1016/j.xops.2021.100079. eCollection 2021 Dec.
8
Evaluation of Generative Adversarial Networks for High-Resolution Synthetic Image Generation of Circumpapillary Optical Coherence Tomography Images for Glaucoma.用于青光眼的周边视网膜光相干断层扫描图像高分辨率合成图像生成的生成对抗网络评估。
JAMA Ophthalmol. 2022 Oct 1;140(10):974-981. doi: 10.1001/jamaophthalmol.2022.3375.
9
Generating 3D TOF-MRA volumes and segmentation labels using generative adversarial networks.使用生成对抗网络生成 3D TOF-MRA 容积和分割标签。
Med Image Anal. 2022 May;78:102396. doi: 10.1016/j.media.2022.102396. Epub 2022 Feb 24.
10
A deep learning approach to private data sharing of medical images using conditional generative adversarial networks (GANs).基于条件生成对抗网络(GANs)的医学图像私有数据共享的深度学习方法。
PLoS One. 2023 Jul 6;18(7):e0280316. doi: 10.1371/journal.pone.0280316. eCollection 2023.

引用本文的文献

1
Large Language Models in Medical Image Analysis: A Systematic Survey and Future Directions.医学图像分析中的大语言模型:系统综述与未来方向
Bioengineering (Basel). 2025 Jul 29;12(8):818. doi: 10.3390/bioengineering12080818.
2
Unconditional latent diffusion models memorize patient imaging data.无条件潜在扩散模型会记住患者的影像数据。
Nat Biomed Eng. 2025 Aug 11. doi: 10.1038/s41551-025-01468-8.
3
Scorecard for synthetic medical data evaluation.合成医学数据评估记分卡。

本文引用的文献

1
Construction of a Machine Learning Dataset through Collaboration: The RSNA 2019 Brain CT Hemorrhage Challenge.通过合作构建机器学习数据集:2019年RSNA脑CT出血挑战赛
Radiol Artif Intell. 2020 Apr 29;2(3):e190211. doi: 10.1148/ryai.2020190211. eCollection 2020 May.
2
Multiclass semantic segmentation and quantification of traumatic brain injury lesions on head CT using deep learning: an algorithm development and multicentre validation study.基于深度学习的头部 CT 外伤性脑损伤病灶的多类语义分割和定量:一项算法开发和多中心验证研究。
Lancet Digit Health. 2020 Jun;2(6):e314-e322. doi: 10.1016/S2589-7500(20)30085-6. Epub 2020 May 14.
3
Commun Eng. 2025 Jul 21;4(1):130. doi: 10.1038/s44172-025-00450-1.
4
Improving AI models for rare thyroid cancer subtype by text guided diffusion models.通过文本引导扩散模型改进罕见甲状腺癌亚型的人工智能模型。
Nat Commun. 2025 May 13;16(1):4449. doi: 10.1038/s41467-025-59478-8.
5
Perceptual and technical barriers in sharing and formatting metadata accompanying omics studies.组学研究中伴随元数据的共享与格式化方面的感知和技术障碍。
Cell Genom. 2025 May 14;5(5):100845. doi: 10.1016/j.xgen.2025.100845. Epub 2025 Apr 10.
6
Acceptability, benefits and barriers of electronic health record radiology image sharing: A mixed-method study.电子健康记录放射影像共享的可接受性、益处与障碍:一项混合方法研究。
J Med Radiat Sci. 2025 Jun;72(2):244-254. doi: 10.1002/jmrs.853. Epub 2025 Mar 27.
7
A data-efficient strategy for building high-performing medical foundation models.一种构建高性能医学基础模型的数据高效策略。
Nat Biomed Eng. 2025 Apr;9(4):539-551. doi: 10.1038/s41551-025-01365-0. Epub 2025 Mar 5.
8
Generative artificial intelligence enables the generation of bone scintigraphy images and improves generalization of deep learning models in data-constrained environments.生成式人工智能能够生成骨闪烁显像图像,并在数据受限的环境中提高深度学习模型的泛化能力。
Eur J Nucl Med Mol Imaging. 2025 Jun;52(7):2355-2368. doi: 10.1007/s00259-025-07091-8. Epub 2025 Jan 29.
9
Generative models of MRI-derived neuroimaging features and associated dataset of 18,000 samples.基于MRI的神经影像特征生成模型及包含18000个样本的相关数据集。
Sci Data. 2024 Dec 5;11(1):1330. doi: 10.1038/s41597-024-04157-4.
10
Report on the AAPM grand challenge on deep generative modeling for learning medical image statistics.关于美国医学物理学会(AAPM)深度生成建模用于学习医学图像统计学重大挑战的报告。
Med Phys. 2025 Jan;52(1):4-20. doi: 10.1002/mp.17473. Epub 2024 Oct 24.
Breaking medical data sharing boundaries by using synthesized radiographs.
利用合成X线照片突破医学数据共享的界限。
Sci Adv. 2020 Dec 2;6(49). doi: 10.1126/sciadv.abb7973. Print 2020 Dec.
4
A deep learning system for differential diagnosis of skin diseases.深度学习系统用于皮肤病的鉴别诊断。
Nat Med. 2020 Jun;26(6):900-908. doi: 10.1038/s41591-020-0842-3. Epub 2020 May 18.
5
A Style-Based Generator Architecture for Generative Adversarial Networks.基于风格的生成对抗网络生成器架构。
IEEE Trans Pattern Anal Mach Intell. 2021 Dec;43(12):4217-4228. doi: 10.1109/TPAMI.2020.2970919. Epub 2021 Nov 3.
6
MedGAN: Medical image translation using GANs.MedGAN:使用 GAN 进行医学图像翻译。
Comput Med Imaging Graph. 2020 Jan;79:101684. doi: 10.1016/j.compmedimag.2019.101684. Epub 2019 Nov 22.
7
Deep learning-based classification of mesothelioma improves prediction of patient outcome.基于深度学习的间皮瘤分类提高了患者预后的预测能力。
Nat Med. 2019 Oct;25(10):1519-1525. doi: 10.1038/s41591-019-0583-3. Epub 2019 Oct 7.
8
Deep learning enables rapid identification of potent DDR1 kinase inhibitors.深度学习可快速鉴定有效的 DDR1 激酶抑制剂。
Nat Biotechnol. 2019 Sep;37(9):1038-1040. doi: 10.1038/s41587-019-0224-x. Epub 2019 Sep 2.
9
Bi-Modality Medical Image Synthesis Using Semi-Supervised Sequential Generative Adversarial Networks.基于半监督序贯生成对抗网络的双模态医学图像合成。
IEEE J Biomed Health Inform. 2020 Mar;24(3):855-865. doi: 10.1109/JBHI.2019.2922986. Epub 2019 Jun 14.
10
Causal relationships among the gut microbiome, short-chain fatty acids and metabolic diseases.肠道微生物组、短链脂肪酸与代谢性疾病之间的因果关系。
Nat Genet. 2019 Apr;51(4):600-605. doi: 10.1038/s41588-019-0350-x. Epub 2019 Feb 18.