• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

医疗保健中的合成数据:一篇叙述性综述。

Synthetic data in health care: A narrative review.

作者信息

Gonzales Aldren, Guruswamy Guruprabha, Smith Scott R

机构信息

Office of the Assistant Secretary Planning and Evaluation, US Department of Health and Human Services, Washington, District of Columbia, United States of America.

Department of Health Administration and Policy, George Mason University, Virginia, United States of America.

出版信息

PLOS Digit Health. 2023 Jan 6;2(1):e0000082. doi: 10.1371/journal.pdig.0000082. eCollection 2023 Jan.

DOI:10.1371/journal.pdig.0000082
PMID:36812604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9931305/
Abstract

Data are central to research, public health, and in developing health information technology (IT) systems. Nevertheless, access to most data in health care is tightly controlled, which may limit innovation, development, and efficient implementation of new research, products, services, or systems. Using synthetic data is one of the many innovative ways that can allow organizations to share datasets with broader users. However, only a limited set of literature is available that explores its potentials and applications in health care. In this review paper, we examined existing literature to bridge the gap and highlight the utility of synthetic data in health care. We searched PubMed, Scopus, and Google Scholar to identify peer-reviewed articles, conference papers, reports, and thesis/dissertations articles related to the generation and use of synthetic datasets in health care. The review identified seven use cases of synthetic data in health care: a) simulation and prediction research, b) hypothesis, methods, and algorithm testing, c) epidemiology/public health research, d) health IT development, e) education and training, f) public release of datasets, and g) linking data. The review also identified readily and publicly accessible health care datasets, databases, and sandboxes containing synthetic data with varying degrees of utility for research, education, and software development. The review provided evidence that synthetic data are helpful in different aspects of health care and research. While the original real data remains the preferred choice, synthetic data hold possibilities in bridging data access gaps in research and evidence-based policymaking.

摘要

数据对于研究、公共卫生以及健康信息技术(IT)系统的开发至关重要。然而,医疗保健领域中大多数数据的访问受到严格控制,这可能会限制新研究、产品、服务或系统的创新、开发和有效实施。使用合成数据是众多创新方式之一,可使组织与更广泛的用户共享数据集。然而,探讨其在医疗保健领域的潜力和应用的文献有限。在这篇综述论文中,我们研究了现有文献以弥合差距,并突出合成数据在医疗保健中的实用性。我们检索了PubMed、Scopus和谷歌学术,以识别与医疗保健中合成数据集的生成和使用相关的同行评审文章、会议论文、报告以及论文/学位论文。该综述确定了合成数据在医疗保健中的七个用例:a)模拟和预测研究,b)假设、方法和算法测试,c)流行病学/公共卫生研究,d)健康IT开发,e)教育和培训,f)数据集的公开发布,以及g)数据链接。该综述还确定了易于获取且公开可用的医疗保健数据集、数据库和沙盒,其中包含对研究、教育和软件开发具有不同程度实用性的合成数据。该综述提供了证据表明合成数据在医疗保健和研究的不同方面都有帮助。虽然原始真实数据仍然是首选,但合成数据在弥合研究和循证决策中的数据访问差距方面具有潜力。

相似文献

1
Synthetic data in health care: A narrative review.医疗保健中的合成数据:一篇叙述性综述。
PLOS Digit Health. 2023 Jan 6;2(1):e0000082. doi: 10.1371/journal.pdig.0000082. eCollection 2023 Jan.
2
Beyond the black stump: rapid reviews of health research issues affecting regional, rural and remote Australia.超越黑木树:影响澳大利亚地区、农村和偏远地区的健康研究问题的快速综述。
Med J Aust. 2020 Dec;213 Suppl 11:S3-S32.e1. doi: 10.5694/mja2.50881.
3
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
6
Telemedicine for the Medicare population: pediatric, obstetric, and clinician-indirect home interventions.面向医疗保险人群的远程医疗:儿科、产科及临床医生间接居家干预措施
Evid Rep Technol Assess (Summ). 2001 Aug(24 Suppl):1-32.
7
A framework and methodology for navigating disaster and global health in crisis literature.危机文献中应对灾难与全球健康的框架及方法
PLoS Curr. 2013 Apr 4;5:ecurrents.dis.9af6948e381dafdd3e877c441527cba0. doi: 10.1371/currents.dis.9af6948e381dafdd3e877c441527cba0.
8
The use of narrative text for injury surveillance research: a systematic review.利用叙事文本进行伤害监测研究:系统评价。
Accid Anal Prev. 2010 Mar;42(2):354-63. doi: 10.1016/j.aap.2009.09.020. Epub 2009 Oct 24.
9
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
10
Promoting and supporting self-management for adults living in the community with physical chronic illness: A systematic review of the effectiveness and meaningfulness of the patient-practitioner encounter.促进和支持社区中患有慢性身体疾病的成年人进行自我管理:对医患互动的有效性和意义的系统评价。
JBI Libr Syst Rev. 2009;7(13):492-582. doi: 10.11124/01938924-200907130-00001.

引用本文的文献

1
Artificial intelligence in disease diagnostics: a comprehensive narrative review of current advances, applications, and future challenges in healthcare.疾病诊断中的人工智能:对医疗保健领域当前进展、应用及未来挑战的全面叙述性综述
Ann Med Surg (Lond). 2025 May 26;87(7):4237-4245. doi: 10.1097/MS9.0000000000003423. eCollection 2025 Jul.
2
Advancing the Integration of 'Basic/Fundamental' and Translational Cellular and Gene Therapy Science within the EBMT: Accelerating the Pathway to Progress.推动欧洲血液与骨髓移植学会(EBMT)内“基础/基本”与转化细胞及基因治疗科学的整合:加速迈向进展之路。
Bone Marrow Transplant. 2025 Aug 7. doi: 10.1038/s41409-025-02688-x.
3
Can Synthetic Data Allow for Smaller Sample Sizes in Chronic Urticaria Research?合成数据能否在慢性荨麻疹研究中减少样本量?
Clin Transl Allergy. 2025 Aug;15(8):e70087. doi: 10.1002/clt2.70087.
4
Transporting trial results to synthetic real-world populations in order to estimate real-world effectiveness of newly marketed medicines.将试验结果应用于合成的真实世界人群,以评估新上市药物的真实世界疗效。
BMJ Open. 2025 Jul 24;15(7):e089218. doi: 10.1136/bmjopen-2024-089218.
5
Synthetic data in medicine: Legal and ethical considerations for patient profiling.医学中的合成数据:患者画像的法律和伦理考量
Comput Struct Biotechnol J. 2025 May 29;28:190-198. doi: 10.1016/j.csbj.2025.05.026. eCollection 2025.
6
GenECG: a synthetic image-based ECG dataset to augment artificial intelligence-enhanced algorithm development.GenECG:一个基于合成图像的心电图数据集,用于促进人工智能增强算法的开发。
BMJ Health Care Inform. 2025 May 31;32(1):e101335. doi: 10.1136/bmjhci-2024-101335.
7
MeVGAN: GAN-based plugin model for video generation with applications in colonoscopy.MeVGAN:基于生成对抗网络的视频生成插件模型及其在结肠镜检查中的应用
PLoS One. 2025 May 27;20(5):e0312038. doi: 10.1371/journal.pone.0312038. eCollection 2025.
8
Tempered enthusiasm by interviewed experts for synthetic data and ELSI checklists for AI in medicine.受访专家对医学人工智能合成数据和伦理、法律与社会影响(ELSI)清单的热情有所降温。
AI Ethics. 2025;5(3):3241-3254. doi: 10.1007/s43681-024-00652-x. Epub 2025 Jan 10.
9
Exploring the Utilization of Synthetic Data in Unsupervised Clustering for Opioid Misuse Analysis.探索合成数据在阿片类药物滥用分析的无监督聚类中的应用。
AMIA Annu Symp Proc. 2025 May 22;2024:1313-1322. eCollection 2024.
10
Validity of tremor analysis using smartphone compatible computer vision frameworks.使用与智能手机兼容的计算机视觉框架进行震颤分析的有效性。
Sci Rep. 2025 Apr 18;15(1):13391. doi: 10.1038/s41598-025-97252-4.

本文引用的文献

1
Generation of Synthetic Chest X-ray Images and Detection of COVID-19: A Deep Learning Based Approach.合成胸部X光图像的生成与新冠肺炎检测:一种基于深度学习的方法。
Diagnostics (Basel). 2021 May 18;11(5):895. doi: 10.3390/diagnostics11050895.
2
A deep learning approach to generate synthetic CT in low field MR-guided adaptive radiotherapy for abdominal and pelvic cases.一种深度学习方法,用于在低场磁共振引导自适应放疗中生成腹部和盆腔病例的合成 CT。
Radiother Oncol. 2020 Dec;153:205-212. doi: 10.1016/j.radonc.2020.10.018. Epub 2020 Oct 17.
3
Using deep learning to generate synthetic B-mode musculoskeletal ultrasound images.利用深度学习生成合成B模式肌肉骨骼超声图像。
Comput Methods Programs Biomed. 2020 Nov;196:105583. doi: 10.1016/j.cmpb.2020.105583. Epub 2020 Jun 4.
4
MicroEnv: A microsimulation model for quantifying the impacts of environmental policies on population health and health inequalities.微环境:一种用于量化环境政策对人口健康和健康不平等影响的微观模拟模型。
Sci Total Environ. 2019 Dec 20;697:134105. doi: 10.1016/j.scitotenv.2019.134105. Epub 2019 Aug 29.
5
Analyzing Medical Research Results Based on Synthetic Data and Their Relation to Real Data Results: Systematic Comparison From Five Observational Studies.基于合成数据的医学研究结果分析及其与真实数据结果的关系:五项观察性研究的系统比较
JMIR Med Inform. 2020 Feb 20;8(2):e16492. doi: 10.2196/16492.
6
The sensitivity of reported effects of EMF on childhood leukemia to uncontrolled confounding by residential mobility: a hybrid simulation study and an empirical analysis using CAPS data.电磁场对儿童白血病影响的报告效应受居住流动性未得到控制的混杂因素的影响的敏感性:混合模拟研究和使用 CAPS 数据的实证分析。
Cancer Causes Control. 2019 Aug;30(8):901-908. doi: 10.1007/s10552-019-01189-9. Epub 2019 May 29.
7
The validity of synthetic clinical data: a validation study of a leading synthetic data generator (Synthea) using clinical quality measures.合成临床数据的有效性:使用临床质量指标对领先的合成数据生成器(Synthea)进行验证研究。
BMC Med Inform Decis Mak. 2019 Mar 14;19(1):44. doi: 10.1186/s12911-019-0793-0.
8
Feasibility of Reidentifying Individuals in Large National Physical Activity Data Sets From Which Protected Health Information Has Been Removed With Use of Machine Learning.利用机器学习对已去除保护健康信息的大型国家体力活动数据集进行重新识别个体的可行性。
JAMA Netw Open. 2018 Dec 7;1(8):e186040. doi: 10.1001/jamanetworkopen.2018.6040.
9
Epidemiologic and economic impact of pharmacies as vaccination locations during an influenza epidemic.流感大流行期间,将药店作为疫苗接种点的流行病学和经济学影响。
Vaccine. 2018 Nov 12;36(46):7054-7063. doi: 10.1016/j.vaccine.2018.09.040. Epub 2018 Oct 16.
10
Mixed effect machine learning: A framework for predicting longitudinal change in hemoglobin A1c.混合效应机器学习:预测血红蛋白 A1c 纵向变化的框架。
J Biomed Inform. 2019 Jan;89:56-67. doi: 10.1016/j.jbi.2018.09.001. Epub 2018 Sep 4.