• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

合成健康数据可助力社区研究工作,以便在新发大流行期间更好地为公众提供信息。

Synthetic Health Data Can Augment Community Research Efforts to Better Inform the Public During Emerging Pandemics.

作者信息

Prasanna Anish, Jing Bocheng, Plopper George, Miller Kristina Krasnov, Sanjak Jaleal, Feng Alice, Prezek Sarah, Vidyaprakash Eshaw, Thovarai Vishal, Maier Ezekiel J, Bhattacharya Avik, Naaman Lama, Stephens Holly, Watford Sean, Boscardin W John, Johanson Elaine, Lienau Amanda

机构信息

Booz Allen Hamilton.

Northern California Institute for Research and Education.

出版信息

medRxiv. 2023 Dec 13:2023.12.11.23298687. doi: 10.1101/2023.12.11.23298687.

DOI:10.1101/2023.12.11.23298687
PMID:38168217
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10760275/
Abstract

The COVID-19 pandemic had disproportionate effects on the Veteran population due to the increased prevalence of medical and environmental risk factors. Synthetic electronic health record (EHR) data can help meet the acute need for Veteran population-specific predictive modeling efforts by avoiding the strict barriers to access, currently present within Veteran Health Administration (VHA) datasets. The U.S. Food and Drug Administration (FDA) and the VHA launched the precisionFDA COVID-19 Risk Factor Modeling Challenge to develop COVID-19 diagnostic and prognostic models; identify Veteran population-specific risk factors; and test the usefulness of synthetic data as a substitute for real data. The use of synthetic data boosted challenge participation by providing a dataset that was accessible to all competitors. Models trained on synthetic data showed similar but systematically inflated model performance metrics to those trained on real data. The important risk factors identified in the synthetic data largely overlapped with those identified from the real data, and both sets of risk factors were validated in the literature. Tradeoffs exist between synthetic data generation approaches based on whether a real EHR dataset is required as input. Synthetic data generated directly from real EHR input will more closely align with the characteristics of the relevant cohort. This work shows that synthetic EHR data will have practical value to the Veterans' health research community for the foreseeable future.

摘要

由于医疗和环境风险因素的患病率增加,新冠疫情对退伍军人产生了不成比例的影响。合成电子健康记录(EHR)数据可以通过避免目前退伍军人健康管理局(VHA)数据集中存在的严格访问障碍,帮助满足针对退伍军人的特定预测建模工作的迫切需求。美国食品药品监督管理局(FDA)和VHA发起了precisionFDA新冠风险因素建模挑战赛,以开发新冠诊断和预后模型;识别退伍军人特定的风险因素;并测试合成数据作为真实数据替代品的有用性。合成数据的使用通过提供一个所有参赛者都能访问的数据集,提高了挑战赛的参与度。在合成数据上训练的模型显示出与在真实数据上训练的模型相似但系统性夸大的模型性能指标。在合成数据中识别出的重要风险因素与从真实数据中识别出的风险因素在很大程度上重叠,并且两组风险因素都在文献中得到了验证。基于是否需要真实EHR数据集作为输入,合成数据生成方法之间存在权衡。直接从真实EHR输入生成的合成数据将更紧密地符合相关队列的特征。这项工作表明,在可预见的未来,合成EHR数据将对退伍军人健康研究界具有实用价值。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/da67da2eb13a/nihpp-2023.12.11.23298687v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/8e1e1c5b6f32/nihpp-2023.12.11.23298687v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/0a75bc5981aa/nihpp-2023.12.11.23298687v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/81b1e26659a4/nihpp-2023.12.11.23298687v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/debc0c285acf/nihpp-2023.12.11.23298687v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/f594a9ec3e02/nihpp-2023.12.11.23298687v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/6a5102def57b/nihpp-2023.12.11.23298687v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/da67da2eb13a/nihpp-2023.12.11.23298687v1-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/8e1e1c5b6f32/nihpp-2023.12.11.23298687v1-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/0a75bc5981aa/nihpp-2023.12.11.23298687v1-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/81b1e26659a4/nihpp-2023.12.11.23298687v1-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/debc0c285acf/nihpp-2023.12.11.23298687v1-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/f594a9ec3e02/nihpp-2023.12.11.23298687v1-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/6a5102def57b/nihpp-2023.12.11.23298687v1-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ecb2/10760275/da67da2eb13a/nihpp-2023.12.11.23298687v1-f0007.jpg

相似文献

1
Synthetic Health Data Can Augment Community Research Efforts to Better Inform the Public During Emerging Pandemics.合成健康数据可助力社区研究工作,以便在新发大流行期间更好地为公众提供信息。
medRxiv. 2023 Dec 13:2023.12.11.23298687. doi: 10.1101/2023.12.11.23298687.
2
3
Competing Demands: Scheduling Challenges in Being Veteran-centric in the Setting of Health System Initiatives to Improve Access.竞争需求:在改善医疗服务获取的卫生系统计划中以退伍军人为中心所面临的调度挑战。
Mil Med. 2021 Nov 2;186(11-12):e1233-e1240. doi: 10.1093/milmed/usaa520.
4
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
5
Identifying Family and Unpaid Caregivers in Electronic Health Records: Descriptive Analysis.在电子健康记录中识别家庭和无偿护理人员:描述性分析。
JMIR Form Res. 2022 Jul 18;6(7):e35623. doi: 10.2196/35623.
6
Exploring Research Engagement and Priorities of Transgender and Gender Diverse Veterans.探索跨性别和性别多样化退伍军人的研究参与度及优先事项。
Mil Med. 2023 May 16;188(5-6):e1224-e1231. doi: 10.1093/milmed/usab460.
7
Veterans' ambulatory care experience during COVID-19: veterans' access to and satisfaction with primary care early in the pandemic.退役军人在 COVID-19 期间的流动医疗体验:疫情早期退役军人对初级保健的可及性和满意度。
BMC Prim Care. 2022 Sep 21;23(1):245. doi: 10.1186/s12875-022-01851-3.
8
Chronic Pain, Cannabis Legalization and Cannabis Use Disorder in Veterans Health Administration Patients, 2005 to 2019.2005年至2019年退伍军人健康管理局患者中的慢性疼痛、大麻合法化与大麻使用障碍
medRxiv. 2023 Jul 12:2023.07.10.23292453. doi: 10.1101/2023.07.10.23292453.
9
Documentation of Screening for Firearm Access by Healthcare Providers in the Veterans Healthcare System: A Retrospective Study.退伍军人医疗系统中医疗服务提供者对枪支获取情况筛查的记录:一项回顾性研究。
West J Emerg Med. 2021 May 19;22(3):525-532. doi: 10.5811/westjem.2021.4.51203.
10
User Testing of the Veteran Delegation Tool: Qualitative Inquiry.退伍军人授权工具的用户测试:定性研究。
J Med Internet Res. 2023 Feb 23;25:e40634. doi: 10.2196/40634.

本文引用的文献

1
Risk Factors Associated With Post-COVID-19 Condition: A Systematic Review and Meta-analysis.与新冠后状况相关的风险因素:系统评价和荟萃分析。
JAMA Intern Med. 2023 Jun 1;183(6):566-580. doi: 10.1001/jamainternmed.2023.0750.
2
COVID-19 and diabetes in 2020: a systematic review.2020年新型冠状病毒肺炎与糖尿病:一项系统综述
J Pharm Policy Pract. 2023 Mar 9;16(1):42. doi: 10.1186/s40545-023-00546-z.
3
Synthetic data in health care: A narrative review.医疗保健中的合成数据:一篇叙述性综述。
PLOS Digit Health. 2023 Jan 6;2(1):e0000082. doi: 10.1371/journal.pdig.0000082. eCollection 2023 Jan.
4
Real-world data mining meets clinical practice: Research challenges and perspective.真实世界数据挖掘与临床实践:研究挑战与展望。
Front Big Data. 2022 Oct 21;5:1021621. doi: 10.3389/fdata.2022.1021621. eCollection 2022.
5
Leveraging electronic health records for data science: common pitfalls and how to avoid them.利用电子健康记录进行数据科学:常见陷阱及规避方法。
Lancet Digit Health. 2022 Dec;4(12):e893-e898. doi: 10.1016/S2589-7500(22)00154-6. Epub 2022 Sep 22.
6
From real-world electronic health record data to real-world results using artificial intelligence.从真实世界的电子健康记录数据到使用人工智能获得真实世界的结果。
Ann Rheum Dis. 2023 Mar;82(3):306-311. doi: 10.1136/ard-2022-222626. Epub 2022 Sep 23.
7
You Can't Have AI Both Ways: Balancing Health Data Privacy and Access Fairly.人工智能不能两全其美:公平平衡健康数据隐私与获取
Front Genet. 2022 Jun 13;13:929453. doi: 10.3389/fgene.2022.929453. eCollection 2022.
8
Association of Obesity With COVID-19 Severity and Mortality: An Updated Systemic Review, Meta-Analysis, and Meta-Regression.肥胖与 COVID-19 严重程度和死亡率的关联:一项更新的系统评价、荟萃分析和荟萃回归研究。
Front Endocrinol (Lausanne). 2022 Jun 3;13:780872. doi: 10.3389/fendo.2022.780872. eCollection 2022.
9
PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions.精准FDA真相挑战V2:在难以映射的区域中从短读长和长读长中识别变异体。
Cell Genom. 2022 May 11;2(5). doi: 10.1016/j.xgen.2022.100129. Epub 2022 Apr 27.
10
COVID-19 in Veterans: A Narrative Review.退伍军人中的新型冠状病毒肺炎:一篇叙述性综述
Risk Manag Healthc Policy. 2022 Apr 26;15:805-815. doi: 10.2147/RMHP.S354814. eCollection 2022.