• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

为低收入和中等收入国家(LMICs)的预测模型整合电子健康记录

Synthesizing Electronic Health Records for Predictive Models in Low-Middle-Income Countries (LMICs).

作者信息

Ghosheh Ghadeer O, Thwaites C Louise, Zhu Tingting

机构信息

Department of Engineering Sciences, University of Oxford, Oxford OX1 3PJ, UK.

Oxford University Clinical Research Unit (OUCRU), Ho Chi Minh City 710400, Vietnam.

出版信息

Biomedicines. 2023 Jun 18;11(6):1749. doi: 10.3390/biomedicines11061749.

DOI:10.3390/biomedicines11061749
PMID:37371844
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10295936/
Abstract

The spread of machine learning models, coupled with by the growing adoption of electronic health records (EHRs), has opened the door for developing clinical decision support systems. However, despite the great promise of machine learning for healthcare in low-middle-income countries (LMICs), many data-specific limitations, such as the small size and irregular sampling, hinder the progress in such applications. Recently, deep generative models have been proposed to generate realistic-looking synthetic data, including EHRs, by learning the underlying data distribution without compromising patient privacy. In this study, we first use a deep generative model to generate synthetic data based on a small dataset (364 patients) from a LMIC setting. Next, we use synthetic data to build models that predict the onset of hospital-acquired infections based on minimal information collected at patient ICU admission. The performance of the diagnostic model trained on the synthetic data outperformed models trained on the original and oversampled data using techniques such as SMOTE. We also experiment with varying the size of the synthetic data and observe the impact on the performance and interpretability of the models. Our results show the promise of using deep generative models in enabling healthcare data owners to develop and validate models that serve their needs and applications, despite limitations in dataset size.

摘要

机器学习模型的传播,再加上电子健康记录(EHR)的日益普及,为临床决策支持系统的开发打开了大门。然而,尽管机器学习在中低收入国家(LMIC)的医疗保健领域前景广阔,但许多特定于数据的限制,如规模小和采样不规则,阻碍了此类应用的进展。最近,有人提出了深度生成模型,通过学习潜在的数据分布来生成逼真的合成数据,包括电子健康记录,同时不损害患者隐私。在本研究中,我们首先使用深度生成模型,基于来自中低收入国家环境的一个小数据集(364名患者)生成合成数据。接下来,我们使用合成数据构建模型,这些模型根据患者入住重症监护病房时收集的最少信息来预测医院获得性感染的发生。在合成数据上训练的诊断模型的性能优于使用SMOTE等技术在原始数据和过采样数据上训练的模型。我们还对合成数据的大小进行了变化实验,并观察其对模型性能和可解释性的影响。我们的结果表明,尽管数据集规模有限,但使用深度生成模型有望使医疗保健数据所有者开发和验证满足其需求和应用的模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f04b/10295936/395b16d6a604/biomedicines-11-01749-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f04b/10295936/6987e126e914/biomedicines-11-01749-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f04b/10295936/467ba622071c/biomedicines-11-01749-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f04b/10295936/395b16d6a604/biomedicines-11-01749-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f04b/10295936/6987e126e914/biomedicines-11-01749-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f04b/10295936/467ba622071c/biomedicines-11-01749-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f04b/10295936/395b16d6a604/biomedicines-11-01749-g003.jpg

相似文献

1
Synthesizing Electronic Health Records for Predictive Models in Low-Middle-Income Countries (LMICs).为低收入和中等收入国家(LMICs)的预测模型整合电子健康记录
Biomedicines. 2023 Jun 18;11(6):1749. doi: 10.3390/biomedicines11061749.
2
Natural language generation for electronic health records.电子健康记录的自然语言生成
NPJ Digit Med. 2018 Nov 19;1:63. doi: 10.1038/s41746-018-0070-0. Print 2018.
3
Predicting the onset of type 2 diabetes using wide and deep learning with electronic health records.利用电子健康记录的宽深学习预测 2 型糖尿病发病。
Comput Methods Programs Biomed. 2019 Dec;182:105055. doi: 10.1016/j.cmpb.2019.105055. Epub 2019 Aug 27.
4
Synthesizing electronic health records using improved generative adversarial networks.使用改进的生成对抗网络合成电子健康记录。
J Am Med Inform Assoc. 2019 Mar 1;26(3):228-241. doi: 10.1093/jamia/ocy142.
5
Privacy preserving Generative Adversarial Networks to model Electronic Health Records.用于建模电子健康记录的隐私保护生成对抗网络。
Neural Netw. 2022 Sep;153:339-348. doi: 10.1016/j.neunet.2022.06.022. Epub 2022 Jun 25.
6
Generating sequential electronic health records using dual adversarial autoencoder.使用对偶对抗自动编码器生成连续的电子健康记录。
J Am Med Inform Assoc. 2020 Jul 1;27(9):1411-1419. doi: 10.1093/jamia/ocaa119.
7
Kenyan health stakeholder views on individual consent, general notification and governance processes for the re-use of hospital inpatient data to support learning on healthcare systems.肯尼亚卫生利益相关者对重新使用医院住院数据以支持医疗系统学习的个人同意、一般通知和治理流程的看法。
BMC Med Ethics. 2019 Jan 8;20(1):3. doi: 10.1186/s12910-018-0343-9.
8
Improving irregular temporal modeling by integrating synthetic data to the electronic medical record using conditional GANs: a case study of fluid overload prediction in the intensive care unit.通过使用条件生成对抗网络将合成数据集成到电子病历中来改进不规则时间建模:重症监护病房液体超负荷预测的案例研究
medRxiv. 2023 Jun 27:2023.06.20.23291680. doi: 10.1101/2023.06.20.23291680.
9
A Distributed Ensemble Approach for Mining Healthcare Data under Privacy Constraints.一种隐私约束下挖掘医疗保健数据的分布式集成方法。
Inf Sci (N Y). 2016 Feb 10;330:245-259. doi: 10.1016/j.ins.2015.10.011.
10
Creating High-Quality Synthetic Health Data: Framework for Model Development and Validation.创建高质量合成健康数据:模型开发与验证框架。
JMIR Form Res. 2024 Apr 22;8:e53241. doi: 10.2196/53241.

引用本文的文献

1
Addressing social determinants of health through customization: Quality improvement, telemedicine, and care coordination to serve immigrant families.通过定制化解决健康的社会决定因素:改善医疗质量、远程医疗和协调护理,为移民家庭提供服务。
J Pediatr Rehabil Med. 2023;16(4):665-674. doi: 10.3233/PRM-230036.

本文引用的文献

1
Generating synthetic mixed-type longitudinal electronic health records for artificial intelligent applications.为人工智能应用生成合成混合型纵向电子健康记录。
NPJ Digit Med. 2023 May 27;6(1):98. doi: 10.1038/s41746-023-00834-7.
2
A Catalogue of Machine Learning Algorithms for Healthcare Risk Predictions.医疗保健风险预测的机器学习算法目录。
Sensors (Basel). 2022 Nov 8;22(22):8615. doi: 10.3390/s22228615.
3
Application of Artificial Intelligence in Shared Decision Making: Scoping Review.人工智能在共同决策中的应用:范围综述
JMIR Med Inform. 2022 Aug 9;10(8):e36199. doi: 10.2196/36199.
4
Electronic health records post-implementation challenges in selected hospitals: A qualitative study in the Central Region of southern Ghana.加纳南部中部地区部分医院电子健康记录实施后的挑战:一项定性研究
Health Inf Manag. 2023 Sep;52(3):204-211. doi: 10.1177/18333583221096899. Epub 2022 May 15.
5
Global, regional, and national prevalence of, and risk factors for, chronic obstructive pulmonary disease (COPD) in 2019: a systematic review and modelling analysis.全球、区域和国家 2019 年慢性阻塞性肺疾病(COPD)的患病率、危险因素:系统评价和建模分析。
Lancet Respir Med. 2022 May;10(5):447-458. doi: 10.1016/S2213-2600(21)00511-7. Epub 2022 Mar 10.
6
Global burden of bacterial antimicrobial resistance in 2019: a systematic analysis.2019 年全球细菌对抗菌药物耐药性的负担:系统分析。
Lancet. 2022 Feb 12;399(10325):629-655. doi: 10.1016/S0140-6736(21)02724-0. Epub 2022 Jan 19.
7
An explainable machine learning-based clinical decision support system for prediction of gestational diabetes mellitus.基于可解释机器学习的妊娠期糖尿病预测临床决策支持系统。
Sci Rep. 2022 Jan 21;12(1):1170. doi: 10.1038/s41598-022-05112-2.
8
Comprehensive risk assessment for hospital-acquired pneumonia: sociodemographic, clinical, and hospital environmental factors associated with the incidence of hospital-acquired pneumonia.医院获得性肺炎的综合风险评估:与医院获得性肺炎发病率相关的社会人口学、临床和医院环境因素。
BMC Pulm Med. 2022 Jan 12;22(1):21. doi: 10.1186/s12890-021-01816-9.
9
Comparing COVID-19 risk factors in Brazil using machine learning: the importance of socioeconomic, demographic and structural factors.使用机器学习比较巴西的 COVID-19 风险因素:社会经济、人口和结构因素的重要性。
Sci Rep. 2021 Aug 2;11(1):15591. doi: 10.1038/s41598-021-95004-8.
10
Health informatics and EHR to support clinical research in the COVID-19 pandemic: an overview.健康信息学和电子健康记录在支持 COVID-19 大流行中的临床研究中的作用:概述。
Brief Bioinform. 2021 Mar 22;22(2):812-822. doi: 10.1093/bib/bbaa418.