• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电子健康记录联合学习中的数据异质性:重症监护中急性肾损伤和脓毒症疾病风险预测的案例研究

Data heterogeneity in federated learning with Electronic Health Records: Case studies of risk prediction for acute kidney injury and sepsis diseases in critical care.

作者信息

Rajendran Suraj, Xu Zhenxing, Pan Weishen, Ghosh Arnab, Wang Fei

机构信息

Tri-Institutional Computational Biology & Medicine Program, Cornell University, New York, New York, United States of America.

Division of Health Informatics, Department of Population Health Sciences, Weill Cornell Medicine, New York, New York, United States of America.

出版信息

PLOS Digit Health. 2023 Mar 15;2(3):e0000117. doi: 10.1371/journal.pdig.0000117. eCollection 2023 Mar.

DOI:10.1371/journal.pdig.0000117
PMID:36920974
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10016691/
Abstract

With the wider availability of healthcare data such as Electronic Health Records (EHR), more and more data-driven based approaches have been proposed to improve the quality-of-care delivery. Predictive modeling, which aims at building computational models for predicting clinical risk, is a popular research topic in healthcare analytics. However, concerns about privacy of healthcare data may hinder the development of effective predictive models that are generalizable because this often requires rich diverse data from multiple clinical institutions. Recently, federated learning (FL) has demonstrated promise in addressing this concern. However, data heterogeneity from different local participating sites may affect prediction performance of federated models. Due to acute kidney injury (AKI) and sepsis' high prevalence among patients admitted to intensive care units (ICU), the early prediction of these conditions based on AI is an important topic in critical care medicine. In this study, we take AKI and sepsis onset risk prediction in ICU as two examples to explore the impact of data heterogeneity in the FL framework as well as compare performances across frameworks. We built predictive models based on local, pooled, and FL frameworks using EHR data across multiple hospitals. The local framework only used data from each site itself. The pooled framework combined data from all sites. In the FL framework, each local site did not have access to other sites' data. A model was updated locally, and its parameters were shared to a central aggregator, which was used to update the federated model's parameters and then subsequently, shared with each site. We found models built within a FL framework outperformed local counterparts. Then, we analyzed variable importance discrepancies across sites and frameworks. Finally, we explored potential sources of the heterogeneity within the EHR data. The different distributions of demographic profiles, medication use, and site information contributed to data heterogeneity.

摘要

随着电子健康记录(EHR)等医疗保健数据的更广泛可得性,越来越多基于数据驱动的方法被提出来以提高医疗服务质量。预测建模旨在构建用于预测临床风险的计算模型,是医疗保健分析中一个热门的研究课题。然而,对医疗保健数据隐私的担忧可能会阻碍有效且可推广的预测模型的开发,因为这通常需要来自多个临床机构的丰富多样的数据。最近,联邦学习(FL)在解决这一问题方面显示出了前景。然而,来自不同本地参与站点的数据异质性可能会影响联邦模型的预测性能。由于急性肾损伤(AKI)和脓毒症在重症监护病房(ICU)患者中具有较高的患病率,基于人工智能对这些病症进行早期预测是重症医学中的一个重要课题。在本研究中,我们以ICU中AKI和脓毒症发病风险预测为例,探讨FL框架中数据异质性的影响,并比较不同框架下的性能。我们使用多家医院的EHR数据,基于本地、合并和FL框架构建了预测模型。本地框架仅使用每个站点自身的数据。合并框架将所有站点的数据合并在一起。在FL框架中,每个本地站点无法访问其他站点的数据。模型在本地进行更新,其参数被共享给一个中央聚合器,该聚合器用于更新联邦模型的参数,随后再与每个站点共享。我们发现,在FL框架内构建的模型优于本地模型。然后,我们分析了不同站点和框架之间变量重要性的差异。最后,我们探索了EHR数据中异质性的潜在来源。人口统计学特征、用药情况和站点信息的不同分布导致了数据异质性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/b9740bfe26f9/pdig.0000117.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/04ffdc003dc9/pdig.0000117.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/a43c87156468/pdig.0000117.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/7a1f5cb81238/pdig.0000117.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/cab0c165c4b3/pdig.0000117.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/46d73e6d97e3/pdig.0000117.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/15dce10c371f/pdig.0000117.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/07041d02bdc9/pdig.0000117.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/dec05474e3fb/pdig.0000117.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/b9740bfe26f9/pdig.0000117.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/04ffdc003dc9/pdig.0000117.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/a43c87156468/pdig.0000117.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/7a1f5cb81238/pdig.0000117.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/cab0c165c4b3/pdig.0000117.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/46d73e6d97e3/pdig.0000117.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/15dce10c371f/pdig.0000117.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/07041d02bdc9/pdig.0000117.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/dec05474e3fb/pdig.0000117.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/69e1/10016691/b9740bfe26f9/pdig.0000117.g009.jpg

相似文献

1
Data heterogeneity in federated learning with Electronic Health Records: Case studies of risk prediction for acute kidney injury and sepsis diseases in critical care.电子健康记录联合学习中的数据异质性:重症监护中急性肾损伤和脓毒症疾病风险预测的案例研究
PLOS Digit Health. 2023 Mar 15;2(3):e0000117. doi: 10.1371/journal.pdig.0000117. eCollection 2023 Mar.
2
An adaptive federated learning framework for clinical risk prediction with electronic health records from multiple hospitals.一种基于多家医院电子健康记录的用于临床风险预测的自适应联邦学习框架。
Patterns (N Y). 2023 Dec 26;5(1):100898. doi: 10.1016/j.patter.2023.100898. eCollection 2024 Jan 12.
3
Federated Learning of Electronic Health Records Improves Mortality Prediction in Patients Hospitalized with COVID-19.电子健康记录的联邦学习改善了新冠肺炎住院患者的死亡率预测。
medRxiv. 2020 Aug 14:2020.08.11.20172809. doi: 10.1101/2020.08.11.20172809.
4
Federated Learning of Electronic Health Records to Improve Mortality Prediction in Hospitalized Patients With COVID-19: Machine Learning Approach.用于改善COVID-19住院患者死亡率预测的电子健康记录联邦学习:机器学习方法。
JMIR Med Inform. 2021 Jan 27;9(1):e24207. doi: 10.2196/24207.
5
Analyzing the Impact of Personalization on Fairness in Federated Learning for Healthcare.分析个性化对医疗保健联邦学习公平性的影响。
J Healthc Inform Res. 2024 Mar 23;8(2):181-205. doi: 10.1007/s41666-024-00164-7. eCollection 2024 Jun.
6
Federated Learning in Risk Prediction: A Primer and Application to COVID-19-Associated Acute Kidney Injury.联邦学习在风险预测中的应用:概述及在 COVID-19 相关急性肾损伤中的应用。
Nephron. 2023;147(1):52-56. doi: 10.1159/000525645. Epub 2022 Jul 14.
7
The FeatureCloud Platform for Federated Learning in Biomedicine: Unified Approach.FeatureCloud 平台在生物医学领域的联邦学习:统一方法。
J Med Internet Res. 2023 Jul 12;25:e42621. doi: 10.2196/42621.
8
Federated Learning for Healthcare Informatics.医疗信息学中的联邦学习
J Healthc Inform Res. 2021;5(1):1-19. doi: 10.1007/s41666-020-00082-4. Epub 2020 Nov 12.
9
A privacy-preserving platform oriented medical healthcare and its application in identifying patients with candidemia.面向医疗保健的隐私保护平台及其在鉴定念珠菌血症患者中的应用。
Sci Rep. 2024 Jul 6;14(1):15589. doi: 10.1038/s41598-024-66596-8.
10
Predicting treatment response in multicenter non-small cell lung cancer patients based on federated learning.基于联邦学习预测多中心非小细胞肺癌患者的治疗反应。
BMC Cancer. 2024 Jun 5;24(1):688. doi: 10.1186/s12885-024-12456-7.

引用本文的文献

1
A personalized federated learning approach to enhance joint modeling for heterogeneous medical institutions.一种用于增强异构医疗机构联合建模的个性化联邦学习方法。
Digit Health. 2025 Jul 29;11:20552076251360861. doi: 10.1177/20552076251360861. eCollection 2025 Jan-Dec.
2
Effective Non-IID Degree Estimation for Robust Federated Learning in Healthcare Datasets.医疗保健数据集中用于稳健联邦学习的有效非独立同分布度估计
J Healthc Inform Res. 2025 Mar 22;9(3):437-464. doi: 10.1007/s41666-025-00195-8. eCollection 2025 Sep.
3
Federated target trial emulation using distributed observational data for treatment effect estimation.

本文引用的文献

1
A New Look and Convergence Rate of Federated Multitask Learning With Laplacian Regularization.基于拉普拉斯正则化的联邦多任务学习的新视角与收敛速度
IEEE Trans Neural Netw Learn Syst. 2024 Jun;35(6):8075-8085. doi: 10.1109/TNNLS.2022.3224252. Epub 2024 Jun 3.
2
Generalizability challenges of mortality risk prediction models: A retrospective analysis on a multi-center database.死亡率风险预测模型的可推广性挑战:对多中心数据库的回顾性分析
PLOS Digit Health. 2022 Apr 5;1(4):e0000023. doi: 10.1371/journal.pdig.0000023. eCollection 2022 Apr.
3
A systematic review of federated learning applications for biomedical data.
使用分布式观测数据进行联合目标试验模拟以估计治疗效果
NPJ Digit Med. 2025 Jul 1;8(1):387. doi: 10.1038/s41746-025-01803-y.
4
The future of healthcare-associated infection surveillance: Automated surveillance and using the potential of artificial intelligence.医疗保健相关感染监测的未来:自动监测与人工智能潜力的利用
J Intern Med. 2025 Aug;298(2):54-77. doi: 10.1111/joim.20100. Epub 2025 Jun 5.
5
Federated Target Trial Emulation using Distributed Observational Data for Treatment Effect Estimation.使用分布式观察数据进行治疗效果估计的联合目标试验模拟
medRxiv. 2025 May 5:2025.05.02.25326905. doi: 10.1101/2025.05.02.25326905.
6
Federated learning with multi-cohort real-world data for predicting the progression from mild cognitive impairment to Alzheimer's disease.利用多队列真实世界数据进行联邦学习以预测从轻度认知障碍到阿尔茨海默病的进展
Alzheimers Dement. 2025 Apr;21(4):e70128. doi: 10.1002/alz.70128.
7
Federated Learning in Smart Healthcare: A Comprehensive Review on Privacy, Security, and Predictive Analytics with IoT Integration.智能医疗中的联邦学习:关于隐私、安全以及与物联网集成的预测分析的全面综述
Healthcare (Basel). 2024 Dec 22;12(24):2587. doi: 10.3390/healthcare12242587.
8
Federated systems for automated infection surveillance: a perspective.联邦系统用于自动化感染监测:一个视角。
Antimicrob Resist Infect Control. 2024 Sep 27;13(1):113. doi: 10.1186/s13756-024-01464-8.
9
Empowering precise advertising with Fed-GANCC: A novel federated learning approach leveraging Generative Adversarial Networks and group clustering.利用 Fed-GANCC 实现精准广告投放:一种利用生成对抗网络和群组聚类的新型联邦学习方法。
PLoS One. 2024 Apr 10;19(4):e0298261. doi: 10.1371/journal.pone.0298261. eCollection 2024.
10
An adaptive federated learning framework for clinical risk prediction with electronic health records from multiple hospitals.一种基于多家医院电子健康记录的用于临床风险预测的自适应联邦学习框架。
Patterns (N Y). 2023 Dec 26;5(1):100898. doi: 10.1016/j.patter.2023.100898. eCollection 2024 Jan 12.
生物医学数据联合学习应用的系统综述。
PLOS Digit Health. 2022 May 19;1(5):e0000033. doi: 10.1371/journal.pdig.0000033. eCollection 2022 May.
4
Machine learning models for predicting acute kidney injury: a systematic review and critical appraisal.用于预测急性肾损伤的机器学习模型:系统评价与批判性评估
Clin Kidney J. 2022 Aug 2;15(12):2266-2280. doi: 10.1093/ckj/sfac181. eCollection 2022 Dec.
5
Machine learning generalizability across healthcare settings: insights from multi-site COVID-19 screening.跨医疗环境的机器学习可推广性:来自多地点新冠病毒筛查的见解
NPJ Digit Med. 2022 Jun 7;5(1):69. doi: 10.1038/s41746-022-00614-9.
6
Clinically Interpretable Machine Learning Models for Early Prediction of Mortality in Older Patients with Multiple Organ Dysfunction Syndrome: An International Multicenter Retrospective Study.临床可解释的机器学习模型用于预测多器官功能障碍综合征老年患者的早期死亡率:一项国际多中心回顾性研究。
J Gerontol A Biol Sci Med Sci. 2023 Mar 30;78(4):718-726. doi: 10.1093/gerona/glac107.
7
Towards Personalized Federated Learning.迈向个性化联邦学习。
IEEE Trans Neural Netw Learn Syst. 2023 Dec;34(12):9587-9603. doi: 10.1109/TNNLS.2022.3160699. Epub 2023 Nov 30.
8
Machine learning prediction model of acute kidney injury after percutaneous coronary intervention.经皮冠状动脉介入治疗后急性肾损伤的机器学习预测模型。
Sci Rep. 2022 Jan 14;12(1):749. doi: 10.1038/s41598-021-04372-8.
9
Computational signatures for post-cardiac arrest trajectory prediction: Importance of early physiological time series.计算心脏停搏后轨迹预测的特征:早期生理时间序列的重要性。
Anaesth Crit Care Pain Med. 2022 Feb;41(1):101015. doi: 10.1016/j.accpm.2021.101015. Epub 2021 Dec 27.
10
Sepsis prediction, early detection, and identification using clinical text for machine learning: a systematic review.使用临床文本进行机器学习的脓毒症预测、早期检测和识别:系统评价。
J Am Med Inform Assoc. 2022 Jan 29;29(3):559-575. doi: 10.1093/jamia/ocab236.