• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于异构临床数据的分布式学习及其在整合230个地点的新冠肺炎数据中的应用

Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites.

作者信息

Tong Jiayi, Luo Chongliang, Islam Md Nazmul, Sheils Natalie E, Buresh John, Edmondson Mackenzie, Merkel Peter A, Lautenbach Ebbing, Duan Rui, Chen Yong

机构信息

Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA.

Division of Public Health Sciences, Department of Surgery, Washington University in St. Louis, St. Louis, MO, USA.

出版信息

NPJ Digit Med. 2022 Jun 14;5(1):76. doi: 10.1038/s41746-022-00615-8.

DOI:10.1038/s41746-022-00615-8
PMID:35701668
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9198031/
Abstract

Integrating real-world data (RWD) from several clinical sites offers great opportunities to improve estimation with a more general population compared to analyses based on a single clinical site. However, sharing patient-level data across sites is practically challenging due to concerns about maintaining patient privacy. We develop a distributed algorithm to integrate heterogeneous RWD from multiple clinical sites without sharing patient-level data. The proposed distributed conditional logistic regression (dCLR) algorithm can effectively account for between-site heterogeneity and requires only one round of communication. Our simulation study and data application with the data of 14,215 COVID-19 patients from 230 clinical sites in the UnitedHealth Group Clinical Research Database demonstrate that the proposed distributed algorithm provides an estimator that is robust to heterogeneity in event rates when efficiently integrating data from multiple clinical sites. Our algorithm is therefore a practical alternative to both meta-analysis and existing distributed algorithms for modeling heterogeneous multi-site binary outcomes.

摘要

与基于单个临床站点的分析相比,整合来自多个临床站点的真实世界数据(RWD)为使用更具普遍性的人群改进估计提供了巨大机会。然而,由于担心维护患者隐私,跨站点共享患者层面的数据在实际操作中具有挑战性。我们开发了一种分布式算法,无需共享患者层面的数据即可整合来自多个临床站点的异构RWD。所提出的分布式条件逻辑回归(dCLR)算法可以有效考虑站点间的异质性,并且只需要一轮通信。我们的模拟研究以及对联合健康集团临床研究数据库中来自230个临床站点的14215名COVID-19患者数据的应用表明,所提出的分布式算法在有效整合来自多个临床站点的数据时,能提供一个对事件发生率异质性具有鲁棒性的估计器。因此,我们的算法是荟萃分析和用于对异构多站点二元结局进行建模的现有分布式算法的一种实用替代方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/652d4f5e64c8/41746_2022_615_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/f50a5701e3f6/41746_2022_615_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/57087741c27d/41746_2022_615_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/92e0c9fc8860/41746_2022_615_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/89f07d55ac6f/41746_2022_615_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/652d4f5e64c8/41746_2022_615_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/f50a5701e3f6/41746_2022_615_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/57087741c27d/41746_2022_615_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/92e0c9fc8860/41746_2022_615_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/89f07d55ac6f/41746_2022_615_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f955/9198031/652d4f5e64c8/41746_2022_615_Fig5_HTML.jpg

相似文献

1
Distributed learning for heterogeneous clinical data with application to integrating COVID-19 data across 230 sites.用于异构临床数据的分布式学习及其在整合230个地点的新冠肺炎数据中的应用
NPJ Digit Med. 2022 Jun 14;5(1):76. doi: 10.1038/s41746-022-00615-8.
2
Distributed Quasi-Poisson regression algorithm for modeling multi-site count outcomes in distributed data networks.分布式准泊松回归算法在分布式数据网络中对多点计数结果进行建模。
J Biomed Inform. 2022 Jul;131:104097. doi: 10.1016/j.jbi.2022.104097. Epub 2022 May 25.
3
DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models.作为一种无损的一次性算法,DLMM 适用于协作式多站点分布式线性混合模型。
Nat Commun. 2022 Mar 30;13(1):1678. doi: 10.1038/s41467-022-29160-4.
4
Learning from local to global: An efficient distributed algorithm for modeling time-to-event data.从局部到全局学习:一种用于建模事件时间数据的高效分布式算法。
J Am Med Inform Assoc. 2020 Jul 1;27(7):1028-1036. doi: 10.1093/jamia/ocaa044.
5
One-shot distributed algorithms for addressing heterogeneity in competing risks data across clinical sites.单步分布式算法用于解决临床站点间竞争风险数据的异质性。
J Biomed Inform. 2024 Feb;150:104595. doi: 10.1016/j.jbi.2024.104595. Epub 2024 Jan 18.
6
Robust-ODAL: Learning from heterogeneous health systems without sharing patient-level data.鲁棒性 ODAL:在不共享患者级数据的情况下从异构健康系统中学习。
Pac Symp Biocomput. 2020;25:695-706.
7
An efficient and accurate distributed learning algorithm for modeling multi-site zero-inflated count outcomes.一种高效准确的分布式学习算法,用于对多站点零膨胀计数结果进行建模。
Sci Rep. 2021 Oct 4;11(1):19647. doi: 10.1038/s41598-021-99078-2.
8
Multisite learning of high-dimensional heterogeneous data with applications to opioid use disorder study of 15,000 patients across 5 clinical sites.多站点高维异质数据的联合学习及其在 5 个临床站点的 15000 名患者阿片类药物使用障碍研究中的应用。
Sci Rep. 2022 Jun 30;12(1):11073. doi: 10.1038/s41598-022-14029-9.
9
Learning from vertically distributed data across multiple sites: An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection.从多个站点的垂直分布数据中学习:一种用于具有变量选择的Cox比例风险模型的高效隐私保护算法。
J Biomed Inform. 2024 Jan;149:104581. doi: 10.1016/j.jbi.2023.104581. Epub 2023 Dec 23.
10
ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.ODACH:一种用于异质多中心 Cox 模型的单步分布式算法。
Sci Rep. 2022 Apr 22;12(1):6627. doi: 10.1038/s41598-022-09069-0.

引用本文的文献

1
Recent methodological advances in federated learning for healthcare.医疗保健领域联邦学习的最新方法进展。
Patterns (N Y). 2024 Jun 14;5(6):101006. doi: 10.1016/j.patter.2024.101006.
2
Centralized and Federated Models for the Analysis of Clinical Data.集中式和联邦式临床数据分析模型。
Annu Rev Biomed Data Sci. 2024 Aug;7(1):179-199. doi: 10.1146/annurev-biodatasci-122220-115746. Epub 2024 Jul 24.
3
Learning competing risks across multiple hospitals: one-shot distributed algorithms.在多家医院学习竞争风险:单次分布式算法。

本文引用的文献

1
Distributed Quasi-Poisson regression algorithm for modeling multi-site count outcomes in distributed data networks.分布式准泊松回归算法在分布式数据网络中对多点计数结果进行建模。
J Biomed Inform. 2022 Jul;131:104097. doi: 10.1016/j.jbi.2022.104097. Epub 2022 May 25.
2
dPQL: a lossless distributed algorithm for generalized linear mixed model with application to privacy-preserving hospital profiling.dPQL:一种用于广义线性混合模型的无损分布式算法及其在隐私保护医院分析中的应用。
J Am Med Inform Assoc. 2022 Jul 12;29(8):1366-1371. doi: 10.1093/jamia/ocac067.
3
ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.
J Am Med Inform Assoc. 2024 Apr 19;31(5):1102-1112. doi: 10.1093/jamia/ocae027.
4
Establishment of an International Evidence Sharing Network Through Common Data Model for Cardiovascular Research.通过通用数据模型建立心血管研究国际证据共享网络。
Korean Circ J. 2022 Dec;52(12):853-864. doi: 10.4070/kcj.2022.0294.
5
Machine learning approaches for electronic health records phenotyping: a methodical review.基于机器学习的电子健康记录表型分析方法:系统评价
J Am Med Inform Assoc. 2023 Jan 18;30(2):367-381. doi: 10.1093/jamia/ocac216.
6
Multisite learning of high-dimensional heterogeneous data with applications to opioid use disorder study of 15,000 patients across 5 clinical sites.多站点高维异质数据的联合学习及其在 5 个临床站点的 15000 名患者阿片类药物使用障碍研究中的应用。
Sci Rep. 2022 Jun 30;12(1):11073. doi: 10.1038/s41598-022-14029-9.
ODACH:一种用于异质多中心 Cox 模型的单步分布式算法。
Sci Rep. 2022 Apr 22;12(1):6627. doi: 10.1038/s41598-022-09069-0.
4
DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models.作为一种无损的一次性算法,DLMM 适用于协作式多站点分布式线性混合模型。
Nat Commun. 2022 Mar 30;13(1):1678. doi: 10.1038/s41467-022-29160-4.
5
An efficient and accurate distributed learning algorithm for modeling multi-site zero-inflated count outcomes.一种高效准确的分布式学习算法,用于对多站点零膨胀计数结果进行建模。
Sci Rep. 2021 Oct 4;11(1):19647. doi: 10.1038/s41598-021-99078-2.
6
Variation in US Hospital Mortality Rates for Patients Admitted With COVID-19 During the First 6 Months of the Pandemic.美国在大流行的前 6 个月中,因 COVID-19 入院的患者的医院死亡率存在差异。
JAMA Intern Med. 2021 Apr 1;181(4):471-478. doi: 10.1001/jamainternmed.2020.8193.
7
Multiple imputation for analysis of incomplete data in distributed health data networks.分布式健康数据网络中不完全数据的多重插补分析。
Nat Commun. 2020 Oct 29;11(1):5467. doi: 10.1038/s41467-020-19270-2.
8
International electronic health record-derived COVID-19 clinical course profiles: the 4CE consortium.国际电子健康记录衍生的COVID-19临床病程概况:4CE联盟
NPJ Digit Med. 2020 Aug 19;3:109. doi: 10.1038/s41746-020-00308-0. eCollection 2020.
9
Learning from local to global: An efficient distributed algorithm for modeling time-to-event data.从局部到全局学习:一种用于建模事件时间数据的高效分布式算法。
J Am Med Inform Assoc. 2020 Jul 1;27(7):1028-1036. doi: 10.1093/jamia/ocaa044.
10
Ideas for how informaticians can get involved with COVID-19 research.关于信息学家如何参与COVID-19研究的想法。
BioData Min. 2020 May 12;13:3. doi: 10.1186/s13040-020-00213-y. eCollection 2020.