• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

COLA-GLM:用于分散式观察性医疗数据的广义线性模型的协作式一次性无损算法

COLA-GLM: collaborative one-shot and lossless algorithms of generalized linear models for decentralized observational healthcare data.

作者信息

Wu Qiong, Reps Jenna M, Li Lu, Zhang Bingyu, Lu Yiwen, Tong Jiayi, Zhang Dazheng, Lumley Thomas, Brand Milou T, Van Zandt Mui, Falconer Thomas, He Xing, Huang Yu, Li Haoyang, Yan Chao, Tang Guojun, Williams Andrew E, Wang Fei, Bian Jiang, Malin Bradley, Hripcsak George, Schuemie Martijn J, Lu Yun, Drew Steve, Zhou Jiayu, Asch David A, Chen Yong

机构信息

Department of Biostatistics and Health Data Science, University of Pittsburgh, Pittsburgh, PA, USA.

Department of Biostatistics, Epidemiology, and Informatics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA.

出版信息

NPJ Digit Med. 2025 Jul 15;8(1):442. doi: 10.1038/s41746-025-01781-1.

DOI:10.1038/s41746-025-01781-1
PMID:40664765
Abstract

Clinical insights from real-world data often require aggregating information from institutions to ensure sufficient sample sizes and generalizability. However, patient privacy concerns only limit the sharing of patient-level data, and traditional federated learning algorithms, relying on extensive back-and-forth communications, can be inefficient to implement. We introduce the Collaborative One-shot Lossless Algorithm for Generalized Linear Models (COLA-GLM), a novel federated learning algorithm that supports diverse outcome types via generalized linear models and achieves results identical to a pooled patient-level data analysis (lossless) with only a single round of aggregated data exchange (one-shot). To further protect aggregated institutional data, we developed a secure extension, secure-COLA-GLM, utilizing homomorphic encryption. We demonstrated the effectiveness and lossless property of COLA-GLM through applications to an international influenza cohort and a decentralized U.S. COVID-19 mortality study. COLA-GLM and secure-COLA-GLM offer a scalable, efficient solution for decentralized collaborative learning involving multiple data partners and diverse security requirements.

摘要

来自真实世界数据的临床见解通常需要汇总来自各机构的信息,以确保足够的样本量和可推广性。然而,患者隐私问题仅限制了患者层面数据的共享,而传统的联邦学习算法依赖大量的来回通信,实施起来可能效率低下。我们引入了广义线性模型的协作一次性无损算法(COLA-GLM),这是一种新颖的联邦学习算法,它通过广义线性模型支持多种结果类型,并且仅通过一轮汇总数据交换(一次性)就能实现与汇总患者层面数据分析相同的结果(无损)。为了进一步保护汇总的机构数据,我们利用同态加密开发了一个安全扩展版本,即安全COLA-GLM。我们通过将其应用于一个国际流感队列和一项分散式美国新冠肺炎死亡率研究,证明了COLA-GLM的有效性和无损特性。COLA-GLM和安全COLA-GLM为涉及多个数据伙伴和不同安全要求的分散式协作学习提供了一种可扩展、高效的解决方案。

相似文献

1
COLA-GLM: collaborative one-shot and lossless algorithms of generalized linear models for decentralized observational healthcare data.COLA-GLM:用于分散式观察性医疗数据的广义线性模型的协作式一次性无损算法
NPJ Digit Med. 2025 Jul 15;8(1):442. doi: 10.1038/s41746-025-01781-1.
2
Unlocking efficiency in real-world collaborative studies: a multi-site international study with one-shot lossless GLMM algorithm.在实际协作研究中提高效率:一项采用一次性无损广义线性混合模型算法的多中心国际研究。
NPJ Digit Med. 2025 Jul 19;8(1):457. doi: 10.1038/s41746-025-01846-1.
3
Privacy-Preserving Glycemic Management in Type 1 Diabetes: Development and Validation of a Multiobjective Federated Reinforcement Learning Framework.1型糖尿病中保护隐私的血糖管理:多目标联邦强化学习框架的开发与验证
JMIR Diabetes. 2025 Jul 4;10:e72874. doi: 10.2196/72874.
4
Swarm learning network for privacy-preserving and collaborative deep learning assisted diagnosis of fracture: a multi-center diagnostic study.用于骨折隐私保护与协作深度学习辅助诊断的群体学习网络:一项多中心诊断研究
Front Med (Lausanne). 2025 Jul 3;12:1534117. doi: 10.3389/fmed.2025.1534117. eCollection 2025.
5
Communication-efficient federated learning of temporal effects on opioid use disorder with data from distributed research networks.利用分布式研究网络的数据进行通信高效的阿片类药物使用障碍时间效应联合学习。
J Am Med Inform Assoc. 2025 Apr 1;32(4):656-664. doi: 10.1093/jamia/ocae313.
6
Fusion of Personalized Federated Learning (PFL) with Differential Privacy (DP) Learning for Diagnosis of Arrhythmia Disease.个性化联邦学习(PFL)与差分隐私(DP)学习相结合用于心律失常疾病诊断
PLoS One. 2025 Jul 11;20(7):e0327108. doi: 10.1371/journal.pone.0327108. eCollection 2025.
7
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
8
Accreditation through the eyes of nurse managers: an infinite staircase or a phenomenon that evaporates like water.护士长眼中的认证:是无尽的阶梯还是如流水般消逝的现象。
J Health Organ Manag. 2025 Jun 30. doi: 10.1108/JHOM-01-2025-0029.
9
Fused federated learning framework for secure and decentralized patient monitoring in healthcare 5.0 using IoMT.用于医疗保健5.0中使用物联网进行安全且分散的患者监测的融合联邦学习框架
Sci Rep. 2025 Jul 7;15(1):24263. doi: 10.1038/s41598-025-06574-w.
10
Measures implemented in the school setting to contain the COVID-19 pandemic.学校为控制 COVID-19 疫情而采取的措施。
Cochrane Database Syst Rev. 2022 Jan 17;1(1):CD015029. doi: 10.1002/14651858.CD015029.

本文引用的文献

1
Centralized and Federated Models for the Analysis of Clinical Data.集中式和联邦式临床数据分析模型。
Annu Rev Biomed Data Sci. 2024 Aug;7(1):179-199. doi: 10.1146/annurev-biodatasci-122220-115746. Epub 2024 Jul 24.
2
Researching COVID to Enhance Recovery (RECOVER) adult study protocol: Rationale, objectives, and design.COVID 研究促进康复(RECOVER)成人研究方案:原理、目标和设计。
PLoS One. 2023 Jun 23;18(6):e0286297. doi: 10.1371/journal.pone.0286297. eCollection 2023.
3
Managing re-identification risks while providing access to the All of Us research program.
在提供对“所有人”研究计划访问权限的同时,管理重新识别风险。
J Am Med Inform Assoc. 2023 Apr 19;30(5):907-914. doi: 10.1093/jamia/ocad021.
4
Distributed Quasi-Poisson regression algorithm for modeling multi-site count outcomes in distributed data networks.分布式准泊松回归算法在分布式数据网络中对多点计数结果进行建模。
J Biomed Inform. 2022 Jul;131:104097. doi: 10.1016/j.jbi.2022.104097. Epub 2022 May 25.
5
ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.ODACH:一种用于异质多中心 Cox 模型的单步分布式算法。
Sci Rep. 2022 Apr 22;12(1):6627. doi: 10.1038/s41598-022-09069-0.
6
DLMM as a lossless one-shot algorithm for collaborative multi-site distributed linear mixed models.作为一种无损的一次性算法,DLMM 适用于协作式多站点分布式线性混合模型。
Nat Commun. 2022 Mar 30;13(1):1678. doi: 10.1038/s41467-022-29160-4.
7
Seek COVER: using a disease proxy to rapidly develop and validate a personalized risk calculator for COVID-19 outcomes in an international network.寻找替代指标:利用疾病替代指标在国际网络中快速开发和验证针对 COVID-19 结局的个体化风险计算器。
BMC Med Res Methodol. 2022 Jan 30;22(1):35. doi: 10.1186/s12874-022-01505-z.
8
An efficient and accurate distributed learning algorithm for modeling multi-site zero-inflated count outcomes.一种高效准确的分布式学习算法,用于对多站点零膨胀计数结果进行建模。
Sci Rep. 2021 Oct 4;11(1):19647. doi: 10.1038/s41598-021-99078-2.
9
Male gender is a predictor of higher mortality in hospitalized adults with COVID-19.男性性别是住院 COVID-19 成年患者死亡率更高的预测因素。
PLoS One. 2021 Jul 9;16(7):e0254066. doi: 10.1371/journal.pone.0254066. eCollection 2021.
10
How do we share data in COVID-19 research? A systematic review of COVID-19 datasets in PubMed Central Articles.我们如何在 COVID-19 研究中共享数据?对 PubMed Central 文章中 COVID-19 数据集的系统评价。
Brief Bioinform. 2021 Mar 22;22(2):800-811. doi: 10.1093/bib/bbaa331.