Development of time to event prediction models using federated learning.

Author information

Kragh Jørgensen Rasmus Rask, Jensen Jonas Faartoft, El-Galaly Tarec, Bøgsted Martin, Brøndum Rasmus Froberg, Simonsen Mikkel Runason, Jakobsen Lasse Hjort

Affiliations

Department of Hematology, Clinical Cancer Research Center, Aalborg University Hospital, Aalborg, Denmark.

Center for Clinical Data Science, Aalborg University and Aalborg University Hospital, Aalborg, Denmark.

Publication information

BMC Med Res Methodol. 2025 May 26;25(1):143. doi: 10.1186/s12874-025-02598-y.

DOI:10.1186/s12874-025-02598-y
PMID:40419965
Abstract

BACKGROUND

In a wide range of diseases, multiple data sources must be combined to obtain enough data for model training. However, centrally pooling multiple data sources while protecting each patient's sensitive data can require a cumbersome process involving many institutional bodies. Alternatively, federated learning (FL) can be used to train models on data that remain located at multiple sites.

METHOD

We propose two methods for training time-to-event prediction models on distributed data using FL algorithms. Both approaches incorporate steps that allow prediction of individual-level survival curves without exposing individual-level event times. For Cox proportional hazards models, this is accomplished by using a kernel smoother for the baseline hazard function. The other proposed method is based on general parametric likelihood theory for right-censored data. We compared the two methods in four simulations and on one real-world dataset, predicting survival probability in patients with Hodgkin lymphoma (HL).
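
To make the parametric idea concrete, the following is a minimal sketch, not the authors' algorithm (the abstract does not specify one). It assumes a Weibull model without covariates, with survival function S(t) = exp(-rate * t^shape), and relies on the fact that the right-censored log-likelihood is a sum over patients, so each site only needs to return aggregate sums for the current parameter value. The function names, the bisection search, and the toy data are all illustrative assumptions.

```python
import math

def site_summary(times, events, shape):
    """Runs locally at one site; returns aggregate sums only, no individual times."""
    d = sum(events)  # number of observed events
    sum_log_t_events = sum(math.log(t) for t, e in zip(times, events) if e)
    sum_t_k = sum(t ** shape for t in times)
    sum_t_k_log_t = sum((t ** shape) * math.log(t) for t in times)
    return d, sum_log_t_events, sum_t_k, sum_t_k_log_t

def aggregate(sites, shape):
    """Server side: add up the per-site summaries for the current shape value."""
    totals = [0.0, 0.0, 0.0, 0.0]
    for times, events in sites:
        for i, value in enumerate(site_summary(times, events, shape)):
            totals[i] += value
    return totals

def profile_score(sites, shape):
    """Derivative of the profile log-likelihood with respect to the shape."""
    d, s_log, s_tk, s_tk_log = aggregate(sites, shape)
    return d / shape + s_log - d * s_tk_log / s_tk

def fit_weibull_federated(sites, lo=1e-3, hi=50.0, iters=100):
    """Find the shape by bisection (the profile score decreases in the shape),
    then recover the rate in closed form from the aggregated sums."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if profile_score(sites, mid) > 0:
            lo = mid
        else:
            hi = mid
    shape = 0.5 * (lo + hi)
    d, _, s_tk, _ = aggregate(sites, shape)
    return shape, d / s_tk

def survival(t, shape, rate):
    """Predicted survival probability at time t under the fitted model."""
    return math.exp(-rate * t ** shape)

# Hypothetical toy data: (follow-up times, event indicators) per site,
# where 1 marks an observed event and 0 a right-censored observation.
sites = [
    ([2.1, 3.4, 0.8, 5.0, 4.2], [1, 1, 1, 0, 1]),
    ([1.5, 6.0, 2.7, 3.9], [1, 0, 1, 1]),
    ([0.9, 4.8, 2.2, 6.0, 3.1, 1.7], [1, 1, 0, 0, 1, 1]),
]
shape, rate = fit_weibull_federated(sites)
print(f"shape={shape:.3f}, rate={rate:.4f}, S(3)={survival(3.0, shape, rate):.3f}")
```

Because the sums are additive, the federated fit in this sketch should agree with a fit on the pooled data up to floating-point error. Whether sharing such per-iteration aggregates is acceptable from a privacy standpoint depends on the setting and is not addressed here.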

RESULTS

The simulations demonstrated that the FL models performed similarly to the non-distributed case in all four experiments, with only slight deviations in predicted survival probabilities compared to the true model. Our findings were similar in the real-world advanced-stage HL example where the FL models were compared to their non-distributed versions, revealing only small deviations in performance.

CONCLUSION

The proposed procedures enable training of time-to-event models using data distributed across sites, without direct sharing of individual-level data and event times, while retaining predictive performance on par with non-distributed approaches.
