• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

SurvMaximin:稳健的联邦式方法,用于传输生存风险预测模型。

SurvMaximin: Robust federated approach to transporting survival risk prediction models.

机构信息

Department of Biostatistics, Harvard T. H. Chan School of Public Health, Boston, MA, USA.

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

出版信息

J Biomed Inform. 2022 Oct;134:104176. doi: 10.1016/j.jbi.2022.104176. Epub 2022 Aug 23.

DOI:10.1016/j.jbi.2022.104176
PMID:36007785
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9707637/
Abstract

OBJECTIVE

For multi-center heterogeneous Real-World Data (RWD) with time-to-event outcomes and high-dimensional features, we propose the SurvMaximin algorithm to estimate Cox model feature coefficients for a target population by borrowing summary information from a set of health care centers without sharing patient-level information.

MATERIALS AND METHODS

For each of the centers from which we want to borrow information to improve the prediction performance for the target population, a penalized Cox model is fitted to estimate feature coefficients for the center. Using estimated feature coefficients and the covariance matrix of the target population, we then obtain a SurvMaximin estimated set of feature coefficients for the target population. The target population can be an entire cohort comprised of all centers, corresponding to federated learning, or a single center, corresponding to transfer learning.

RESULTS

Simulation studies and a real-world international electronic health records application study, with 15 participating health care centers across three countries (France, Germany, and the U.S.), show that the proposed SurvMaximin algorithm achieves comparable or higher accuracy compared with the estimator using only the information of the target site and other existing methods. The SurvMaximin estimator is robust to variations in sample sizes and estimated feature coefficients between centers, which amounts to significantly improved estimates for target sites with fewer observations.

CONCLUSIONS

The SurvMaximin method is well suited for both federated and transfer learning in the high-dimensional survival analysis setting. SurvMaximin only requires a one-time summary information exchange from participating centers. Estimated regression vectors can be very heterogeneous. SurvMaximin provides robust Cox feature coefficient estimates without outcome information in the target population and is privacy-preserving.

摘要

目的

对于具有时间事件结局和高维特征的多中心异质真实世界数据(RWD),我们提出了 SurvMaximin 算法,通过从一组医疗中心获取汇总信息,在不共享患者级信息的情况下,为目标人群估计 Cox 模型特征系数。

材料和方法

对于我们希望从中获取信息以提高目标人群预测性能的每个中心,我们拟合了一个惩罚 Cox 模型来估计中心的特征系数。使用估计的特征系数和目标人群的协方差矩阵,我们然后为目标人群获得 SurvMaximin 估计的特征系数集。目标人群可以是由所有中心组成的整个队列,对应于联邦学习,也可以是单个中心,对应于迁移学习。

结果

模拟研究和真实的国际电子健康记录应用研究表明,与仅使用目标站点信息和其他现有方法的估计器相比,所提出的 SurvMaximin 算法在准确性方面具有可比性或更高的准确性。SurvMaximin 估计器对中心之间的样本量和估计特征系数的变化具有鲁棒性,这对于观察次数较少的目标站点来说意味着显著改善了估计。

结论

SurvMaximin 方法非常适合高维生存分析环境中的联邦学习和迁移学习。SurvMaximin 仅需要参与中心进行一次性的汇总信息交换。估计的回归向量可能非常异构。SurvMaximin 提供了稳健的 Cox 特征系数估计,而无需目标人群中的结局信息,并且具有隐私保护。

相似文献

1
SurvMaximin: Robust federated approach to transporting survival risk prediction models.SurvMaximin:稳健的联邦式方法,用于传输生存风险预测模型。
J Biomed Inform. 2022 Oct;134:104176. doi: 10.1016/j.jbi.2022.104176. Epub 2022 Aug 23.
2
Learning from vertically distributed data across multiple sites: An efficient privacy-preserving algorithm for Cox proportional hazards model with variable selection.从多个站点的垂直分布数据中学习:一种用于具有变量选择的Cox比例风险模型的高效隐私保护算法。
J Biomed Inform. 2024 Jan;149:104581. doi: 10.1016/j.jbi.2023.104581. Epub 2023 Dec 23.
3
Learning from local to global: An efficient distributed algorithm for modeling time-to-event data.从局部到全局学习:一种用于建模事件时间数据的高效分布式算法。
J Am Med Inform Assoc. 2020 Jul 1;27(7):1028-1036. doi: 10.1093/jamia/ocaa044.
4
Combining Federated Machine Learning and Qualitative Methods to Investigate Novel Pediatric Asthma Subtypes: Protocol for a Mixed Methods Study.联合联邦机器学习和定性方法研究新型儿科哮喘亚型:混合方法研究方案。
JMIR Res Protoc. 2024 Jul 8;13:e57981. doi: 10.2196/57981.
5
Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Algorithm Development and Validation.用于跨机构事件发生时间分析的隐私保护联合生存支持向量机:算法开发与验证
JMIR AI. 2024 Mar 29;3:e47652. doi: 10.2196/47652.
6
ODACH: a one-shot distributed algorithm for Cox model with heterogeneous multi-center data.ODACH:一种用于异质多中心 Cox 模型的单步分布式算法。
Sci Rep. 2022 Apr 22;12(1):6627. doi: 10.1038/s41598-022-09069-0.
7
A privacy-preserving and computation-efficient federated algorithm for generalized linear mixed models to analyze correlated electronic health records data.一种用于分析相关电子健康记录数据的广义线性混合模型的隐私保护和计算高效的联邦算法。
PLoS One. 2023 Jan 17;18(1):e0280192. doi: 10.1371/journal.pone.0280192. eCollection 2023.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results.使用隐私保护联邦学习和域适应的多站点功能磁共振成像分析:ABIDE研究结果
Med Image Anal. 2020 Oct;65:101765. doi: 10.1016/j.media.2020.101765. Epub 2020 Jul 2.
10
The FeatureCloud Platform for Federated Learning in Biomedicine: Unified Approach.FeatureCloud 平台在生物医学领域的联邦学习:统一方法。
J Med Internet Res. 2023 Jul 12;25:e42621. doi: 10.2196/42621.

引用本文的文献

1
Bridging Data Gaps in Healthcare: A Scoping Review of Transfer Learning in Structured Data Analysis.弥合医疗保健领域的数据差距:结构化数据分析中迁移学习的范围综述
Health Data Sci. 2025 Sep 3;5:0321. doi: 10.34133/hds.0321. eCollection 2025.
2
FedECA: federated external control arms for causal inference with time-to-event data in distributed settings.FedECA:用于在分布式环境中对具有事件发生时间数据进行因果推断的联邦外部对照臂。
Nat Commun. 2025 Aug 13;16(1):7496. doi: 10.1038/s41467-025-62525-z.
3
Recent methodological advances in federated learning for healthcare.医疗保健领域联邦学习的最新方法进展。
Patterns (N Y). 2024 Jun 14;5(6):101006. doi: 10.1016/j.patter.2024.101006.
4
Learning across diverse biomedical data modalities and cohorts: Challenges and opportunities for innovation.跨多种生物医学数据模式和队列的学习:创新面临的挑战与机遇
Patterns (N Y). 2024 Jan 17;5(2):100913. doi: 10.1016/j.patter.2023.100913. eCollection 2024 Feb 9.
5
Federated and distributed learning applications for electronic health records and structured medical data: a scoping review.联邦学习和分布式学习在电子健康记录和结构化医疗数据中的应用:范围综述。
J Am Med Inform Assoc. 2023 Nov 17;30(12):2041-2049. doi: 10.1093/jamia/ocad170.
6
Federated Learning in Health care Using Structured Medical Data.利用结构化医疗数据进行医疗保健中的联邦学习。
Adv Kidney Dis Health. 2023 Jan;30(1):4-16. doi: 10.1053/j.akdh.2022.11.007.
7
Hospitalizations Associated With Mental Health Conditions Among Adolescents in the US and France During the COVID-19 Pandemic.美国和法国青少年在 COVID-19 大流行期间因心理健康问题住院的情况。
JAMA Netw Open. 2022 Dec 1;5(12):e2246548. doi: 10.1001/jamanetworkopen.2022.46548.
8
COMMUTE: Communication-efficient transfer learning for multi-site risk prediction.通勤:面向多站点风险预测的通信高效迁移学习。
J Biomed Inform. 2023 Jan;137:104243. doi: 10.1016/j.jbi.2022.104243. Epub 2022 Nov 18.

本文引用的文献

1
Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data.高维异构数据的个体数据保护整合回归分析
J Am Stat Assoc. 2022;117(540):2105-2119. doi: 10.1080/01621459.2021.1904958. Epub 2021 May 19.
2
Transfer Learning for High-Dimensional Linear Regression: Prediction, Estimation and Minimax Optimality.高维线性回归的迁移学习:预测、估计与极小极大最优性
J R Stat Soc Series B Stat Methodol. 2022 Feb;84(1):149-173. doi: 10.1111/rssb.12479. Epub 2021 Nov 16.
3
International Changes in COVID-19 Clinical Trajectories Across 315 Hospitals and 6 Countries: Retrospective Cohort Study.国际 315 家医院 6 个国家 COVID-19 临床转归变化:回顾性队列研究。
J Med Internet Res. 2021 Oct 11;23(10):e31400. doi: 10.2196/31400.
4
Survival Analysis with Electronic Health Record Data: Experiments with Chronic Kidney Disease.利用电子健康记录数据进行生存分析:慢性肾脏病实验
Stat Anal Data Min. 2014 Oct;7(5):385-403. doi: 10.1002/sam.11236. Epub 2014 Aug 19.
5
Adoption rates of electronic health records in Turkish Hospitals and the relation with hospital sizes.土耳其医院电子病历的采用率及其与医院规模的关系。
BMC Health Serv Res. 2020 Oct 21;20(1):967. doi: 10.1186/s12913-020-05767-5.
6
International electronic health record-derived COVID-19 clinical course profiles: the 4CE consortium.国际电子健康记录衍生的COVID-19临床病程概况:4CE联盟
NPJ Digit Med. 2020 Aug 19;3:109. doi: 10.1038/s41746-020-00308-0. eCollection 2020.
7
Learning from local to global: An efficient distributed algorithm for modeling time-to-event data.从局部到全局学习:一种用于建模事件时间数据的高效分布式算法。
J Am Med Inform Assoc. 2020 Jul 1;27(7):1028-1036. doi: 10.1093/jamia/ocaa044.
8
Association of Patient Characteristics and Tumor Genomics With Clinical Outcomes Among Patients With Non-Small Cell Lung Cancer Using a Clinicogenomic Database.基于临床基因组数据库的非小细胞肺癌患者的患者特征和肿瘤基因组与临床结局的相关性分析。
JAMA. 2019 Apr 9;321(14):1391-1399. doi: 10.1001/jama.2019.3241.
9
Maximin Projection Learning for Optimal Treatment Decision with Heterogeneous Individualized Treatment Effects.用于具有异质个体化治疗效果的最优治疗决策的极大极小投影学习
J R Stat Soc Series B Stat Methodol. 2018 Sep;80(4):681-702. doi: 10.1111/rssb.12273. Epub 2018 May 10.
10
Electronic Health Record Portal Adoption: a cross country analysis.电子健康记录门户的采用:一项跨国分析。
BMC Med Inform Decis Mak. 2017 Jul 5;17(1):97. doi: 10.1186/s12911-017-0482-9.