• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

2 万多名肺癌患者的分布式学习 - 个人健康训练。

Distributed learning on 20 000+ lung cancer patients - The Personal Health Train.

机构信息

Department of Radiation Oncology (MAASTRO), GROW - School for Oncology and Developmental Biology, Maastricht University Medical Centre+, The Netherlands; The D-Lab: Dpt of Precision Medicine, GROW - School for Oncology and Developmental Biology, Maastricht University Medical Centre+, The Netherlands.

Department of Radiation Oncology (MAASTRO), GROW - School for Oncology and Developmental Biology, Maastricht University Medical Centre+, The Netherlands; Department of Radiation Oncology, Radboud University Medical Center, Nijmegen, The Netherlands.

出版信息

Radiother Oncol. 2020 Mar;144:189-200. doi: 10.1016/j.radonc.2019.11.019. Epub 2020 Jan 3.

DOI:10.1016/j.radonc.2019.11.019
PMID:31911366
Abstract

BACKGROUND AND PURPOSE

Access to healthcare data is indispensable for scientific progress and innovation. Sharing healthcare data is time-consuming and notoriously difficult due to privacy and regulatory concerns. The Personal Health Train (PHT) provides a privacy-by-design infrastructure connecting FAIR (Findable, Accessible, Interoperable, Reusable) data sources and allows distributed data analysis and machine learning. Patient data never leaves a healthcare institute.

MATERIALS AND METHODS

Lung cancer patient-specific databases (tumor staging and post-treatment survival information) of oncology departments were translated according to a FAIR data model and stored locally in a graph database. Software was installed locally to enable deployment of distributed machine learning algorithms via a central server. Algorithms (MATLAB, code and documentation publicly available) are patient privacy-preserving as only summary statistics and regression coefficients are exchanged with the central server. A logistic regression model to predict post-treatment two-year survival was trained and evaluated by receiver operating characteristic curves (ROC), root mean square prediction error (RMSE) and calibration plots.

RESULTS

In 4 months, we connected databases with 23 203 patient cases across 8 healthcare institutes in 5 countries (Amsterdam, Cardiff, Maastricht, Manchester, Nijmegen, Rome, Rotterdam, Shanghai) using the PHT. Summary statistics were computed across databases. A distributed logistic regression model predicting post-treatment two-year survival was trained on 14 810 patients treated between 1978 and 2011 and validated on 8 393 patients treated between 2012 and 2015.

CONCLUSION

The PHT infrastructure demonstrably overcomes patient privacy barriers to healthcare data sharing and enables fast data analyses across multiple institutes from different countries with different regulatory regimens. This infrastructure promotes global evidence-based medicine while prioritizing patient privacy.

摘要

背景与目的

获取医疗保健数据对于科学进步和创新至关重要。由于隐私和监管方面的考虑,共享医疗保健数据既耗时又困难。个人健康列车(PHT)提供了一个隐私设计的基础设施,连接了 FAIR(可查找、可访问、可互操作、可重用)数据源,并允许分布式数据分析和机器学习。患者数据从未离开过医疗机构。

材料与方法

根据 FAIR 数据模型对肿瘤学部门的肺癌患者特定数据库(肿瘤分期和治疗后生存信息)进行翻译,并在本地存储在图形数据库中。在本地安装软件,以便通过中央服务器部署分布式机器学习算法。算法(MATLAB,代码和文档均可公开获得)对患者隐私具有保护作用,因为仅与中央服务器交换汇总统计信息和回归系数。通过接收者操作特征曲线(ROC)、均方根预测误差(RMSE)和校准图来训练和评估用于预测治疗后两年生存的逻辑回归模型。

结果

在 4 个月的时间里,我们使用 PHT 连接了 8 个医疗机构的 23203 名患者的数据库,这些医疗机构分布在 5 个国家(阿姆斯特丹、卡迪夫、马斯特里赫特、曼彻斯特、奈梅亨、罗马、鹿特丹、上海)。在数据库之间计算汇总统计信息。在 14810 名 1978 年至 2011 年期间治疗的患者和 8393 名 2012 年至 2015 年期间治疗的患者上训练了用于预测治疗后两年生存的分布式逻辑回归模型,并对其进行了验证。

结论

PHT 基础设施明显克服了患者隐私障碍,实现了医疗保健数据的共享,并能够在来自不同国家和具有不同监管方案的多个机构之间进行快速数据分析。该基础设施在优先考虑患者隐私的同时,促进了全球循证医学。

相似文献

1
Distributed learning on 20 000+ lung cancer patients - The Personal Health Train.2 万多名肺癌患者的分布式学习 - 个人健康训练。
Radiother Oncol. 2020 Mar;144:189-200. doi: 10.1016/j.radonc.2019.11.019. Epub 2020 Jan 3.
2
Infrastructure platform for privacy-preserving distributed machine learning development of computer-assisted theragnostics in cancer.用于癌症计算机辅助治疗学中隐私保护分布式机器学习开发的基础架构平台。
J Biomed Inform. 2022 Oct;134:104181. doi: 10.1016/j.jbi.2022.104181. Epub 2022 Aug 30.
3
Privacy-preserving federated machine learning on FAIR health data: A real-world application.公平健康数据上的隐私保护联邦机器学习:一个实际应用
Comput Struct Biotechnol J. 2024 Feb 17;24:136-145. doi: 10.1016/j.csbj.2024.02.014. eCollection 2024 Dec.
4
Colorectal cancer health and care quality indicators in a federated setting using the Personal Health Train.利用个人健康训练系统在联邦环境中评估结直肠癌健康和护理质量指标。
BMC Med Inform Decis Mak. 2024 May 9;24(1):121. doi: 10.1186/s12911-024-02526-y.
5
Systematic Review of Privacy-Preserving Distributed Machine Learning From Federated Databases in Health Care.医疗保健领域联合数据库中隐私保护分布式机器学习的系统综述
JCO Clin Cancer Inform. 2020 Mar;4:184-200. doi: 10.1200/CCI.19.00047.
6
Predicting 30-Day Readmission Risk for Patients With Chronic Obstructive Pulmonary Disease Through a Federated Machine Learning Architecture on Findable, Accessible, Interoperable, and Reusable (FAIR) Data: Development and Validation Study.通过基于可查找、可访问、可互操作和可重用(FAIR)数据的联邦机器学习架构预测慢性阻塞性肺疾病患者30天再入院风险:开发与验证研究
JMIR Med Inform. 2022 Jun 2;10(6):e35307. doi: 10.2196/35307.
7
Infrastructure and distributed learning methodology for privacy-preserving multi-centric rapid learning health care: euroCAT.用于隐私保护的多中心快速学习医疗保健的基础设施和分布式学习方法:euroCAT
Clin Transl Radiat Oncol. 2017 May 19;4:24-31. doi: 10.1016/j.ctro.2016.12.004. eCollection 2017 Jun.
8
Distributed Skin Lesion Analysis Across Decentralised Data Sources.分布式皮肤损伤分析跨越去中心化数据源。
Stud Health Technol Inform. 2021 May 27;281:352-356. doi: 10.3233/SHTI210179.
9
A multicenter random forest model for effective prognosis prediction in collaborative clinical research network.多中心随机森林模型在协作临床研究网络中的有效预后预测。
Artif Intell Med. 2020 Mar;103:101814. doi: 10.1016/j.artmed.2020.101814. Epub 2020 Feb 5.
10
Learning From Others Without Sacrificing Privacy: Simulation Comparing Centralized and Federated Machine Learning on Mobile Health Data.从他人身上学习而不牺牲隐私:移动健康数据集中式和联邦机器学习的模拟比较。
JMIR Mhealth Uhealth. 2021 Mar 30;9(3):e23728. doi: 10.2196/23728.

引用本文的文献

1
Bridging Data Silos in Oncology with Modular Software for Federated Analysis on Fast Healthcare Interoperability Resources: Multisite Implementation Study.使用模块化软件在快速医疗保健互操作性资源上进行联合分析以弥合肿瘤学中的数据孤岛:多站点实施研究
J Med Internet Res. 2025 Apr 15;27:e65681. doi: 10.2196/65681.
2
Identifying pathways to the prevention of dementia: the Netherlands consortium of dementia cohorts.确定预防痴呆症的途径:荷兰痴呆症队列研究联盟
BMC Neurol. 2025 Feb 12;25(1):59. doi: 10.1186/s12883-024-03995-4.
3
Advancing Privacy-Preserving Health Care Analytics and Implementation of the Personal Health Train: Federated Deep Learning Study.
推进隐私保护医疗保健分析与个人健康列车的实施:联邦深度学习研究
JMIR AI. 2025 Feb 6;4:e60847. doi: 10.2196/60847.
4
Application of privacy protection technology to healthcare big data.隐私保护技术在医疗大数据中的应用。
Digit Health. 2024 Nov 4;10:20552076241282242. doi: 10.1177/20552076241282242. eCollection 2024 Jan-Dec.
5
Real-world federated learning in radiology: hurdles to overcome and benefits to gain.放射学中的真实世界联邦学习:需克服的障碍与可获得的益处
J Am Med Inform Assoc. 2025 Jan 1;32(1):193-205. doi: 10.1093/jamia/ocae259.
6
Advancing healthcare through data: the BETTER project's vision for distributed analytics.通过数据推动医疗保健发展:BETTER项目对分布式分析的愿景。
Front Med (Lausanne). 2024 Oct 2;11:1473874. doi: 10.3389/fmed.2024.1473874. eCollection 2024.
7
A study on interoperability between two Personal Health Train infrastructures in leukodystrophy data analysis.在白质营养不良数据分析中,两种个人健康训练基础设施的互操作性研究。
Sci Data. 2024 Jun 22;11(1):663. doi: 10.1038/s41597-024-03450-6.
8
Colorectal cancer health and care quality indicators in a federated setting using the Personal Health Train.利用个人健康训练系统在联邦环境中评估结直肠癌健康和护理质量指标。
BMC Med Inform Decis Mak. 2024 May 9;24(1):121. doi: 10.1186/s12911-024-02526-y.
9
A distributed feature selection pipeline for survival analysis using radiomics in non-small cell lung cancer patients.一种使用非小细胞肺癌患者放射组学的生存分析分布式特征选择管道。
Sci Rep. 2024 Apr 3;14(1):7814. doi: 10.1038/s41598-024-58241-1.
10
Machine Learning Meets Cancer.机器学习与癌症相遇。
Cancers (Basel). 2024 Mar 8;16(6):1100. doi: 10.3390/cancers16061100.