Laboratory of Radiation Physics, Odense University Hospital, Denmark; Department of Clinical Research, University of Southern Denmark, Odense, Denmark; Danish Centre for Particle Therapy, Aarhus University Hospital, Denmark; Institute of Medical Physics, School of Physics, University of Sydney, Australia.
Radiotherapy Department, The Christie NHS Foundation Trust, Manchester, United Kingdom.
Radiother Oncol. 2022 Aug;173:319-326. doi: 10.1016/j.radonc.2022.06.009. Epub 2022 Jun 20.
Prediction models are useful to design personalised treatment. However, safe and effective implementation relies on external validation. Retrospective data are available in many institutions, but sharing between institutions can be challenging due to patient data sensitivity and governance or legal barriers. This study validates a larynx cancer survival model performed using distributed learning without any sensitive data leaving the institution.
Open-source distributed learning software based on a stratified Cox proportional hazard model was developed and used to validate the Egelmeer et al. MAASTRO survival model across two hospitals in two countries. The validation optimised a single scaling parameter multiplied by the original predicted prognostic index. All analyses and figures were based on the distributed system, ensuring no information leakage from the individual centres. All applied software is provided as freeware to facilitate distributed learning in other institutions.
1745 patients received radiotherapy for larynx cancer in the two centres from Jan 2005 to Dec 2018. Limiting to a maximum of one missing value in the parameters of the survival model reduced the cohort to 1095 patients. The Harrell C-index was 0.74 (CI95%, 0.71-0.76) and 0.70 (0.66-0.75) for the two centres. However, the model needed a scaling update. In addition, it was found that survival predictions of patients undergoing hypofractionation were less precise.
Open-source distributed learning software was able to validate, and suggest a minor update to the original survival model without central access to patient sensitive information. Even without the update, the original MAASTRO survival model of Egelmeer et al. performed reasonably well, providing similar results in this validation as in its original validation.
预测模型有助于设计个性化治疗方案。然而,其安全有效的实施依赖于外部验证。许多机构都有回顾性数据,但由于患者数据的敏感性以及治理或法律障碍,机构间的数据共享可能具有挑战性。本研究通过无敏感数据离开机构的分布式学习验证了喉癌生存模型。
开发了基于分层 Cox 比例风险模型的开源分布式学习软件,并用于在两个国家的两家医院验证 Egelmeer 等人的 MAASTRO 生存模型。验证优化了一个乘以原始预测预后指数的单一缩放参数。所有分析和图形均基于分布式系统,确保从各个中心没有信息泄露。所有应用的软件均作为免费软件提供,以促进其他机构的分布式学习。
2005 年 1 月至 2018 年 12 月,两个中心共有 1745 例喉癌患者接受放疗。将生存模型参数中的缺失值限制在一个以内,可将队列减少至 1095 例患者。两个中心的 Harrell C 指数分别为 0.74(95%CI,0.71-0.76)和 0.70(0.66-0.75)。但是,该模型需要进行缩放更新。此外,还发现接受低分割放疗的患者的生存预测精度较低。
开源分布式学习软件能够在不访问患者敏感信息的情况下验证原始生存模型,并对其进行微小更新。即使不进行更新,Egelmeer 等人的原始 MAASTRO 生存模型也表现良好,在本次验证中的表现与原始验证结果相似。