Department of Research and Development, Netherlands Comprehensive Cancer Organization (IKNL), Zernikestraat 29, 5612 HZ, Eindhoven, The Netherlands.
Department of Health Technology and Services Research, University of Twente, Enschede, The Netherlands.
Sci Rep. 2021 Mar 26;11(1):6968. doi: 10.1038/s41598-021-86327-7.
Cox Proportional Hazards (CPH) analysis is the standard for survival analysis in oncology. Recently, several machine learning (ML) techniques have been adapted for this task. Although they have shown to yield results at least as good as classical methods, they are often disregarded because of their lack of transparency and little to no explainability, which are key for their adoption in clinical settings. In this paper, we used data from the Netherlands Cancer Registry of 36,658 non-metastatic breast cancer patients to compare the performance of CPH with ML techniques (Random Survival Forests, Survival Support Vector Machines, and Extreme Gradient Boosting [XGB]) in predicting survival using the [Formula: see text]-index. We demonstrated that in our dataset, ML-based models can perform at least as good as the classical CPH regression ([Formula: see text]-index [Formula: see text]), and in the case of XGB even better ([Formula: see text]-index [Formula: see text]). Furthermore, we used Shapley Additive Explanation (SHAP) values to explain the models' predictions. We concluded that the difference in performance can be attributed to XGB's ability to model nonlinearities and complex interactions. We also investigated the impact of specific features on the models' predictions as well as their corresponding insights. Lastly, we showed that explainable ML can generate explicit knowledge of how models make their predictions, which is crucial in increasing the trust and adoption of innovative ML techniques in oncology and healthcare overall.
Cox 比例风险(CPH)分析是肿瘤学中生存分析的标准。最近,已经采用了几种机器学习(ML)技术来完成这项任务。尽管它们已经证明至少可以产生与经典方法一样好的结果,但由于缺乏透明度和几乎没有可解释性,它们通常被忽视,而这对于它们在临床环境中的采用至关重要。在本文中,我们使用来自荷兰癌症登记处的 36658 名非转移性乳腺癌患者的数据,比较了 CPH 与 ML 技术(随机生存森林、生存支持向量机和极端梯度增强[XGB])在使用[Formula: see text]-指数预测生存方面的性能。我们证明,在我们的数据集,基于 ML 的模型至少可以与经典的 CPH 回归[Formula: see text]-指数[Formula: see text]一样好地执行,在 XGB 的情况下甚至更好[Formula: see text]-指数[Formula: see text]。此外,我们使用 Shapley Additive Explanation(SHAP)值来解释模型的预测。我们得出结论,性能差异可以归因于 XGB 建模非线性和复杂相互作用的能力。我们还研究了特定特征对模型预测的影响及其相应的见解。最后,我们表明,可解释的 ML 可以生成关于模型如何进行预测的明确知识,这对于增加创新的 ML 技术在肿瘤学和整个医疗保健中的信任和采用至关重要。