Mathematical Institute (MI) Leiden University, Niels Bohrweg 1, Leiden, 2333 CA, the Netherlands.
Department of Biomedical Data Sciences, Section Medical Statistics, Leiden University Medical Center (LUMC), Albinusdreef 2, Leiden, 2333 ZA, The Netherlands.
BMC Med Res Methodol. 2020 Nov 16;20(1):277. doi: 10.1186/s12874-020-01153-1.
Predicting survival of recipients after liver transplantation is regarded as one of the most important challenges in contemporary medicine. Hence, improving on current prediction models is of great interest.Nowadays, there is a strong discussion in the medical field about machine learning (ML) and whether it has greater potential than traditional regression models when dealing with complex data. Criticism to ML is related to unsuitable performance measures and lack of interpretability which is important for clinicians.
In this paper, ML techniques such as random forests and neural networks are applied to large data of 62294 patients from the United States with 97 predictors selected on clinical/statistical grounds, over more than 600, to predict survival from transplantation. Of particular interest is also the identification of potential risk factors. A comparison is performed between 3 different Cox models (with all variables, backward selection and LASSO) and 3 machine learning techniques: a random survival forest and 2 partial logistic artificial neural networks (PLANNs). For PLANNs, novel extensions to their original specification are tested. Emphasis is given on the advantages and pitfalls of each method and on the interpretability of the ML techniques.
Well-established predictive measures are employed from the survival field (C-index, Brier score and Integrated Brier Score) and the strongest prognostic factors are identified for each model. Clinical endpoint is overall graft-survival defined as the time between transplantation and the date of graft-failure or death. The random survival forest shows slightly better predictive performance than Cox models based on the C-index. Neural networks show better performance than both Cox models and random survival forest based on the Integrated Brier Score at 10 years.
In this work, it is shown that machine learning techniques can be a useful tool for both prediction and interpretation in the survival context. From the ML techniques examined here, PLANN with 1 hidden layer predicts survival probabilities the most accurately, being as calibrated as the Cox model with all variables.
Retrospective data were provided by the Scientific Registry of Transplant Recipients under Data Use Agreement number 9477 for analysis of risk factors after liver transplantation.
预测肝移植受者的存活率被认为是当代医学中最重要的挑战之一。因此,改进当前的预测模型是非常重要的。如今,医学领域对机器学习(ML)有很大的兴趣,讨论的重点是它在处理复杂数据时是否比传统回归模型具有更大的潜力。对 ML 的批评涉及不合适的性能指标和缺乏可解释性,这对临床医生很重要。
本文应用机器学习技术,如随机森林和神经网络,对来自美国的 62294 名患者的大数据进行分析,这些患者有 97 个临床/统计选择的预测因子,超过 600 个,以预测移植后的存活率。特别感兴趣的是识别潜在的危险因素。对 3 种不同的 Cox 模型(全变量、后向选择和 LASSO)和 3 种机器学习技术(随机生存森林和 2 种部分逻辑人工神经网络(PLANNs))进行了比较。对 PLANNs 的原始规范进行了新的扩展测试。重点介绍了每种方法的优缺点以及 ML 技术的可解释性。
采用了来自生存领域的成熟预测指标(C 指数、Brier 评分和综合 Brier 评分),并为每个模型确定了最强的预后因素。临床终点是总体移植物存活率,定义为移植后至移植物失功或死亡的时间。随机生存森林的预测性能略优于基于 C 指数的 Cox 模型。基于综合 Brier 评分,神经网络在 10 年内的表现优于 Cox 模型和随机生存森林。
在这项工作中,证明了机器学习技术可以成为生存分析中预测和解释的有用工具。在检查的 ML 技术中,具有 1 个隐藏层的 PLANN 预测存活率最准确,与包含所有变量的 Cox 模型一样具有校准能力。
根据数据使用协议 9477,由 Scientific Registry of Transplant Recipients 提供回顾性数据,用于分析肝移植后的危险因素。