Department of Health Research Methods, Evidence and Impact, McMaster University, 1280 Main Street West, Hamilton, Ontario, L8S 4K1, Canada.
Institute of Health Policy, Management and Evaluation, University of Toronto, Health Sciences Building, 155 College Street, Suite 425, Toronto, Ontario, M5T 3M6, Canada.
Ir J Med Sci. 2021 May;190(2):807-817. doi: 10.1007/s11845-020-02332-1. Epub 2020 Aug 6.
Supervised machine learning (ML) is a class of algorithms that "learn" from existing input-output pairs, which is gaining popularity in pattern recognition for classification and prediction problems. In this scoping review, we examined the use of supervised ML algorithms for the prediction of long-term allograft survival in kidney transplant recipients. Data sources included PubMed, the Cumulative Index to Nursing and Allied Health Literature, and the Institute for Electrical and Electronics Engineers (IEEE) Xplore libraries from inception to November 2019. We screened titles and abstracts and potentially eligible full-text reports to select studies and subsequently abstracted the data. Eleven studies were identified. Decision trees were the most commonly used method (n = 8), followed by artificial neural networks (ANN) (n = 4) and Bayesian belief networks (n = 2). The area under receiver operating curve (AUC) was the most common measure of discrimination (n = 7), followed by sensitivity (n = 5) and specificity (n = 4). Model calibration examining the reliability in risk prediction was performed using either the Pearson r or the Hosmer-Lemeshow test in four studies. One study showed that logistic regression had comparable performance to ANN, while another study demonstrated that ANN performed better in terms of sensitivity, specificity, and accuracy, as compared with a Cox proportional hazards model. We synthesized the evidence related to the comparison of ML techniques with traditional statistical approaches for prediction of long-term allograft survival in patients with a kidney transplant. The methodological and reporting quality of included studies was poor. Our study also demonstrated mixed results in terms of the predictive potential of the models.
监督机器学习 (ML) 是一类从现有输入-输出对中“学习”的算法,它在分类和预测问题的模式识别中越来越受欢迎。在这项范围审查中,我们检查了监督 ML 算法在预测肾移植受者长期移植物存活率中的应用。数据源包括 PubMed、护理与联合健康文献累积索引以及电气与电子工程师协会 (IEEE) Xplore 库,从开始到 2019 年 11 月。我们筛选了标题和摘要以及可能符合条件的全文报告,以选择研究并随后提取数据。确定了 11 项研究。决策树是最常用的方法(n=8),其次是人工神经网络(ANN)(n=4)和贝叶斯信念网络(n=2)。接收器工作特性曲线下的面积(AUC)是最常见的区分度衡量标准(n=7),其次是灵敏度(n=5)和特异性(n=4)。使用 Pearson r 或 Hosmer-Lemeshow 检验在四项研究中进行了模型校准,以检查风险预测的可靠性。一项研究表明,逻辑回归与 ANN 具有相当的性能,而另一项研究表明,与 Cox 比例风险模型相比,ANN 在灵敏度、特异性和准确性方面表现更好。我们综合了有关 ML 技术与传统统计方法在预测肾移植患者长期移植物存活率方面比较的证据。纳入研究的方法学和报告质量较差。我们的研究还表明,模型的预测潜力存在混合结果。