Department of Physiotherapy, Singapore General Hospital, Singapore, Singapore.
Department of Biostatistics, Vanderbilt University Medical Center, Nashville, TN, USA.
Knee Surg Sports Traumatol Arthrosc. 2020 Oct;28(10):3207-3216. doi: 10.1007/s00167-019-05822-7. Epub 2019 Dec 12.
Machine-learning methods are flexible prediction algorithms with potential advantages over conventional regression. This study aimed to use machine learning methods to predict post-total knee arthroplasty (TKA) walking limitation, and to compare their performance with that of logistic regression.
From the department's clinical registry, a cohort of 4026 patients who underwent elective, primary TKA between July 2013 and July 2017 was identified. Candidate predictors included demographics and preoperative clinical, psychosocial, and outcome measures. The primary outcome was severe walking limitation at 6 months post-TKA, defined as a maximum walk time ≤ 15 min. Eight common regression (logistic, penalized logistic, and ordinal logistic with natural splines) and ensemble machine learning (random forest, extreme gradient boosting, and SuperLearner) methods were implemented to predict the probability of severe walking limitation. Models were compared on discrimination and calibration metrics.
At 6 months post-TKA, 13% of patients had severe walking limitation. Machine learning and logistic regression models performed moderately [mean area under the ROC curves (AUC) 0.73-0.75]. Overall, the ordinal logistic regression model performed best while the SuperLearner performed best among machine learning methods, with negligible differences between them (Brier score difference, < 0.001; 95% CI [- 0.0025, 0.002]).
When predicting post-TKA physical function, several machine learning methods did not outperform logistic regression-in particular, ordinal logistic regression that does not assume linearity in its predictors.
Prognostic level II.
机器学习方法是一种灵活的预测算法,具有优于传统回归的潜在优势。本研究旨在使用机器学习方法预测全膝关节置换术(TKA)后行走受限,并比较其与逻辑回归的性能。
从该部门的临床登记处,确定了一个队列,该队列包含 4026 名 2013 年 7 月至 2017 年 7 月间接受选择性初次 TKA 的患者。候选预测因子包括人口统计学和术前临床、心理社会和结果测量。主要结局是 TKA 后 6 个月严重行走受限,定义为最大行走时间≤15 分钟。实施了 8 种常见的回归(逻辑、惩罚逻辑和带有自然样条的有序逻辑)和集成机器学习(随机森林、极端梯度增强和 SuperLearner)方法来预测严重行走受限的概率。模型通过区分和校准指标进行比较。
TKA 后 6 个月,13%的患者有严重的行走受限。机器学习和逻辑回归模型的表现中等[平均 ROC 曲线下面积(AUC)0.73-0.75]。总体而言,有序逻辑回归模型表现最佳,而机器学习方法中 SuperLearner 表现最佳,两者之间几乎没有差异(Brier 评分差异,<0.001;95%置信区间[-0.0025,0.002])。
在预测 TKA 后身体功能时,几种机器学习方法并未优于逻辑回归,特别是对于其预测因子不假设线性的有序逻辑回归。
预后 II 级。