Alaiti Rafael Krasic, Vallio Caio Sain, Assunção Jorge Henrique, de Andrade E Silva Fernando Brandão, Gracitelli Mauro Emilio Conforto, Neto Arnaldo Amado Ferreira, Malavolta Eduardo Angeli
Research, Technology, and Data Science Office, Grupo Superador, São Paulo, Brazil.
Universidade de São Paulo, São Paulo, Brazil.
Orthop J Sports Med. 2023 Oct 19;11(10):23259671231206180. doi: 10.1177/23259671231206180. eCollection 2023 Oct.
Although some evidence suggests that machine learning algorithms may outperform classical statistical methods in prognosis prediction for several orthopaedic surgeries, to our knowledge, no study has yet used machine learning to predict patient-reported outcome measures after rotator cuff repair.
To determine whether machine learning algorithms using preoperative data can predict the nonachievement of the minimal clinically important difference (MCID) of disability at 2 years after rotator cuff surgical repair with a similar performance to that of other machine learning studies in the orthopaedic surgery literature.
Case-control study; Level of evidence, 3.
We evaluated 474 patients (n = 500 shoulders) with rotator cuff tears who underwent arthroscopic rotator cuff repair between January 2013 and April 2019. The study outcome was the difference between the preoperative and 24-month postoperative American Shoulder and Elbow Surgeons (ASES) score. A cutoff score was calculated based on the established MCID of 15.2 points to separate success (higher than the cutoff) from failure (lower than the cutoff). Routinely collected imaging, clinical, and demographic data were used to train 8 machine learning algorithms (random forest classifier; light gradient boosting machine [LightGBM]; decision tree classifier; extra trees classifier; logistic regression; extreme gradient boosting [XGBoost]; -nearest neighbors [KNN] classifier; and CatBoost classifier). We used a random sample of 70% of patients to train the algorithms, and 30% were left for performance assessment, simulating new data. The performance of the models was evaluated with the area under the receiver operating characteristic curve (AUC).
The AUCs for all algorithms ranged from 0.58 to 0.68. The random forest classifier and LightGBM presented the highest AUC values (0.68 [95% CI, 0.48-0.79] and 0.67 [95% CI, 0.43-0.75], respectively) of the 8 machine learning algorithms. Most of the machine learning algorithms outperformed logistic regression (AUC, 0.59 [95% CI, 0.48-0.81]); nonetheless, their performance was lower than that of other machine learning studies in the orthopaedic surgery literature.
Machine learning algorithms demonstrated some ability to predict the nonachievement of the MCID on the ASES 2 years after rotator cuff repair surgery.
尽管有证据表明,在几种骨科手术的预后预测中,机器学习算法可能优于传统统计方法,但据我们所知,尚无研究使用机器学习来预测肩袖修复术后患者报告的结局指标。
确定使用术前数据的机器学习算法能否预测肩袖手术修复后2年残疾程度未达到最小临床重要差异(MCID)的情况,其性能与骨科手术文献中其他机器学习研究的性能相似。
病例对照研究;证据等级,3级。
我们评估了2013年1月至2019年4月间接受关节镜下肩袖修复的474例肩袖撕裂患者(共500个肩部)。研究结局为术前与术后24个月美国肩肘外科医师学会(ASES)评分的差值。根据既定的15.2分MCID计算出一个临界值,以区分成功(高于临界值)与失败(低于临界值)。使用常规收集的影像学、临床和人口统计学数据来训练8种机器学习算法(随机森林分类器、轻梯度提升机[LightGBM]、决策树分类器、极端随机树分类器、逻辑回归、极端梯度提升[XGBoost]、K近邻[KNN]分类器和CatBoost分类器)。我们使用70%的患者随机样本训练算法,30%留作性能评估,模拟新数据。使用受试者工作特征曲线(AUC)下的面积评估模型性能。
所有算法的AUC范围为0.58至0.68。随机森林分类器和LightGBM在8种机器学习算法中呈现出最高的AUC值(分别为0.68[95%CI,0.48 - 0.79]和0.67[95%CI,0.43 - 0.75])。大多数机器学习算法的表现优于逻辑回归(AUC,0.59[95%CI,0.48 - 0.81]);尽管如此,它们的性能低于骨科手术文献中其他机器学习研究的性能。
机器学习算法显示出一定能力,可预测肩袖修复手术后2年ASES评分未达到MCID的情况。