Faculty of Information Technology, University of Jyväskylä, Jyväskylä, Finland.
Oslo Sports Trauma Research Center, Department of Sports Medicine, Norwegian School of Sport Sciences, Oslo, Norway.
Am J Sports Med. 2022 Sep;50(11):2917-2924. doi: 10.1177/03635465221112095. Epub 2022 Aug 19.
Injury risk prediction is an emerging field in which more research is needed to recognize the best practices for accurate injury risk assessment. Important issues related to predictive machine learning need to be considered, for example, to avoid overinterpreting the observed prediction performance.
To carefully investigate the predictive potential of multiple predictive machine learning methods on a large set of risk factor data for anterior cruciate ligament (ACL) injury; the proposed approach takes into account the effect of chance and random variations in prediction performance.
Case-control study; Level of evidence, 3.
The authors used 3-dimensional motion analysis and physical data collected from 791 female elite handball and soccer players. Four common classifiers were used to predict ACL injuries (n = 60). Area under the receiver operating characteristic curve (AUC-ROC) averaged across 100 cross-validation runs (mean AUC-ROC) was used as a performance metric. Results were confirmed with repeated permutation tests (paired Wilcoxon signed-rank-test; < .05). Additionally, the effect of the most common class imbalance handling techniques was evaluated.
For the best classifier (linear support vector machine), the mean AUC-ROC was 0.63. Regardless of the classifier, the results were significantly better than chance, confirming the predictive ability of the data and methods used. AUC-ROC values varied substantially across repetitions and methods (0.51-0.69). Class imbalance handling did not improve the results.
The authors' approach and data showed statistically significant predictive ability, indicating that there exists information in this prospective data set that may be valuable for understanding injury causation. However, the predictive ability remained low from the perspective of clinical assessment, suggesting that included variables cannot be used for ACL prediction in practice.
损伤风险预测是一个新兴领域,需要更多的研究来认识到准确评估损伤风险的最佳实践。需要考虑与预测机器学习相关的重要问题,例如,避免过度解释观察到的预测性能。
在大量风险因素数据上仔细研究多种预测机器学习方法对前交叉韧带(ACL)损伤的预测潜力;所提出的方法考虑了机会和预测性能随机变化的影响。
病例对照研究;证据水平,3 级。
作者使用 3 维运动分析和从 791 名女性精英手球和足球运动员收集的物理数据。使用 4 种常见的分类器来预测 ACL 损伤(n = 60)。100 次交叉验证运行的接收器操作特征曲线下面积(平均 AUC-ROC)被用作性能指标。结果通过重复置换检验(配对 Wilcoxon 符号秩检验;<0.05)得到确认。此外,还评估了最常见的类别不平衡处理技术的效果。
对于最佳分类器(线性支持向量机),平均 AUC-ROC 为 0.63。无论分类器如何,结果均明显优于机会,这证实了所用数据和方法的预测能力。AUC-ROC 值在重复和方法之间变化很大(0.51-0.69)。类别不平衡处理并没有改善结果。
作者的方法和数据显示出统计学上显著的预测能力,表明在这个前瞻性数据集存在可能有助于理解损伤因果关系的信息。然而,从临床评估的角度来看,预测能力仍然较低,这表明所包含的变量在实践中不能用于 ACL 预测。