Ren Xiangyu, Boisbluche Simon, Philippe Kilian, Demy Mathieu, Äyrämö Sami, Rautiainen Ilkka, Ding Shuzhe, Prioux Jacques
Key Laboratory of Adolescent Health Assessment and Exercise Intervention of Ministry of Education, Sino-French Joint Research Center of Sport Science, College of Physical Education and Health, East China Normal University, Shanghai, China.
Movement, Sport, Health Laboratory, University of Rennes 2, Bruz, France.
Eur J Sport Sci. 2025 Oct;25(10):e70057. doi: 10.1002/ejsc.70057.
In sports, injury prevention is a key factor for success. Although injuries are challenging to predict, new technologies and the application of data science can provide valuable insights. This study aimed to predict injury risk among professional rugby union players using machine learning (ML) models. We analyzed data from 63 professional rugby union players during three seasons, categorized them into forwards and backs, and further classified them into five specific positions (tight five, back row, scrum-half, inside backs, outside backs). The dataset included GPS data and derived metrics such as total workload in the 1, 2, and 3 weeks prior to injury, acute-to-chronic workload ratio over different time windows, monotony, and strain. Injury prediction was assessed separately for different player positions using five ML classification models: logistic regression, naïve Bayes (NB), support vector machine, random forest (RF), and eXtreme gradient boosting (XGBoost). RF performed best for forwards overall, with XGBoost excelling in the tight five and SVM in the back row, whereas among backs, RF led for inside backs and NB for outside backs. Additionally, feature importance plots were used to examine the impact of various factors on injury occurrence. In conclusion, our ML-based approach can effectively predict injuries, with average F1 scores up to 0.66 (± 0.14), particularly when applying a combination of GPS-derived metrics. Additionally, key characteristics indicative of injury for players in various positions have been successfully identified. These findings underscored the potential of ML to enhance injury prediction and inform tailored training strategies for athletes.
在体育领域,预防伤病是取得成功的关键因素。尽管伤病难以预测,但新技术和数据科学的应用能够提供有价值的见解。本研究旨在使用机器学习(ML)模型预测职业英式橄榄球联盟球员的伤病风险。我们分析了63名职业英式橄榄球联盟球员在三个赛季的数据,将他们分为前锋和后卫,并进一步细分为五个特定位置(前排五名球员、后排球员、传锋、内锋、边锋)。数据集包括GPS数据以及派生指标,如受伤前1周、2周和3周的总工作量、不同时间窗口内的急性与慢性工作量比率、单调性和应变。使用五个ML分类模型分别评估不同球员位置的伤病预测:逻辑回归、朴素贝叶斯(NB)、支持向量机、随机森林(RF)和极端梯度提升(XGBoost)。总体而言,RF对前锋的预测效果最佳,XGBoost在紧前排五名球员中表现出色,SVM在后排球员中表现出色,而在后卫中,RF对内锋的预测领先,NB对边锋的预测领先。此外,特征重要性图用于检验各种因素对伤病发生的影响。总之,我们基于ML的方法能够有效预测伤病,平均F1分数高达0.66(±0.14),特别是在应用GPS派生指标组合时。此外,已成功识别出不同位置球员受伤的关键特征。这些发现强调了ML在增强伤病预测和为运动员制定量身定制的训练策略方面的潜力。