Ren Xiangyu, Boisbluche Simon, Philippe Kilian, Demy Mathieu, Äyrämö Sami, Rautiainen Ilkka, Ding Shuzhe, Prioux Jacques
Sino-French Joint Research Center of Sport Science, Key Laboratory of Adolescent Health Assessment and Exercise Intervention of Ministry of Education, College of Physical Education and Health, East China Normal University, Shanghai, China.
Movement, Sport, Health Laboratory, University of Rennes 2, Bruz, France.
Eur J Sport Sci. 2025 Sep;25(9):e70042. doi: 10.1002/ejsc.70042.
Rugby union is an intermittent high-intensity contact sport requiring the analysis of various training and match metrics. Time-motion analysis and video analysis have enhanced the understanding of the interplay between these two factors. However, limited studies have investigated the effect of workload on key performance indicators (KPIs) during matches. In this study, data collected from the global positioning system (GPS) were used to calculate cumulative workload values over 7, 14, and 21 days prior to each game. After dimensionality reduction through principal component analysis (PCA), these workload values were employed as features, with game KPIs as target variables. Modeling was conducted using linear regression (LR), support vector regression (SVR), random forest regression (RFR), and light gradient boosting machine (LightGBM) for regression tasks. The superiority of the model was assessed by coefficient of determination ( ), root mean square error ( ), and correlation coefficient ( ). The findings revealed that although individual GPS metrics exhibited weak correlations with KPIs, machine learning (ML) models particularly RFR, successfully captured complex interactions and nonlinear relationships. These models achieved significantly improved predictive performance, with values ranging from 0.40 to 0.72 for certain KPIs. Using SHapley Additive exPlanations (SHAP) analysis and partial dependence plots, this study enhanced the interpretability of ML models by identifying the influence of GPS features on KPIs and exploring their underlying mechanisms. These findings offer actionable insights for workload management, emphasizing critical factors that affect player performance.
英式橄榄球联盟是一项间歇性的高强度对抗性运动,需要对各种训练和比赛指标进行分析。时间动作分析和视频分析增进了人们对这两个因素之间相互作用的理解。然而,针对比赛期间工作量对关键绩效指标(KPI)影响的研究较少。在本研究中,利用从全球定位系统(GPS)收集的数据计算每场比赛前7天、14天和21天的累积工作量值。通过主成分分析(PCA)进行降维后,将这些工作量值用作特征,将比赛KPI作为目标变量。使用线性回归(LR)、支持向量回归(SVR)、随机森林回归(RFR)和轻梯度提升机(LightGBM)进行回归任务建模。通过决定系数( )、均方根误差( )和相关系数( )评估模型的优越性。研究结果表明,尽管单个GPS指标与KPI的相关性较弱,但机器学习(ML)模型,尤其是RFR,成功捕捉到了复杂的相互作用和非线性关系。这些模型的预测性能显著提高,某些KPI的 值范围为0.40至0.72。本研究通过使用SHapley加法解释(SHAP)分析和部分依赖图,确定了GPS特征对KPI的影响并探索其潜在机制,从而提高了ML模型的可解释性。这些发现为工作量管理提供了可操作的见解,强调了影响球员表现的关键因素。