School of Physical Education, University of Campinas, Campinas, Brazil.
Faculty of São Vicente, São Vicente, Brazil.
PLoS One. 2024 May 30;19(5):e0304139. doi: 10.1371/journal.pone.0304139. eCollection 2024.
The present study aimed to assess the use of technical-tactical variables and machine learning (ML) classifiers in the automatic classification of the passing difficulty (DP) level in soccer matches and to illustrate the use of the model with the best performance to distinguish the best passing players. We compared eight ML classifiers according to their accuracy performance in classifying passing events using 35 technical-tactical variables based on spatiotemporal data. The Support Vector Machine (SVM) algorithm achieved a balanced accuracy of 0.70 ± 0.04%, considering a multi-class classification. Next, we illustrate the use of the best-performing classifier in the assessment of players. In our study, 2,522 pass actions were classified by the SVM algorithm as low (53.9%), medium (23.6%), and high difficulty passes (22.5%). Furthermore, we used successful rates in low-DP, medium-DP, and high-DP as inputs for principal component analysis (PCA). The first principal component (PC1) showed a higher correlation with high-DP (0.80), followed by medium-DP (0.73), and low-DP accuracy (0.24). The PC1 scores were used to rank the best passing players. This information can be a very rich performance indication by ranking the best passing players and teams and can be applied in offensive sequences analysis and talent identification.
本研究旨在评估技术战术变量和机器学习 (ML) 分类器在足球比赛中自动分类传球难度 (DP) 水平的应用,并通过使用性能最佳的模型来说明如何区分最佳传球球员。我们根据基于时空数据的 35 个技术战术变量,比较了 8 种 ML 分类器在传球事件分类中的准确性表现。支持向量机 (SVM) 算法在多类分类中实现了 0.70±0.04%的平衡准确率。接下来,我们说明了使用性能最佳的分类器来评估球员的方法。在我们的研究中,2522 次传球动作被 SVM 算法分类为低难度(53.9%)、中难度(23.6%)和高难度(22.5%)传球。此外,我们使用低 DP、中 DP 和高 DP 的成功率作为主成分分析(PCA)的输入。第一主成分(PC1)与高 DP 的相关性更高(0.80),其次是中 DP(0.73)和低 DP 准确率(0.24)。PC1 得分用于对最佳传球球员进行排名。通过对最佳传球球员和球队进行排名,这一信息可以提供非常丰富的表现指标,并可应用于进攻序列分析和人才识别。