Institute of Sports and Sports Science, Karlsruhe Institute of Technology, Karlsruhe, Germany.
Match Analysis, TSG 1899 Hoffenheim, Zuzenhausen, Germany.
Sci Med Footb. 2024 Nov;8(4):317-332. doi: 10.1080/24733938.2023.2239766. Epub 2023 Aug 4.
The interest in sports performance analysis is rising and tracking data holds high potential for game analysis in team sports due to its accuracy and informative content. Together with machine learning approaches one can obtain deeper and more objective insights into the performance structure. In soccer, the analysis of the defense was neglected in comparison to the offense. Therefore, the aim of this study is to predict ball gains in defense using tracking data to identify tactical variables that drive defensive success. We evaluated tracking data of 153 games of German Bundesliga season 2020/21. With it, we derived player (defensive pressure, distance to the ball, & velocity) and team metrics (inter-line distances, numerical superiority, surface area, & spread) each containing a tactical idea. Afterwards, we trained supervised machine learning classifiers (logistic regression, XGBoost, & Random Forest Classifier) to predict successful (ball gain) vs. unsuccessful defensive plays (no ball gain). The expert-reduction-model (Random Forest Classifier with 16 features) showed the best and satisfying prediction performance (F1-Score (test) = 0.57). Analyzing the most important input features of this model, we are able to identify tactical principles of defensive play that appear to be related to gaining the ball: press the ball leading player, create numerical superiority in areas close to the ball (press short pass options), compact organization of defending team. Those principles are highly interesting for practitioners to gain valuable insights in the tactical behavior of soccer players that may be related to the success of defensive play.
人们对运动表现分析的兴趣日益浓厚,由于跟踪数据的准确性和丰富内容,它在团队运动中的比赛分析中具有很高的潜力。通过与机器学习方法结合,人们可以更深入、更客观地了解表现结构。在足球中,与进攻相比,防守的分析被忽视了。因此,本研究的目的是使用跟踪数据预测防守中的球权获得,以确定驱动防守成功的战术变量。我们评估了德国德甲联赛 2020/21 赛季的 153 场比赛的跟踪数据。利用这些数据,我们得出了球员(防守压力、与球的距离和速度)和球队(线间距离、人数优势、表面面积和分散度)的指标,每个指标都包含一个战术理念。然后,我们训练了监督机器学习分类器(逻辑回归、XGBoost 和随机森林分类器)来预测成功的(球权获得)与不成功的防守动作(无球权获得)。专家简化模型(具有 16 个特征的随机森林分类器)显示了最佳和令人满意的预测性能(测试 F1 得分=0.57)。分析该模型最重要的输入特征,我们能够确定与获得球权相关的防守策略原则:压迫持球球员,在靠近球的区域创造人数优势(压迫短传选项),防守团队紧凑的组织。这些原则对教练来说非常有趣,可以深入了解与防守成功相关的足球运动员的战术行为。