Centre for Sport Science and University Sports, University of Vienna, Vienna, Austria.
Faculty of Computer Science, University of Vienna, Vienna, Austria.
PLoS One. 2024 Apr 18;19(4):e0298107. doi: 10.1371/journal.pone.0298107. eCollection 2024.
With recent technological advancements, quantitative analysis has become an increasingly important area within professional sports. However, the manual process of collecting data on relevant match events like passes, goals and tacklings comes with considerable costs and limited consistency across providers, affecting both research and practice. In football, while automatic detection of events from positional data of the players and the ball could alleviate these issues, it is not entirely clear what accuracy current state-of-the-art methods realistically achieve because there is a lack of high-quality validations on realistic and diverse data sets. This paper adds context to existing research by validating a two-step rule-based pass and shot detection algorithm on four different data sets using a comprehensive validation routine that accounts for the temporal, hierarchical and imbalanced nature of the task. Our evaluation shows that pass and shot detection performance is highly dependent on the specifics of the data set. In accordance with previous studies, we achieve F-scores of up to 0.92 for passes, but only when there is an inherent dependency between event and positional data. We find a significantly lower accuracy with F-scores of 0.71 for passes and 0.65 for shots if event and positional data are independent. This result, together with a critical evaluation of existing methodologies, suggests that the accuracy of current football event detection algorithms operating on positional data is currently overestimated. Further analysis reveals that the temporal extraction of passes and shots from positional data poses the main challenge for rule-based approaches. Our results further indicate that the classification of plays into shots and passes is a relatively straightforward task, achieving F-scores between 0.83 to 0.91 ro rule-based classifiers and up to 0.95 for machine learning classifiers. We show that there exist simple classifiers that accurately differentiate shots from passes in different data sets using a low number of human-understandable rules. Operating on basic spatial features, our classifiers provide a simple, objective event definition that can be used as a foundation for more reliable event-based match analysis.
随着最近技术的进步,定量分析已成为专业体育领域中越来越重要的领域。然而,收集传球、进球和抢断等相关比赛事件数据的手动过程成本高昂,且不同提供者之间的一致性有限,这对研究和实践都有影响。在足球中,虽然可以通过球员和球的位置数据自动检测事件来缓解这些问题,但由于缺乏对真实和多样化数据集的高质量验证,目前尚不清楚最先进的方法的准确性实际能达到什么程度。本文通过使用全面的验证程序,根据事件和位置数据之间的固有依赖性,在四个不同的数据集上验证基于规则的两步传球和射门检测算法,为现有研究提供了背景。我们的评估表明,传球和射门检测性能高度依赖于数据集的具体情况。与先前的研究一致,我们实现了高达 0.92 的传球 F 分数,但只有当事件和位置数据之间存在固有依赖性时。如果事件和位置数据是独立的,则传球的准确性会显著降低,F 分数为 0.71,射门的 F 分数为 0.65。这一结果,再加上对现有方法的批判性评估,表明基于位置数据运行的当前足球事件检测算法的准确性被高估了。进一步的分析表明,从位置数据中提取传球和射门的时间是基于规则的方法面临的主要挑战。我们的结果进一步表明,将比赛分类为射门和传球是一项相对简单的任务,基于规则的分类器的 F 分数在 0.83 到 0.91 之间,机器学习分类器的 F 分数高达 0.95。我们表明,在不同的数据集上,使用少量易于理解的规则,存在简单的分类器可以准确地区分射门和传球。我们的分类器基于基本的空间特征,提供了一个简单、客观的事件定义,可以作为更可靠的基于事件的比赛分析的基础。