Janssens Bram, Bogaert Matthias, Maton Mathijs
Department of Marketing, Innovation and Organisation, Ghent University, Tweekerkenstraat 2, 9000 Ghent, Belgium.
Ann Oper Res. 2023;325(1):557-588. doi: 10.1007/s10479-021-04476-4. Epub 2022 Jan 19.
The importance of young athletes in the field of professional cycling has sky-rocketed during the past years. Nevertheless, the early talent identification of these riders largely remains a subjective assessment. Therefore, an analytical system which automatically detects talented riders based on their freely available youth results should be installed. However, such a system cannot be copied directly from related fields, as large distinctions are observed between cycling and other sports. The aim of this paper is to develop such a data analytical system, which leverages the unique features of each race and thereby focusses on feature engineering, data quality, and visualization. To facilitate the deployment of prediction algorithms in situations without complete cases, we propose an adaptation to the k-nearest neighbours imputation algorithm which uses expert knowledge. Overall, our proposed method correlates strongly with eventual rider performance and can aid scouts in targeting young talents. On top of that, we introduce several model interpretation tools to give insight into which current starting professional riders are expected to perform well and why.
在过去几年里,年轻运动员在职业自行车领域的重要性急剧上升。然而,这些车手的早期天赋识别在很大程度上仍然是主观评估。因此,应该建立一个基于车手可自由获取的青年时期成绩自动检测有天赋车手的分析系统。然而,这样的系统不能直接从相关领域照搬,因为自行车运动与其他运动之间存在很大差异。本文的目的是开发这样一个数据分析系统,该系统利用每场比赛的独特特征,从而专注于特征工程、数据质量和可视化。为便于在无完整案例的情况下部署预测算法,我们提出了一种对使用专家知识的k近邻插补算法的改进方法。总体而言,我们提出的方法与车手最终表现密切相关,可帮助球探锁定年轻人才。除此之外,我们还引入了几种模型解释工具,以深入了解哪些目前的职业车手有望表现出色以及原因。