Retzepis Nikolaos-Orestis, Avloniti Alexandra, Kokkotis Christos, Protopapa Maria, Stampoulis Theodoros, Gkachtsou Anastasia, Pantazis Dimitris, Balampanos Dimitris, Smilios Ilias, Chatzinikolaou Athanasios
School of Physical Education and Sport Science, Democritus University of Thrace, 69100 Komotini, Greece.
Sports (Basel). 2024 Oct 22;12(11):287. doi: 10.3390/sports12110287.
Maturation is a key factor in sports participation and often determines the young athletes' characterization as a talent. However, there is no evidence of practical models for understanding the factors that discriminate children according to maturity. Hence, this study aims to deepen the understanding of the factors that affect maturity in 11-year-old Team Sports Athletes by utilizing explainable artificial intelligence (XAI) models. We utilized three established machine learning (ML) classifiers and applied the Sequential Forward Feature Selection (SFFS) algorithm to each. In this binary classification task, the logistic regression (LR) classifier achieved a top accuracy of 96.67% using the seven most informative factors (Sitting Height, Father's Height, Body Fat, Weight, Height, Left and Right-Hand Grip Strength). The SHapley Additive exPlanations (SHAP) model was instrumental in identifying the contribution of each factor, offering key insights into variable importance. Independent sample -tests on these selected factors confirmed their significance in distinguishing between the two classes. By providing detailed and personalized insights into child development, this integration has the potential to enhance the effectiveness of maturation prediction significantly. These advancements could lead to a transformative approach in young athletes' pediatric growth analysis, fostering better sports performance and developmental outcomes for children.
成熟度是体育参与的关键因素,往往决定了年轻运动员是否被视为有天赋。然而,目前尚无实用模型可用于理解根据成熟度区分儿童的因素。因此,本研究旨在通过利用可解释人工智能(XAI)模型,加深对影响11岁团体运动运动员成熟度因素的理解。我们使用了三种已建立的机器学习(ML)分类器,并分别应用了顺序前向特征选择(SFFS)算法。在这个二分类任务中,逻辑回归(LR)分类器使用七个最具信息量的因素(坐高、父亲身高、体脂、体重、身高、左手和右手握力)达到了96.67%的最高准确率。SHapley加性解释(SHAP)模型有助于确定每个因素的贡献,提供了关于变量重要性的关键见解。对这些选定因素进行的独立样本t检验证实了它们在区分两类之间的重要性。通过提供关于儿童发育的详细和个性化见解,这种整合有可能显著提高成熟度预测的有效性。这些进展可能会在年轻运动员的儿科生长分析中带来变革性方法,促进儿童更好的运动表现和发育结果。