School of Foreign Languages, Wuhan Business University, Wuhan, Hubei, People's Republic of China.
PLoS One. 2024 Sep 5;19(9):e0309838. doi: 10.1371/journal.pone.0309838. eCollection 2024.
Student academic achievement is an important indicator for evaluating the quality of education, especially, the achievement prediction empowers educators in tailoring their instructional approaches, thereby fostering advancements in both student performance and the overall educational quality. However, extracting valuable insights from vast educational data to develop effective strategies for evaluating student performance remains a significant challenge for higher education institutions. Traditional machine learning (ML) algorithms often struggle to clearly delineate the interplay between the factors that influence academic success and the resulting grades. To address these challenges, this paper introduces the XGB-SHAP model, a novel approach for predicting student achievement that combines Extreme Gradient Boosting (XGBoost) with SHapley Additive exPlanations (SHAP). The model was applied to a dataset from a public university in Wuhan, encompassing the academic records of 87 students who were enrolled in a Japanese course between September 2021 and June 2023. The findings indicate the model excels in accuracy, achieving a Mean absolute error (MAE) of approximately 6 and an R-squared value near 0.82, surpassing three other ML models. The model further uncovers how different instructional modes influence the factors that contribute to student achievement. This insight supports the need for a customized approach to feature selection that aligns with the specific characteristics of each teaching mode. Furthermore, the model highlights the importance of incorporating self-directed learning skills into student-related indicators when predicting academic performance.
学生学业成绩是评估教育质量的重要指标,特别是,成绩预测使教育工作者能够调整他们的教学方法,从而促进学生成绩和整体教育质量的提高。然而,从庞大的教育数据中提取有价值的见解,以制定评估学生表现的有效策略,对高等教育机构来说仍然是一个重大挑战。传统的机器学习 (ML) 算法往往难以清楚地区分影响学业成功的因素与最终成绩之间的相互作用。为了解决这些挑战,本文引入了 XGB-SHAP 模型,这是一种预测学生成绩的新方法,它将极端梯度提升 (XGBoost) 与 Shapley 可加性解释 (SHAP) 相结合。该模型应用于来自武汉一所公立大学的数据集,其中包含 2021 年 9 月至 2023 年 6 月期间参加日语课程的 87 名学生的学术记录。研究结果表明,该模型在准确性方面表现出色,平均绝对误差 (MAE) 约为 6,R-squared 值接近 0.82,优于其他三个 ML 模型。该模型还揭示了不同的教学模式如何影响影响学生成绩的因素。这一发现支持了根据每个教学模式的具体特点,采用定制的特征选择方法的必要性。此外,该模型强调了在预测学业成绩时,将自主学习技能纳入学生相关指标的重要性。