Department of Physics, Centre for Computing in Science Education, University of Oslo, Blindern, Oslo, Norway.
Department of Physics and Astronomy, Michigan State University, East Lansing, Michigan, United States of America.
PLoS One. 2020 Nov 13;15(11):e0242334. doi: 10.1371/journal.pone.0242334. eCollection 2020.
The time it takes a student to graduate with a university degree is mitigated by a variety of factors such as their background, the academic performance at university, and their integration into the social communities of the university they attend. Different universities have different populations, student services, instruction styles, and degree programs, however, they all collect institutional data. This study presents data for 160,933 students attending a large American research university. The data includes performance, enrollment, demographics, and preparation features. Discrete time hazard models for the time-to-graduation are presented in the context of Tinto's Theory of Drop Out. Additionally, a novel machine learning method: gradient boosted trees, is applied and compared to the typical maximum likelihood method. We demonstrate that enrollment factors (such as changing a major) lead to greater increases in model predictive performance of when a student graduates than performance factors (such as grades) or preparation (such as high school GPA).
学生从大学毕业所需的时间受多种因素的影响,例如他们的背景、在大学的学业成绩以及他们融入所就读大学的社交群体的程度。不同的大学有不同的人群、学生服务、教学风格和学位课程,但它们都收集机构数据。本研究提供了一所大型美国研究型大学的 160933 名学生的数据。这些数据包括表现、入学、人口统计学和准备情况。在 Tinto 的辍学理论背景下,提出了用于毕业时间的离散时间风险模型。此外,还应用了一种新颖的机器学习方法:梯度提升树,并将其与典型的最大似然方法进行了比较。我们证明,与表现因素(如成绩)或准备情况(如高中 GPA)相比,入学因素(如换专业)会导致学生毕业时间的模型预测性能更大的提高。