Department of Mathematics, Imperial College London, London, UK.
Imperial College Business School, Imperial College London, London, UK.
Sci Rep. 2021 Feb 2;11(1):2823. doi: 10.1038/s41598-021-81709-3.
The intrinsic temporality of learning demands the adoption of methodologies capable of exploiting time-series information. In this study we leverage the sequence data framework and show how data-driven analysis of temporal sequences of task completion in online courses can be used to characterise personal and group learners' behaviors, and to identify critical tasks and course sessions in a given course design. We also introduce a recently developed probabilistic Bayesian model to learn sequential behaviours of students and predict student performance. The application of our data-driven sequence-based analyses to data from learners undertaking an on-line Business Management course reveals distinct behaviors within the cohort of learners, identifying learners or groups of learners that deviate from the nominal order expected in the course. Using course grades a posteriori, we explore differences in behavior between high and low performing learners. We find that high performing learners follow the progression between weekly sessions more regularly than low performing learners, yet within each weekly session high performing learners are less tied to the nominal task order. We then model the sequences of high and low performance students using the probablistic Bayesian model and show that we can learn engagement behaviors associated with performance. We also show that the data sequence framework can be used for task-centric analysis; we identify critical junctures and differences among types of tasks within the course design. We find that non-rote learning tasks, such as interactive tasks or discussion posts, are correlated with higher performance. We discuss the application of such analytical techniques as an aid to course design, intervention, and student supervision.
学习的内在时间性要求采用能够利用时间序列信息的方法。在本研究中,我们利用序列数据框架,展示了如何使用在线课程中任务完成的时间序列数据驱动分析来描述个人和群体学习者的行为,并识别给定课程设计中的关键任务和课程环节。我们还介绍了一种最近开发的概率贝叶斯模型,用于学习学生的顺序行为并预测学生的表现。我们的基于数据驱动的序列分析应用于在线商业管理课程学习者的数据,揭示了学习者群体中的不同行为,确定了偏离课程中预期的规范顺序的学习者或学习者群体。使用课程成绩进行事后分析,我们探讨了高绩效和低绩效学习者之间的行为差异。我们发现,高绩效学习者比低绩效学习者更规律地遵循每周课程之间的进度,而在每个每周课程中,高绩效学习者与规范任务顺序的联系较少。然后,我们使用概率贝叶斯模型对高绩效和低绩效学生的序列进行建模,并表明我们可以学习与绩效相关的参与行为。我们还表明,数据序列框架可用于以任务为中心的分析;我们在课程设计中的任务类型之间确定了关键节点和差异。我们发现,非机械学习任务,如互动任务或讨论帖子,与较高的表现相关。我们讨论了将此类分析技术作为课程设计、干预和学生监督辅助手段的应用。