Khoushehgir Fatemeh, Sulaimany Sadegh
Department of IT and Computer Engineering, Azarbaijan Shahid Madani University, Tabriz, Iran.
Social and Biological Network Analysis Laboratory (SBNA), Department of Computer Engineering, University of Kurdistan, Sanandaj, Iran.
Educ Inf Technol (Dordr). 2023 Jan 25:1-20. doi: 10.1007/s10639-023-11597-9.
In recent years, the rapid growth of Massive Open Online Courses (MOOCs) has attracted much attention for related research. Besides, one of the main challenges in MOOCs is the high dropout or low completion rate. Early dropout prediction algorithms aim the educational institutes to retain the students for the related course. There are several methods for identification of the resigning students. These methods are often based on supervised machine learning, and require student activity records to train and create a prediction model based on the features extracted from the raw data. The performance of graph-based algorithms in various applications to discover the strong or weak relationships between entities using limited data encouraged us to turn to these algorithms for this problem. Objective of this paper is proposing a novel method with low complexity, negative link prediction algorithm, for the first time, utilizing only network topological data for dropout prediction. The idea is based on the assumption that entities with similar network structures are more likely to establish or remove a relation. Therefore, we first convert the data into a graph, mapping entities (students and courses) to nodes and relationships (enrollment data) to links. Then we use graph-based algorithms to predict students' dropout, utilizing just enrollment data. The experimental results demonstrate that the proposed method achieves significant performance compared to baseline ones. However, we test the supervised link prediction idea, and show the competitive and promising results in this case as well. Finally, we present important future research directions to improve the results.
近年来,大规模在线开放课程(MOOCs)的迅速发展引起了相关研究的广泛关注。此外,MOOCs面临的主要挑战之一是高辍学率或低完成率。早期辍学预测算法旨在帮助教育机构留住相关课程的学生。有几种识别退学学生的方法。这些方法通常基于监督机器学习,需要学生活动记录来训练并基于从原始数据中提取的特征创建预测模型。基于图的算法在各种应用中利用有限数据发现实体之间强或弱关系的性能,促使我们针对这个问题转向这些算法。本文的目的是首次提出一种低复杂度的新方法——负链路预测算法,仅利用网络拓扑数据进行辍学预测。该想法基于这样的假设:具有相似网络结构的实体更有可能建立或消除关系。因此,我们首先将数据转换为图,将实体(学生和课程)映射到节点,将关系(注册数据)映射到链接。然后我们使用基于图的算法仅利用注册数据来预测学生的辍学情况。实验结果表明,与基线方法相比,所提出的方法取得了显著的性能。然而,我们测试了监督链路预测的想法,并在这种情况下也展示了具有竞争力和前景的结果。最后,我们提出了重要的未来研究方向以改进结果。