Xu Xin, Zhang Susu, Guo Jinxin, Xin Tao
Collaborative Innovation Center of Assessment for Basic Education Quality, Beijing Normal University, Beijing 100875, China.
Departments of Psychology and Statistics, University of Illinois Urbana-Champaign, Champaign, IL 61820, USA.
J Intell. 2024 Jan 17;12(1):10. doi: 10.3390/jintelligence12010010.
Computer-based assessments provide the opportunity to collect a new source of behavioral data related to the problem-solving process, known as log file data. To understand the behavioral patterns that can be uncovered from these process data, many studies have employed clustering methods. In contrast to one-mode clustering algorithms, this study utilized biclustering methods, enabling simultaneous classification of test takers and features extracted from log files. By applying the biclustering algorithms to the "Ticket" task in the PISA 2012 CPS assessment, we evaluated the potential of biclustering algorithms in identifying and interpreting homogeneous biclusters from the process data. Compared with one-mode clustering algorithms, the biclustering methods could uncover clusters of individuals who are homogeneous on a subset of feature variables, holding promise for gaining fine-grained insights into students' problem-solving behavior patterns. Empirical results revealed that specific subsets of features played a crucial role in identifying biclusters. Additionally, the study explored the utilization of biclustering on both the action sequence data and timing data, and the inclusion of time-based features enhanced the understanding of students' action sequences and scores in the context of the analysis.
基于计算机的评估提供了收集与问题解决过程相关的新行为数据来源的机会,即日志文件数据。为了理解可以从这些过程数据中发现的行为模式,许多研究采用了聚类方法。与单模式聚类算法不同,本研究使用了双聚类方法,能够同时对考生和从日志文件中提取的特征进行分类。通过将双聚类算法应用于2012年PISA CPS评估中的“票务”任务,我们评估了双聚类算法从过程数据中识别和解释同质双聚类的潜力。与单模式聚类算法相比,双聚类方法可以发现特征变量子集中同质的个体聚类,有望对学生的问题解决行为模式获得细粒度的洞察。实证结果表明,特定的特征子集在识别双聚类中起着关键作用。此外,该研究还探讨了在动作序列数据和时间数据上使用双聚类,并且纳入基于时间的特征增强了在分析背景下对学生动作序列和分数的理解。