Ranger Jochen, Schmidt Nico, Wolgast Anett
Martin-Luther-University Halle-Wittenberg, Germany.
University of Applied Sciences FHM, Hannover, Germany.
Educ Psychol Meas. 2023 Oct;83(5):1033-1058. doi: 10.1177/00131644221132723. Epub 2022 Nov 4.
Recent approaches to the detection of cheaters in tests employ detectors from the field of machine learning. Detectors based on supervised learning algorithms achieve high accuracy but require labeled data sets with identified cheaters for training. Labeled data sets are usually not available at an early stage of the assessment period. In this article, we discuss the approach of adapting a detector that was trained previously with a labeled training data set to a new unlabeled data set. The training and the new data set may contain data from different tests. The adaptation of detectors to new data or tasks is denominated as transfer learning in the field of machine learning. We first discuss the conditions under which a detector of cheating can be transferred. We then investigate whether the conditions are met in a real data set. We finally evaluate the benefits of transferring a detector of cheating. We find that a transferred detector has higher accuracy than an unsupervised detector of cheating. A naive transfer that consists of a simple reuse of the detector increases the accuracy considerably. A transfer via a self-labeling (SETRED) algorithm increases the accuracy slightly more than the naive transfer. The findings suggest that the detection of cheating might be improved by using existing detectors of cheating at an early stage of an assessment period.
近期用于检测考试作弊者的方法采用了机器学习领域的检测器。基于监督学习算法的检测器准确率很高,但需要带有已识别作弊者的标记数据集进行训练。在评估期的早期阶段,标记数据集通常无法获取。在本文中,我们讨论了将先前用标记训练数据集训练的检测器应用于新的未标记数据集的方法。训练集和新数据集可能包含来自不同测试的数据。在机器学习领域,将检测器应用于新数据或任务被称为迁移学习。我们首先讨论作弊检测器可以迁移的条件。然后我们研究在一个真实数据集中这些条件是否得到满足。我们最后评估迁移作弊检测器的益处。我们发现,迁移后的检测器比无监督作弊检测器具有更高的准确率。简单重复使用检测器的简单迁移能显著提高准确率。通过自标记(SETRED)算法进行的迁移比简单迁移能稍微进一步提高准确率。研究结果表明,在评估期的早期阶段使用现有的作弊检测器可能会改进作弊检测。