检测大规模评估中的作弊行为：将检测器应用于新测试

Detecting Cheating in Large-Scale Assessment: The Transfer of Detectors to New Tests.

作者信息

Ranger Jochen, Schmidt Nico, Wolgast Anett

机构信息

Martin-Luther-University Halle-Wittenberg, Germany.

University of Applied Sciences FHM, Hannover, Germany.

出版信息

Educ Psychol Meas. 2023 Oct;83(5):1033-1058. doi: 10.1177/00131644221132723. Epub 2022 Nov 4.

DOI:10.1177/00131644221132723

PMID:37663534

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10470164/

Abstract

Recent approaches to the detection of cheaters in tests employ detectors from the field of machine learning. Detectors based on supervised learning algorithms achieve high accuracy but require labeled data sets with identified cheaters for training. Labeled data sets are usually not available at an early stage of the assessment period. In this article, we discuss the approach of adapting a detector that was trained previously with a labeled training data set to a new unlabeled data set. The training and the new data set may contain data from different tests. The adaptation of detectors to new data or tasks is denominated as transfer learning in the field of machine learning. We first discuss the conditions under which a detector of cheating can be transferred. We then investigate whether the conditions are met in a real data set. We finally evaluate the benefits of transferring a detector of cheating. We find that a transferred detector has higher accuracy than an unsupervised detector of cheating. A naive transfer that consists of a simple reuse of the detector increases the accuracy considerably. A transfer via a self-labeling (SETRED) algorithm increases the accuracy slightly more than the naive transfer. The findings suggest that the detection of cheating might be improved by using existing detectors of cheating at an early stage of an assessment period.

摘要

近期用于检测考试作弊者的方法采用了机器学习领域的检测器。基于监督学习算法的检测器准确率很高，但需要带有已识别作弊者的标记数据集进行训练。在评估期的早期阶段，标记数据集通常无法获取。在本文中，我们讨论了将先前用标记训练数据集训练的检测器应用于新的未标记数据集的方法。训练集和新数据集可能包含来自不同测试的数据。在机器学习领域，将检测器应用于新数据或任务被称为迁移学习。我们首先讨论作弊检测器可以迁移的条件。然后我们研究在一个真实数据集中这些条件是否得到满足。我们最后评估迁移作弊检测器的益处。我们发现，迁移后的检测器比无监督作弊检测器具有更高的准确率。简单重复使用检测器的简单迁移能显著提高准确率。通过自标记（SETRED）算法进行的迁移比简单迁移能稍微进一步提高准确率。研究结果表明，在评估期的早期阶段使用现有的作弊检测器可能会改进作弊检测。

相似文献

Detecting Cheating in Large-Scale Assessment: The Transfer of Detectors to New Tests.

Educ Psychol Meas. 2023 Oct;83(5):1033-1058. doi: 10.1177/00131644221132723. Epub 2022 Nov 4.

Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment.

Educ Psychol Meas. 2023 Aug;83(4):831-854. doi: 10.1177/00131644221117193. Epub 2022 Aug 13.

Comprehensive study of semi-supervised learning for DNA methylation-based supervised classification of central nervous system tumors.

BMC Bioinformatics. 2022 Jun 8;23(1):223. doi: 10.1186/s12859-022-04764-1.

Transfer Learning Strategy Based on Unsupervised Learning and Ensemble Learning for Breast Cancer Molecular Subtype Prediction Using Dynamic Contrast-Enhanced MRI.

J Magn Reson Imaging. 2022 May;55(5):1518-1534. doi: 10.1002/jmri.27955. Epub 2021 Oct 20.

Leveraging Symbolic Knowledge Bases for Commonsense Natural Language Inference Using Pattern Theory.

IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13185-13202. doi: 10.1109/TPAMI.2023.3287837. Epub 2023 Oct 3.

Source Data-Absent Unsupervised Domain Adaptation Through Hypothesis Transfer and Labeling Transfer.

IEEE Trans Pattern Anal Mach Intell. 2022 Nov;44(11):8602-8617. doi: 10.1109/TPAMI.2021.3103390. Epub 2022 Oct 4.

CPSS: Fusing consistency regularization and pseudo-labeling techniques for semi-supervised deep cardiovascular disease detection using all unlabeled electrocardiograms.

Comput Methods Programs Biomed. 2024 Sep;254:108315. doi: 10.1016/j.cmpb.2024.108315. Epub 2024 Jul 4.

Exploiting Target Data to Learn Deep Convolutional Networks for Scene-Adapted Human Detection.

IEEE Trans Image Process. 2018 Mar;27(3):1418-1432. doi: 10.1109/TIP.2017.2779271. Epub 2017 Dec 4.

Novel Transfer Learning Approach for Medical Imaging with Limited Labeled Data.

Cancers (Basel). 2021 Mar 30;13(7):1590. doi: 10.3390/cancers13071590.

An Unsupervised Transfer Learning Framework for Visible-Thermal Pedestrian Detection.

Sensors (Basel). 2022 Jun 10;22(12):4416. doi: 10.3390/s22124416.

引用本文的文献

A new person-fit statistic for the detection of aberrant responses in polytomous cognitive diagnostic models.

Behav Res Methods. 2025 Apr 9;57(5):138. doi: 10.3758/s13428-025-02659-6.

本文引用的文献

Modeling Item Revisit Behavior: The Hierarchical Speed-Accuracy-Revisits Model.

Educ Psychol Meas. 2021 Apr;81(2):363-387. doi: 10.1177/0013164420950556. Epub 2020 Aug 31.

Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment.

Educ Psychol Meas. 2023 Aug;83(4):831-854. doi: 10.1177/00131644221117193. Epub 2022 Aug 13.

Transfer Adaptation Learning: A Decade Survey.

IEEE Trans Neural Netw Learn Syst. 2022 Jun 21;PP. doi: 10.1109/TNNLS.2022.3183326.

Assessing Preknowledge Cheating via Innovative Measures: A Multiple-Group Analysis of Jointly Modeling Item Responses, Response Times, and Visual Fixation Counts.

Educ Psychol Meas. 2021 Jun;81(3):441-465. doi: 10.1177/0013164420968630. Epub 2020 Oct 31.

The Detection of Cheating on E-Exams in Higher Education-The Performance of Several Old and Some New Indicators.

Front Psychol. 2020 Oct 2;11:568825. doi: 10.3389/fpsyg.2020.568825. eCollection 2020.

Detecting Examinees With Item Preknowledge in Large-Scale Testing Using Extreme Gradient Boosting (XGBoost).

Educ Psychol Meas. 2019 Oct;79(5):931-961. doi: 10.1177/0013164419839439. Epub 2019 Apr 2.

mclust 5: Clustering, Classification and Density Estimation Using Gaussian Finite Mixture Models.

R J. 2016 Aug;8(1):289-317.

PageFocus: Using paradata to detect and prevent cheating on online achievement tests.

Behav Res Methods. 2017 Aug;49(4):1444-1459. doi: 10.3758/s13428-016-0800-7.

Transfer learning for visual categorization: a survey.

IEEE Trans Neural Netw Learn Syst. 2015 May;26(5):1019-34. doi: 10.1109/TNNLS.2014.2330900. Epub 2014 Jul 1.

pROC: an open-source package for R and S+ to analyze and compare ROC curves.

BMC Bioinformatics. 2011 Mar 17;12:77. doi: 10.1186/1471-2105-12-77.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

检测大规模评估中的作弊行为：将检测器应用于新测试

Detecting Cheating in Large-Scale Assessment: The Transfer of Detectors to New Tests.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献