Cui Chaoran, Ma Hebo, Dong Xiaolin, Zhang Chen, Zhang Chunyun, Yao Yumo, Chen Meng, Ma Yuling
School of Computer Science and Technology, Shandong University of Finance and Economics, Jinan, China.
Department of Computing, Hong Kong Polytechnic University, Hong Kong, China.
Neural Netw. 2024 Oct;178:106495. doi: 10.1016/j.neunet.2024.106495. Epub 2024 Jun 27.
Knowledge tracing (KT) aims to monitor students' evolving knowledge states through their learning interactions with concept-related questions, and can be indirectly evaluated by predicting how students will perform on future questions. In this paper, we observe that there is a common phenomenon of answer bias, i.e., a highly unbalanced distribution of correct and incorrect answers for each question. Existing models tend to memorize the answer bias as a shortcut for achieving high prediction performance in KT, thereby failing to fully understand students' knowledge states. To address this issue, we approach the KT task from a causality perspective. A causal graph of KT is first established, from which we identify that the impact of answer bias lies in the direct causal effect of questions on students' responses. A novel COunterfactual REasoning (CORE) framework for KT is further proposed, which separately captures the total causal effect and direct causal effect during training, and mitigates answer bias by subtracting the latter from the former in testing. The CORE framework is applicable to various existing KT models, and we implement it based on the prevailing DKT, DKVMN, and AKT models, respectively. Extensive experiments on three benchmark datasets demonstrate the effectiveness of CORE in making the debiased inference for KT. We have released our code at https://github.com/lucky7-code/CORE.
知识追踪(KT)旨在通过学生与概念相关问题的学习交互来监测其不断演变的知识状态,并且可以通过预测学生在未来问题上的表现来进行间接评估。在本文中,我们观察到存在一种常见的答案偏差现象,即每个问题的正确答案和错误答案分布极不均衡。现有模型倾向于将答案偏差记忆下来,作为在知识追踪中实现高预测性能的捷径,从而无法充分理解学生的知识状态。为了解决这个问题,我们从因果关系的角度来处理知识追踪任务。首先建立了一个知识追踪的因果图,从中我们确定答案偏差的影响在于问题对学生回答的直接因果效应。进一步提出了一种用于知识追踪的新颖的反事实推理(CORE)框架,该框架在训练期间分别捕捉总因果效应和直接因果效应,并在测试中通过从前者中减去后者来减轻答案偏差。CORE框架适用于各种现有的知识追踪模型,并且我们分别基于流行的DKT、DKVMN和AKT模型来实现它。在三个基准数据集上进行的大量实验证明了CORE在进行知识追踪的无偏差推理方面的有效性。我们已在https://github.com/lucky7-code/CORE上发布了我们的代码。