Tracking with (Un)Certainty.

Author information

Hofman Abe D, Brinkhuis Matthieu J S, Bolsinova Maria, Klaiber Jonathan, Maris Gunter, van der Maas Han L J

Affiliations

Department of Psychological Methods, University of Amsterdam, 1018 WS Amsterdam, The Netherlands.

Oefenweb, 1011 VL Amsterdam, The Netherlands.

Publication information

J Intell. 2020 Mar 3;8(1):10. doi: 10.3390/jintelligence8010010.

DOI:10.3390/jintelligence8010010

PMID:32138312

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7151223/

Abstract

One of the highest ambitions in educational technology is the move towards personalized learning. To this end, computerized adaptive learning (CAL) systems are developed. A popular method to track the development of student ability and item difficulty, in CAL systems, is the Elo Rating System (ERS). The ERS allows for dynamic model parameters by updating key parameters after every response. However, drawbacks of the ERS are that it does not provide standard errors and that it results in rating variance inflation. We identify three statistical issues responsible for both of these drawbacks. To solve these issues we introduce a new tracking system based on urns, where every person and item is represented by an urn filled with a combination of green and red marbles. Urns are updated, by an exchange of marbles after each response, such that the proportions of green marbles represent estimates of person ability or item difficulty. A main advantage of this approach is that the standard errors are known, hence the method allows for statistical inference, such as testing for learning effects. We highlight features of the Urnings algorithm and compare it to the popular ERS in a simulation study and in an empirical data example from a large-scale CAL application.
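The two tracking ideas named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the paper. `elo_update` uses the textbook Elo form with a logistic expectation, and the step size `k` is an arbitrary illustrative value. `urn_update` is one possible marble exchange, assuming that a response is simulated from the current urn contents and that one marble changes colour in each urn only when the observed and simulated responses differ; the abstract does not spell out the exchange rule, so this rule, the function names, and the binomial standard error in `binomial_se` are all assumptions.

```python
import math
import random


def elo_update(theta, beta, correct, k=0.3):
    """One Elo-style step: shift ability and difficulty by k * (observed - expected)."""
    # Logistic expectation of a correct response (textbook Elo form, assumed here).
    expected = 1.0 / (1.0 + math.exp(-(theta - beta)))
    delta = k * (correct - expected)
    return theta + delta, beta - delta


def urn_update(green_person, green_item, correct, size, rng):
    """Illustrative marble exchange between a person urn and an item urn.

    Both urns hold `size` marbles; only the green counts are tracked.
    ASSUMPTION: the exchange rule below is a sketch, not the published algorithm.
    """
    p = green_person / size  # proportion green in the person urn
    q = green_item / size    # proportion green in the item urn
    win = p * (1.0 - q)      # person marble green, item marble red
    lose = (1.0 - p) * q     # person marble red, item marble green
    if win + lose == 0.0:
        return green_person, green_item  # no informative draw is possible
    simulated = 1 if rng.random() < win / (win + lose) else 0
    shift = correct - simulated  # -1, 0, or +1
    # The person gains a green marble exactly when the item loses one.
    return green_person + shift, green_item - shift


def binomial_se(green, size):
    """Standard error of the green proportion, ASSUMING a binomial urn count."""
    p = green / size
    return math.sqrt(p * (1.0 - p) / size)


if __name__ == "__main__":
    rng = random.Random(1)
    person, item = 50, 50
    for _ in range(200):
        # Hypothetical response stream: the person answers correctly 80% of the time.
        observed = 1 if rng.random() < 0.8 else 0
        person, item = urn_update(person, item, observed, 100, rng)
    print(person / 100, binomial_se(person, 100))
```

In this sketch the total number of green marbles across the two urns never changes, and each count stays between 0 and `size`, so the green proportion is always a valid probability. That bounded, discrete state is what makes a closed-form standard error plausible, in contrast to the unbounded real-valued Elo ratings.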

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d53e/7151223/072a1f1795f4/jintelligence-08-00010-g0A1.jpg
