Maximin criterion for item selection in computerized adaptive testing.

Authors

Chen Jyun-Hong, Chao Hsiu-Yi

Affiliations

Department of Psychology, National Cheng Kung University, No. 1, University Road, Tainan City, 701401, Taiwan.

Department of Psychology, Soochow University, No. 70, Linhsi Road, Taipei City, 111002, Taiwan.

Publication information

Behav Res Methods. 2025 May 28;57(7):180. doi: 10.3758/s13428-025-02673-8.

DOI:10.3758/s13428-025-02673-8

PMID:40437293

Abstract

In computerized adaptive testing (CAT), information-based item selection rules (ISRs), such as maximum Fisher information (MFI), often excessively rely on discriminating items, leading to unbalanced utilization of the item pool. To address this challenge, the present study introduced the MaxiMin Information (MMI) criterion, which is grounded in decision theory. MMI calculates each item's minimum information (I) within the current confidence interval (CI) of the trait level, selecting the item with the maximum I to be administered. For examinees with broader CIs (less precise trait estimates), MMI leans toward administering less discriminating items, which tend to yield larger I. Conversely, for narrower CIs, MMI aligns more closely with MFI by favoring items with higher discrimination. This indicates that MMI's item selection is tailored to each examinee based on his or her provisional trait estimate and its estimation precision. Five simulation studies were conducted to assess MMI's performance in CAT under various conditions. Results demonstrate that although MMI is comparable with other ISRs in terms of trait estimation precision, it excels in balancing item pool utilization. By fine-tuning confidence levels, MMI not only efficiently schedules the use of discriminating items toward the test's later stages to enhance test efficiency but also effectively adapts to different testing scenarios. From these findings, we generally recommend applying MMI with a confidence level of 95% to optimize item pool utilization without compromising trait estimation accuracy. With its evident advantages, MMI holds promise for practical applications, especially for high-stakes tests requiring utmost test efficiency and security.
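The selection rule described in the abstract can be illustrated with a minimal sketch. The abstract does not state the item response model or how the confidence interval is built, so the sketch assumes a two-parameter logistic (2PL) model and a normal-approximation interval of the provisional estimate plus or minus z standard errors. The function names (`info_2pl`, `mmi_select`, `mfi_select`) and the two-item pool are hypothetical and are not taken from the paper. Under the 2PL the information function is unimodal, so its minimum over an interval lies at one of the endpoints.

```python
import math


def info_2pl(theta, a, b):
    """Fisher information of a 2PL item (assumed model) at trait level theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


def mmi_select(items, theta_hat, se, z=1.96):
    """Return the index of the item whose minimum information over the CI
    [theta_hat - z*se, theta_hat + z*se] is largest (maximin rule).

    items: list of (a, b) pairs; z=1.96 corresponds to a 95% confidence level.
    """
    lo, hi = theta_hat - z * se, theta_hat + z * se

    def worst_case(item):
        a, b = item
        # 2PL information is unimodal, so the interval minimum is at an endpoint.
        return min(info_2pl(lo, a, b), info_2pl(hi, a, b))

    return max(range(len(items)), key=lambda i: worst_case(items[i]))


def mfi_select(items, theta_hat):
    """Maximum Fisher information: pick the most informative item at theta_hat."""
    return max(range(len(items)), key=lambda i: info_2pl(theta_hat, *items[i]))


# Hypothetical two-item pool: one highly discriminating, one weakly discriminating.
items = [(2.0, 0.0), (0.8, 0.0)]

wide = mmi_select(items, theta_hat=0.0, se=1.0)    # imprecise estimate, wide CI
narrow = mmi_select(items, theta_hat=0.0, se=0.1)  # precise estimate, narrow CI
mfi = mfi_select(items, theta_hat=0.0)
print(wide, narrow, mfi)
```

With the wide interval the sketch picks the less discriminating item, because the highly discriminating item's information drops off sharply toward the interval's endpoints. With the narrow interval it agrees with MFI, which matches the behavior the abstract describes.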
