Fused SDT/IRT Models for Mixed-Format Exams.

Author

DeCarlo Lawrence T

Affiliation

Teachers College, Columbia University, New York, NY, USA.

Publication details

Educ Psychol Meas. 2024 Dec;84(6):1076-1106. doi: 10.1177/00131644241235333. Epub 2024 Mar 28.

DOI:10.1177/00131644241235333
PMID:39484198
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11523186/

Abstract

A psychological framework for different types of items commonly used with mixed-format exams is proposed. A choice model based on signal detection theory (SDT) is used for multiple-choice (MC) items, whereas an item response theory (IRT) model is used for open-ended (OE) items. The SDT and IRT models are shown to share a common conceptualization in terms of latent states of "know/don't know" at the examinee level. This in turn suggests a way to join or "fuse" the models, namely through the probability of knowing. A general model that fuses the SDT choice model, for MC items, with a generalized sequential logit model, for OE items, is introduced. Fitting SDT and IRT models simultaneously allows one to examine possible differences in psychological processes across the different types of items, to examine the effects of covariates in both models simultaneously, to allow for relations among the model parameters, and likely offers potential estimation benefits. The utility of the approach is illustrated with MC and OE items from large-scale international exams.
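
The idea of linking the two item types through a probability of knowing can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the parameterization used in the article: the function names, the logistic form of the knowing probability, the softmax (Gumbel-noise) choice rule with unbiased distractors, and all parameter values are hypothetical. The sketch shows an MC item whose choice probabilities mix a "know" state and a "don't know" state, and an OE item scored with a sequential logit model, both driven by the same examinee ability `theta`.

```python
import math


def logistic(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def p_know(theta, difficulty):
    """Assumed (hypothetical) probability of the latent 'know' state."""
    return logistic(theta - difficulty)


def mc_choice_probs(lam, d, n_alternatives, correct=0):
    """Marginal choice probabilities for one multiple-choice item.

    Assumption: in the 'know' state the correct alternative's perceived
    plausibility is shifted by d and the examinee picks the most plausible
    alternative under Gumbel noise, which gives a softmax. In the
    'don't know' state all alternatives are equally likely (no bias).
    """
    weights = [math.exp(d) if k == correct else 1.0
               for k in range(n_alternatives)]
    total = sum(weights)
    know = [w / total for w in weights]
    guess = 1.0 / n_alternatives
    # Mixture over the two latent states, weighted by lam = P(know).
    return [lam * pk + (1.0 - lam) * guess for pk in know]


def oe_score_probs(theta, step_difficulties, slope=1.0):
    """Score probabilities for one open-ended item (sequential logit).

    P(pass step k | reached step k) = logistic(slope * (theta - b_k)).
    The score is the number of steps passed before the first failure.
    """
    probs = []
    reach = 1.0  # probability of reaching the current step
    for b in step_difficulties:
        p_pass = logistic(slope * (theta - b))
        probs.append(reach * (1.0 - p_pass))  # stop at this score
        reach *= p_pass
    probs.append(reach)  # passed every step: maximum score
    return probs


# The same theta feeds both item types; this shared quantity is the
# (assumed) link between the two models in this sketch.
theta = 0.0
lam = p_know(theta, difficulty=0.0)
mc = mc_choice_probs(lam, d=math.log(3.0), n_alternatives=4)
oe = oe_score_probs(theta, step_difficulties=[0.0, 0.0])
print("P(know):", lam)
print("MC choice probabilities:", mc)
print("OE score probabilities:", oe)
```

In this sketch the first step of the sequential logit plays the same role for the OE item that `p_know` plays for the MC item, which is one simple way to read the shared "know/don't know" conceptualization described in the abstract; the fused model in the article may tie the parameters together differently.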
