A comparison of three polytomous item response theory models in the context of testlet scoring.

Authors

Cook K F, Dodd B G, Fitzpatrick S J

Affiliation

Baylor College of Medicine/Veterans Affairs, Houston, Texas, USA.

Publication

J Outcome Meas. 1999;3(1):1-20.

PMID: 10063769
Abstract

An alternative to dichotomous scoring of multiple items anchored to a common stem is scoring these items as a single polytomous item (testlet scoring). This study systematically compared the partial credit model (PCM), the generalized partial credit model (GPCM), and the graded response model (GRM) in the context of testlet scoring. Data sets included a sample from the fall 1994 administration of the SAT I (N = 2,548) and a simulated data set. Theta estimation, information, and model fit were analyzed. Correlations among theta estimates ranged from 0.9748 to 0.9921. The relationship among the information functions of the PCM, GPCM and the GRM reflected the discrimination parameter estimates for the latter two models. Suggestions are made with regard to model selection.


Similar articles

1
A comparison of three polytomous item response theory models in the context of testlet scoring.
J Outcome Meas. 1999;3(1):1-20.
2
Polytomous multilevel testlet models for testlet-based assessments with complex sampling designs.
Br J Math Stat Psychol. 2015 Feb;68(1):65-83. doi: 10.1111/bmsp.12035. Epub 2014 Feb 27.
3
The impact of model misfit on partial credit model parameter estimates.
J Appl Meas. 2004;5(2):115-28.
4
Rasch analysis of distractors in multiple-choice items.
J Outcome Meas. 1998;2(1):43-65.
5
Polytomous Testlet Response Models for Technology-Enhanced Innovative Items: Implications on Model Fit and Trait Inference.
Educ Psychol Meas. 2022 Aug;82(4):811-838. doi: 10.1177/00131644211032261. Epub 2021 Aug 2.
6
Barthel Index of activities of daily living: item response theory analysis of ratings for long-term care residents.
Nurs Res. 2015 Mar-Apr;64(2):88-99. doi: 10.1097/NNR.0000000000000072.
7
A Box-Cox normal model for response times.
Br J Math Stat Psychol. 2009 Nov;62(Pt 3):621-40. doi: 10.1348/000711008X374126. Epub 2009 Jan 30.
8
Psychometric properties for the Balanced Inventory of Desirable Responding: dichotomous versus polytomous conventional and IRT scoring.
Psychol Assess. 2014 Sep;26(3):878-91. doi: 10.1037/a0036430. Epub 2014 Apr 7.
9
A graded response model for measuring person reliability.
Br J Math Stat Psychol. 2009 Nov;62(Pt 3):641-62. doi: 10.1348/000711008X377745. Epub 2009 Jan 20.
10
Item response theory analyses of physical functioning items in the medical outcomes study.
Med Care. 2007 May;45(5 Suppl 1):S32-8. doi: 10.1097/01.mlr.0000246649.43232.82.

Cited by

1
Getting started with the graded response model: An introduction and tutorial in R.
Int J Psychol. 2025 Feb;60(1):e13265. doi: 10.1002/ijop.13265. Epub 2024 Nov 12.
2
A Novel and Highly Effective Bayesian Sampling Algorithm Based on the Auxiliary Variables to Estimate the Testlet Effect Models.
Front Psychol. 2021 Aug 11;12:509575. doi: 10.3389/fpsyg.2021.509575. eCollection 2021.
3
The transition to digital presentation of the diagnostic imaging domain of the Part IV examination of the National Board of Chiropractic Examiners.
J Chiropr Educ. 2020 Mar;34(1):52-67. doi: 10.7899/JCE-19-2. Epub 2020 Jan 8.
4
The Patient-Reported Experience Measure for Improving qUality of care in Mental health (PREMIUM) project in France: study protocol for the development and implementation strategy.
Patient Prefer Adherence. 2019 Jan 21;13:165-177. doi: 10.2147/PPA.S172100. eCollection 2019.
5
lordif: An R Package for Detecting Differential Item Functioning Using Iterative Hybrid Ordinal Logistic Regression/Item Response Theory and Monte Carlo Simulations.
J Stat Softw. 2011 Mar 1;39(8):1-30. doi: 10.18637/jss.v039.i08.