• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

二分项目的信度和真分数测量与其拉施难度参数的函数关系。

Reliability and true-score measures of binary items as a function of their Rasch difficulty parameter.

作者信息

Dimitrov Dimiter M

机构信息

Graduate School of Education, MNN4B3, 4400 University Dr., George Mason University, Fairfax, VA 22030, USA.

出版信息

J Appl Meas. 2003;4(3):222-33.

PMID:12904673
Abstract

This article provides formulas for expected true-score measures and reliability of binary items as a function of their Rasch difficulty when the trait (ability) distribution is normal or logistic. The proposed formulas have theoretical value and can be useful in test development, score analysis, and simulation studies. Once the items are calibrated with the dichotomous Rasch model, one can estimate (without further data collection) the expected values for true-score measures (e.g., domain score, true score variance, and error variance for the number-right score) and reliability for both norm-referenced and criterion-referenced interpretations. Thus, given a bank of Rasch calibrated items, one can develop a test with desirable values of population true-score measures and reliability or compare such measures for subsets of items that are grouped by substantive characteristics (e.g., content areas or strands of learning outcomes). An illustrative example for using the proposed formulas is also provided.

摘要

本文给出了在特质(能力)分布为正态或逻辑分布时,作为二分项目Rasch难度函数的期望真分数测量值和信度的公式。所提出的公式具有理论价值,可用于测试开发、分数分析和模拟研究。一旦用二分Rasch模型对项目进行校准,就可以(无需进一步收集数据)估计真分数测量值的期望值(例如,领域分数、真分数方差和答对分数的误差方差)以及常模参照和标准参照解释的信度。因此,给定一组经Rasch校准的项目,就可以开发出具有理想总体真分数测量值和信度的测试,或者比较按实质特征(如内容领域或学习成果的维度)分组的项目子集的此类测量值。还提供了一个使用所提出公式的示例。

相似文献

1
Reliability and true-score measures of binary items as a function of their Rasch difficulty parameter.二分项目的信度和真分数测量与其拉施难度参数的函数关系。
J Appl Meas. 2003;4(3):222-33.
2
Standard setting with dichotomous and constructed response items: some Rasch model approaches.使用二分法和结构化反应题目的标准设定:一些拉施模型方法。
J Appl Meas. 2009;10(4):438-54.
3
Using the Rasch model in nursing research: an introduction and illustrative example.护理研究中Rasch模型的应用:介绍与示例
Int J Nurs Stud. 2009 Mar;46(3):380-93. doi: 10.1016/j.ijnurstu.2008.10.007. Epub 2008 Dec 6.
4
Rasch analysis of distractors in multiple-choice items.多项选择题中干扰项的拉施分析
J Outcome Meas. 1998;2(1):43-65.
5
An analysis of dimensionality using factor analysis (true-score theory) and Rasch measurement: what is the difference? Which method is better?使用因子分析(真分数理论)和拉施测量法进行维度分析:有何差异?哪种方法更好?
J Appl Meas. 2005;6(1):80-99.
6
Expected linking error resulting from item parameter drift among the common Items on Rasch calibrated tests.拉施校准测试中常见项目间因项目参数漂移而产生的预期链接错误。
J Appl Meas. 2005;6(1):48-56.
7
FIM measurement properties and Rasch model details.FIM测量属性及拉施模型细节。
Scand J Rehabil Med. 1997 Dec;29(4):267-72.
8
Rasch fit statistics as a test of the invariance of item parameter estimates.拉施拟合统计作为项目参数估计不变性的一种检验。
J Appl Meas. 2003;4(2):153-63.
9
Utilizing Rasch measurement models to develop a computer adaptive self-report of walking, climbing, and running.利用拉施测量模型开发一种关于行走、攀爬和跑步的计算机自适应自我报告。
Disabil Rehabil. 2008;30(6):458-67. doi: 10.1080/09638280701617317.
10
Conditional pairwise estimation in the Rasch model for ordered response categories using principal components.使用主成分对有序反应类别Rasch模型中的条件成对估计。
J Appl Meas. 2003;4(3):205-21.

引用本文的文献

1
Proof of Reliability Convergence to 1 at Rate of Spearman-Brown Formula for Random Test Forms and Irrespective of Item Pool Dimensionality.信度收敛证明达到斯皮尔曼-布朗公式速率,适用于随机测试形式且与项目库维度无关。
Psychometrika. 2024 Sep;89(3):774-795. doi: 10.1007/s11336-024-09956-7. Epub 2024 Mar 12.
2
Reliability and Time Course of Postexercise Hypotension during Exercise Training among Adults with Hypertension.高血压成人运动训练期间运动后低血压的可靠性和时程
J Cardiovasc Dev Dis. 2024 Jan 29;11(2):42. doi: 10.3390/jcdd11020042.
3
A Comparison of Modern and Popular Approaches to Calculating Reliability for Dichotomously Scored Items.
二分计分项目可靠性计算的现代方法与常用方法比较
Appl Psychol Meas. 2022 Jun;46(4):321-337. doi: 10.1177/01466216221084210. Epub 2022 Apr 14.
4
On True Score Evaluation Using Item Response Theory Modeling.基于项目反应理论模型的真分数评估
Educ Psychol Meas. 2019 Aug;79(4):796-807. doi: 10.1177/0013164417741711. Epub 2017 Nov 16.
5
The Delta-Scoring Method of Tests With Binary Items: A Note on True Score Estimation and Equating.具有二元项目的测试的德尔塔评分方法:关于真分数估计和等值的一则注释
Educ Psychol Meas. 2018 Oct;78(5):805-825. doi: 10.1177/0013164417724187. Epub 2017 Aug 4.
6
Accounting for Differential Item Functioning Using Bayesian Approximate Measurement Invariance.使用贝叶斯近似测量不变性来解释项目功能差异
Educ Psychol Meas. 2020 Aug;80(4):638-664. doi: 10.1177/0013164419887482. Epub 2019 Dec 4.
7
Developing Multistage Tests Using -Scoring Method.使用-评分法开发多阶段测试。 (你提供的原文中“-Scoring Method”前面似乎少了具体内容,可能会影响更准确的理解和翻译)
Educ Psychol Meas. 2019 Oct;79(5):988-1008. doi: 10.1177/0013164419841428. Epub 2019 Apr 22.
8
Validation of Infant and Young Child Feeding Questionnaire for the Assessment of Knowledge, Attitudes and Practices among Child Care Providers: The IYCF-CCPQ.婴儿和幼儿喂养调查问卷的验证,用于评估儿童照护者的知识、态度和实践:IYCF-CCPQ。
Int J Environ Res Public Health. 2019 Jun 17;16(12):2147. doi: 10.3390/ijerph16122147.
9
A Note on the -Scoring Method Adapted for Polytomous Test Items.关于适用于多值测试项目的-评分方法的说明。
Educ Psychol Meas. 2019 Jun;79(3):545-557. doi: 10.1177/0013164418786014. Epub 2018 Jul 4.
10
An Approach to Scoring and Equating Tests With Binary Items: Piloting With Large-Scale Assessments.一种针对具有二元项目的测试进行评分和等值处理的方法:大规模评估试点
Educ Psychol Meas. 2016 Dec;76(6):954-975. doi: 10.1177/0013164416631100. Epub 2016 Feb 16.