Suppr超能文献

使用项目反应模型改进健康教育与健康行为研究中的测量:与经典测试理论方法的比较

Improving measurement in health education and health behavior research using item response modeling: comparison with the classical test theory approach.

作者信息

Wilson Mark, Allen Diane D, Li Jun Corser

机构信息

Graduate School of Education, University of California, Berkeley, CA 94720, USA.

出版信息

Health Educ Res. 2006 Dec;21 Suppl 1:i19-32. doi: 10.1093/her/cyl053. Epub 2006 Jul 31.

Abstract

This paper compares the approach and resultant outcomes of item response models (IRMs) and classical test theory (CTT). First, it reviews basic ideas of CTT, and compares them to the ideas about using IRMs introduced in an earlier paper. It then applies a comparison scheme based on the AERA/APA/NCME 'Standards for Educational and Psychological Tests' to compare the two approaches under three general headings: (i) choosing a model; (ii) evidence for reliability--incorporating reliability coefficients and measurement error--and (iii) evidence for validity--including evidence based on instrument content, response processes, internal structure, other variables and consequences. An example analysis of a self-efficacy (SE) scale for exercise is used to illustrate these comparisons. The investigation found that there were (i) aspects of the techniques and outcomes that were similar between the two approaches, (ii) aspects where the item response modeling approach contributes to instrument construction and evaluation beyond the classical approach and (iii) aspects of the analysis where the measurement models had little to do with the analysis or outcomes. There were no aspects where the classical approach contributed to instrument construction or evaluation beyond what could be done with the IRM approach. Finally, properties of the SE scale are summarized and recommendations made.

摘要

本文比较了项目反应模型(IRMs)和经典测验理论(CTT)的方法及最终结果。首先,回顾了CTT的基本概念,并将其与早期一篇论文中介绍的使用IRMs的概念进行比较。然后,应用基于美国教育研究协会/美国心理学会/美国国家教育测量学会《教育和心理测验标准》的比较方案,在三个总体标题下比较这两种方法:(i)选择模型;(ii)可靠性证据——包括可靠性系数和测量误差——以及(iii)有效性证据——包括基于测验内容、反应过程、内部结构、其他变量和结果的证据。通过对一份运动自我效能感(SE)量表的实例分析来说明这些比较。调查发现:(i)两种方法在技术和结果方面存在相似之处;(ii)项目反应建模方法在仪器构建和评估方面比经典方法有更大贡献的方面;(iii)测量模型与分析或结果几乎无关的分析方面。经典方法在仪器构建或评估方面没有比IRM方法能做到的更多贡献的方面。最后,总结了SE量表的特性并提出了建议。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验