通过因子分析模型探讨经典测试理论与项目反应理论框架之间的关系。

Relationships Among Classical Test Theory and Item Response Theory Frameworks via Factor Analytic Models.

作者信息

Kohli Nidhi, Koran Jennifer, Henn Lisa

机构信息

University of Minnesota, Minneapolis, MN, USA.

Southern Illinois University, Carbondale, IL, USA.

出版信息

Educ Psychol Meas. 2015 Jun;75(3):389-405. doi: 10.1177/0013164414559071. Epub 2014 Nov 20.

DOI:10.1177/0013164414559071

PMID:29795826

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5965645/

Abstract

There are well-defined theoretical differences between the classical test theory (CTT) and item response theory (IRT) frameworks. It is understood that in the CTT framework, person and item statistics are test- and sample-dependent. This is not the perception with IRT. For this reason, the IRT framework is considered to be theoretically superior to the CTT framework for the purpose of estimating person and item parameters. In previous simulation studies, IRT models were used both as generating and as fitting models. Hence, results favoring the IRT framework could be attributed to IRT being the data-generation framework. Moreover, previous studies only considered the traditional CTT framework for the comparison, yet there is considerable literature suggesting that it may be more appropriate to use CTT statistics based on an underlying normal variable (UNV) assumption. The current study relates the class of CTT-based models with the UNV assumption to that of IRT, using confirmatory factor analysis to delineate the connections. A small Monte Carlo study was carried out to assess the comparability between the item and person statistics obtained from the frameworks of IRT and CTT with UNV assumption. Results show the frameworks of IRT and CTT with UNV assumption to be quite comparable, with neither framework showing an advantage over the other.

摘要

经典测验理论（CTT）和项目反应理论（IRT）框架之间存在明确的理论差异。据了解，在CTT框架中，个体和项目统计数据依赖于测验和样本。而IRT并非如此。因此，就估计个体和项目参数而言，IRT框架在理论上被认为优于CTT框架。在以往的模拟研究中，IRT模型既被用作生成模型，也被用作拟合模型。因此，支持IRT框架的结果可能归因于IRT是数据生成框架。此外，以往的研究在比较时仅考虑了传统的CTT框架，但有大量文献表明，基于潜在正态变量（UNV）假设使用CTT统计数据可能更合适。本研究使用验证性因素分析来描述基于UNV假设的CTT模型类别与IRT模型类别之间的联系。进行了一项小型蒙特卡罗研究，以评估从IRT框架和基于UNV假设的CTT框架获得的项目和个体统计数据之间的可比性。结果表明，基于UNV假设的IRT框架和CTT框架具有相当的可比性，两个框架均未显示出优于对方的优势。

相似文献

Relationships Among Classical Test Theory and Item Response Theory Frameworks via Factor Analytic Models.

Educ Psychol Meas. 2015 Jun;75(3):389-405. doi: 10.1177/0013164414559071. Epub 2014 Nov 20.

Methodological issues regarding power of classical test theory (CTT) and item response theory (IRT)-based approaches for the comparison of patient-reported outcomes in two groups of patients--a simulation study.

BMC Med Res Methodol. 2010 Mar 25;10:24. doi: 10.1186/1471-2288-10-24.

A primer on classical test theory and item response theory for assessments in medical education.

Med Educ. 2010 Jan;44(1):109-17. doi: 10.1111/j.1365-2923.2009.03425.x.

Does Scoring Method Impact Estimation of Significant Individual Changes Assessed by Patient-Reported Outcome Measures? Comparing Classical Test Theory Versus Item Response Theory.

Value Health. 2023 Oct;26(10):1518-1524. doi: 10.1016/j.jval.2023.06.002. Epub 2023 Jun 12.

Comparison of Classical Test Theory and Item Response Theory in Individual Change Assessment.

Appl Psychol Meas. 2016 Nov;40(8):559-572. doi: 10.1177/0146621616664046. Epub 2016 Sep 24.

Evaluating Equating Transformations in IRT Observed-Score and Kernel Equating Methods.

Appl Psychol Meas. 2023 Mar;47(2):123-140. doi: 10.1177/01466216221124087. Epub 2022 Oct 4.

What's in a score: A longitudinal investigation of scores based on item response theory and classical test theory for the Amsterdam Instrumental Activities of Daily Living Questionnaire in cognitively normal and impaired older adults.

Neuropsychology. 2024 Jan;38(1):96-105. doi: 10.1037/neu0000914. Epub 2023 Sep 7.

Approximate Functional Relationship between IRT and CTT Item Discrimination Indices: A Simulation, Validation, and Practical Extension of Lord's (1980) Formula.

J Appl Meas. 2017;18(4):393-407.

Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples.

Value Health. 2015 Jan;18(1):25-34. doi: 10.1016/j.jval.2014.10.005.

State of the psychometric methods: comments on the ISOQOL SIG psychometric papers.

J Patient Rep Outcomes. 2019 Jul 30;3(1):49. doi: 10.1186/s41687-019-0134-1.

引用本文的文献

Bayesian item response theory to estimate power in clinical trials with patient-reported outcomes as endpoints.

Qual Life Res. 2025 Apr;34(4):1113-1124. doi: 10.1007/s11136-024-03874-y. Epub 2025 Jan 8.

Latent -Scoring Modeling: Estimation of Item and Person Parameters.

Educ Psychol Meas. 2021 Apr;81(2):388-404. doi: 10.1177/0013164420941147. Epub 2020 Jul 13.

Reliability and validity of the Pittsburgh Sleep Quality Index among frontline COVID-19 health care workers using classical test theory and item response theory.

J Clin Sleep Med. 2022 Feb 1;18(2):541-551. doi: 10.5664/jcsm.9658.

On True Score Evaluation Using Item Response Theory Modeling.

Educ Psychol Meas. 2019 Aug;79(4):796-807. doi: 10.1177/0013164417741711. Epub 2017 Nov 16.

The Delta-Scoring Method of Tests With Binary Items: A Note on True Score Estimation and Equating.

Educ Psychol Meas. 2018 Oct;78(5):805-825. doi: 10.1177/0013164417724187. Epub 2017 Aug 4.

Reliability and validity of on-road driving tests in vulnerable adults: a systematic review.

Int J Rehabil Res. 2019 Dec;42(4):289-299. doi: 10.1097/MRR.0000000000000374.

An Approach to Scoring and Equating Tests With Binary Items: Piloting With Large-Scale Assessments.

Educ Psychol Meas. 2016 Dec;76(6):954-975. doi: 10.1177/0013164416631100. Epub 2016 Feb 16.

On the Relationship Between Classical Test Theory and Item Response Theory: From One to the Other and Back.

Educ Psychol Meas. 2016 Apr;76(2):325-338. doi: 10.1177/0013164415576958. Epub 2015 Apr 1.

本文引用的文献

Item factor analysis: current approaches and future directions.

Psychol Methods. 2007 Mar;12(1):58-79. doi: 10.1037/1082-989X.12.1.58.

Some links between classical and modern test theory via the two-level hierarchical generalized linear model.

J Appl Meas. 2005;6(3):289-310.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过因子分析模型探讨经典测试理论与项目反应理论框架之间的关系。

Relationships Among Classical Test Theory and Item Response Theory Frameworks via Factor Analytic Models.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献