Comparison of different reliability estimation methods for single-item assessment: a simulation study.

Author information

Zhang Sijun, Colvin Kimberly

Affiliations

Institute of Educational Sciences, Hunan University, Changsha, China.

School of Education, University at Albany, Albany, NY, United States.

Publication information

Front Psychol. 2024 Nov 1;15:1482016. doi: 10.3389/fpsyg.2024.1482016. eCollection 2024.

Abstract

Single-item assessments have recently become popular in various fields, and researchers have developed several methods for estimating their reliability: some are based on factor analysis and correction for attenuation, while others use the double monotonicity model, Guttman's λ, or the latent class model. However, no empirical study has investigated which method best estimates the reliability of single-item assessments. This study addressed that question with a simulation. To represent assessments as they are found in practice, the simulation varied four factors: the item discrimination parameter, the test length of the multi-item assessment of the same construct, the sample size, and the correlation between the single-item assessment and the multi-item assessment of the same construct. The results suggest that by using the method based on the double monotonicity model together with the method based on correction for attenuation, researchers can obtain the most precise estimate of the range of reliability of a single-item assessment in 94.44% of cases. None of the four factors influenced the choice of method.
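
As a rough illustration of the correction-for-attenuation idea named in the abstract, the sketch below rearranges the classical attenuation formula, true correlation = r_xy / sqrt(r_xx * r_yy), to solve for the single-item reliability r_xx. This is only a minimal sketch under stated assumptions, not the authors' implementation: the function name `single_item_reliability_ca` and its parameters are hypothetical, and the default assumption that the single item and the multi-item scale correlate perfectly at the true-score level (`true_corr=1.0`) yields a lower-bound style estimate.

```python
def single_item_reliability_ca(r_xy, r_yy, true_corr=1.0):
    """Estimate single-item reliability via correction for attenuation.

    r_xy      : observed correlation between the single item and the
                multi-item scale measuring the same construct
    r_yy      : reliability of the multi-item scale (e.g., coefficient alpha)
    true_corr : assumed true-score correlation between the two measures
                (hypothetical parameter; 1.0 means "same construct")

    Rearranges  true_corr = r_xy / sqrt(r_xx * r_yy)  into
                r_xx = r_xy**2 / (true_corr**2 * r_yy).
    """
    if not 0.0 < r_yy <= 1.0:
        raise ValueError("r_yy must lie in (0, 1]")
    if not 0.0 < true_corr <= 1.0:
        raise ValueError("true_corr must lie in (0, 1]")
    if not -1.0 <= r_xy <= 1.0:
        raise ValueError("r_xy must lie in [-1, 1]")
    return r_xy ** 2 / (true_corr ** 2 * r_yy)


if __name__ == "__main__":
    # Illustrative inputs only; these numbers are not taken from the study.
    print(single_item_reliability_ca(r_xy=0.6, r_yy=0.9))
```

Because the assumed true-score correlation cannot exceed 1, lowering `true_corr` can only raise the estimate, which is why the default setting is usually read as a conservative bound. The result is not clipped, so sample values of `r_xy` that are large relative to `r_yy` can give estimates above 1.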
