Suppr超能文献

从 α 到 ω 的可靠性:教程。

Reliability from α to ω: A tutorial.

机构信息

Department of Psychology.

Department of Medical Social Sciences.

出版信息

Psychol Assess. 2019 Dec;31(12):1395-1411. doi: 10.1037/pas0000754. Epub 2019 Aug 5.

Abstract

Reliability is a fundamental problem for measurement in all of science. Although defined in multiple ways, and estimated in even more ways, the basic concepts seem straightforward and need to be understood by practitioners as well as methodologists. Reliability theory is not just for the psychometrician estimating latent variables, it is for everyone who wants to make inferences from measures of individuals or of groups. For the case of a single test administration, we consider multiple measures of reliability, ranging from the worst (β) to average (α, λ3) to best (λ4) split half reliabilities, and consider why model-based estimates (ωh, ωt) should be reported. We also address the utility of test-retest and alternate form reliabilities. The advantages of immediate versus delayed retests to decompose observed score variance into specific, state, and trait scores are discussed. But reliability is not just for test scores, it is also important when evaluating the use of ratings. Estimates that may be applied to continuous data include a set of intraclass correlations while discrete categorical data needs to take advantage of the family of κ statistics. Examples of these various reliability estimates are given using state and trait measures of anxiety given with different delays and under different conditions. An online supplemental materials is provided with more detail and elaboration. The online supplemental materials is also used to demonstrate applications of open source software to examples of real data, and comparisons are made between the many types of reliability. (PsycINFO Database Record (c) 2019 APA, all rights reserved).

摘要

可靠性是所有科学测量中的一个基本问题。尽管可靠性已经有了多种定义和多种估计方法,但基本概念似乎很简单,从业者和方法学家都需要理解。可靠性理论不仅适用于估计潜在变量的心理计量学家,也适用于希望从个体或群体的测量结果中进行推断的每个人。对于单次测试的情况,我们考虑了多种可靠性度量,从最差(β)到平均(α,λ3)到最佳(λ4)分半可靠性,并且考虑了为什么应该报告基于模型的估计值(ωh,ωt)。我们还讨论了重测和复本信度的效用。讨论了立即重测与延迟重测的优势,以将观察到的分数方差分解为特定的、状态的和特质的分数。但是,可靠性不仅适用于测试分数,在评估评分的使用时也很重要。可以应用于连续数据的估计值包括一组组内相关系数,而离散分类数据需要利用κ统计量族。使用不同延迟和不同条件下给出的焦虑状态和特质测量,给出了这些各种可靠性估计的示例。在线补充材料提供了更详细和详细的说明。在线补充材料还用于演示开源软件在真实数据示例中的应用,并对许多类型的可靠性进行比较。(PsycINFO 数据库记录(c)2019 APA,保留所有权利)。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验