从 α 到 ω 的可靠性：教程。

Reliability from α to ω: A tutorial.

机构信息

Department of Psychology.

Department of Medical Social Sciences.

出版信息

Psychol Assess. 2019 Dec;31(12):1395-1411. doi: 10.1037/pas0000754. Epub 2019 Aug 5.

DOI:10.1037/pas0000754

PMID:31380696

Abstract

Reliability is a fundamental problem for measurement in all of science. Although defined in multiple ways, and estimated in even more ways, the basic concepts seem straightforward and need to be understood by practitioners as well as methodologists. Reliability theory is not just for the psychometrician estimating latent variables, it is for everyone who wants to make inferences from measures of individuals or of groups. For the case of a single test administration, we consider multiple measures of reliability, ranging from the worst (β) to average (α, λ3) to best (λ4) split half reliabilities, and consider why model-based estimates (ωh, ωt) should be reported. We also address the utility of test-retest and alternate form reliabilities. The advantages of immediate versus delayed retests to decompose observed score variance into specific, state, and trait scores are discussed. But reliability is not just for test scores, it is also important when evaluating the use of ratings. Estimates that may be applied to continuous data include a set of intraclass correlations while discrete categorical data needs to take advantage of the family of κ statistics. Examples of these various reliability estimates are given using state and trait measures of anxiety given with different delays and under different conditions. An online supplemental materials is provided with more detail and elaboration. The online supplemental materials is also used to demonstrate applications of open source software to examples of real data, and comparisons are made between the many types of reliability. (PsycINFO Database Record (c) 2019 APA, all rights reserved).

摘要

可靠性是所有科学测量中的一个基本问题。尽管可靠性已经有了多种定义和多种估计方法，但基本概念似乎很简单，从业者和方法学家都需要理解。可靠性理论不仅适用于估计潜在变量的心理计量学家，也适用于希望从个体或群体的测量结果中进行推断的每个人。对于单次测试的情况，我们考虑了多种可靠性度量，从最差（β）到平均（α，λ3）到最佳（λ4）分半可靠性，并且考虑了为什么应该报告基于模型的估计值（ωh，ωt）。我们还讨论了重测和复本信度的效用。讨论了立即重测与延迟重测的优势，以将观察到的分数方差分解为特定的、状态的和特质的分数。但是，可靠性不仅适用于测试分数，在评估评分的使用时也很重要。可以应用于连续数据的估计值包括一组组内相关系数，而离散分类数据需要利用κ统计量族。使用不同延迟和不同条件下给出的焦虑状态和特质测量，给出了这些各种可靠性估计的示例。在线补充材料提供了更详细和详细的说明。在线补充材料还用于演示开源软件在真实数据示例中的应用，并对许多类型的可靠性进行比较。（PsycINFO 数据库记录（c）2019 APA，保留所有权利）。

相似文献

Reliability from α to ω: A tutorial.从 α 到 ω 的可靠性：教程。

Psychol Assess. 2019 Dec;31(12):1395-1411. doi: 10.1037/pas0000754. Epub 2019 Aug 5.

Composite reliability of multilevel data: It's about observed scores and construct meanings.多级数据的组合信度：关乎观测分数与构念意义。

Psychol Methods. 2021 Feb;26(1):90-102. doi: 10.1037/met0000287. Epub 2020 Jul 16.

Confidence intervals for population reliability coefficients: Evaluation of methods, recommendations, and software for composite measures.群体可靠性系数的置信区间：综合测量方法、建议和软件的评估。

Psychol Methods. 2016 Mar;21(1):69-92. doi: 10.1037/a0040086.

Thanks coefficient alpha, we'll take it from here.谢谢克朗巴哈系数，接下来我们自己来。

Psychol Methods. 2018 Sep;23(3):412-433. doi: 10.1037/met0000144. Epub 2017 May 29.

Practical considerations for evaluating reliability in ambulatory assessment studies.评估动态评估研究中可靠性的实用考虑因素。

Psychol Assess. 2019 Mar;31(3):285-291. doi: 10.1037/pas0000599.

The Diagnostic Infant Preschool Assessment-Likert Version: Preparation, Concurrent Construct Validation, and Test-Retest Reliability.《诊断性婴幼儿及学龄前儿童评估-李克特量表版：编制、同时效度验证及重测信度》

J Child Adolesc Psychopharmacol. 2020 Jun;30(5):326-334. doi: 10.1089/cap.2019.0168. Epub 2020 Mar 10.

A tutorial on the meta-analytic structural equation modeling of reliability coefficients.可靠性系数的元分析结构方程建模教程。

Psychol Methods. 2020 Dec;25(6):747-775. doi: 10.1037/met0000261. Epub 2020 Mar 5.

Conditional reliability coefficients for test scores.测试分数的条件可靠性系数。

Psychol Methods. 2018 Jun;23(2):351-362. doi: 10.1037/met0000132. Epub 2017 Apr 6.

Autoregressive mediation models using composite scores and latent variables: Comparisons and recommendations.使用综合评分和潜在变量的自回归中介模型：比较与建议。

Psychol Methods. 2020 Aug;25(4):472-495. doi: 10.1037/met0000251. Epub 2020 Apr 9.

Reliability and omega hierarchical in multidimensional data: A comparison of various estimators.多维数据中的可靠性与欧米伽层级：各种估计量的比较

Psychol Methods. 2025 Feb;30(1):40-59. doi: 10.1037/met0000525. Epub 2022 Sep 1.

引用本文的文献

Psychometric properties and qualitative evaluation of a Swedish translation of the New Sexual Satisfaction Scale-Short (NSSS-S).新性满意度量表简版（NSSS-S）瑞典语翻译版的心理测量特性及质性评价

PLoS One. 2025 Aug 25;20(8):e0330353. doi: 10.1371/journal.pone.0330353. eCollection 2025.

Intra- and inter-observer reliability and repeatability of the metatarsus adductus angle in childhood: A concordance study.儿童期内收足跖骨角的观察者间及观察者内可靠性与可重复性：一项一致性研究。

Pediatr Radiol. 2025 Aug 25. doi: 10.1007/s00247-025-06375-3.

Reliability and validity evidence for the physical activity parenting questionnaire for children (PAP-C) in Chinese school-aged children.中国学龄儿童体育活动育儿问卷（PAP-C）的信效度证据

Eur J Pediatr. 2025 Aug 18;184(9):561. doi: 10.1007/s00431-025-06413-0.

On the psychometric properties and genomic etiology of the general factor of psychopathology.关于精神病理学一般因素的心理测量特性和基因组病因学。

Mol Psychiatry. 2025 Aug 14. doi: 10.1038/s41380-025-03151-5.

Tutorial: Power analyses for interaction effects in cross-sectional regressions.教程：横断面回归中交互效应的功效分析。

Adv Methods Pract Psychol Sci. 2023 Jul-Sep;6(3). doi: 10.1177/25152459231187531. Epub 2023 Sep 26.

Engaging people with lived experience of psychological disorders: Current research and future directions for community-engaged measure development in psychological science.让有心理障碍亲身经历的人参与进来：心理学领域社区参与式测量发展的当前研究与未来方向。

Clin Psychol Sci. 2025 Jul;13(4):720-739. doi: 10.1177/21677026241304339. Epub 2025 Feb 5.

Beyond school gates: the role of motivation in music learning on elementary school students' daily music listening behaviors.校门之外：动机对小学生日常音乐聆听行为中音乐学习的作用。

Front Psychol. 2025 Jul 16;16:1441572. doi: 10.3389/fpsyg.2025.1441572. eCollection 2025.

Development of the Movement Disorders Interpretation Bias Scale and psychometric evaluation in adults with Huntington's disease.运动障碍解释偏差量表的编制及对成年亨廷顿舞蹈病患者的心理测量学评估

Parkinsonism Relat Disord. 2025 Jul 21;138:107971. doi: 10.1016/j.parkreldis.2025.107971.

Some Considerations for the Use of the Abbreviated Profile of Hearing Aid Benefit (APHAB) as a Hearing-Aid Outcome Measure.关于使用助听器受益简表（APHAB）作为助听器效果评估指标的一些考量

Trends Hear. 2025 Jan-Dec;29:23312165251359755. doi: 10.1177/23312165251359755. Epub 2025 Jul 21.

Baumann Skin Type Questionnaire (BSTQ): creation and validation of the Polish language version - part two.鲍曼皮肤类型问卷（BSTQ）：波兰语版本的创建与验证 - 第二部分

Postepy Dermatol Alergol. 2025 Apr 15;42(3):259-266. doi: 10.5114/ada.2025.149555. eCollection 2025 Jun.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

从 α 到 ω 的可靠性：教程。

Reliability from α to ω: A tutorial.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献