• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

如何理解信度?信度的通俗解释以及信度与效应量的关系。

How to Make Sense of Reliability? Common Language Interpretation of Reliability and the Relation of Reliability to Effect Size.

作者信息

Metsämuuronen Jari, Niemensivu Timi

机构信息

Faculty of Science, Turku Research Institute for Learning Analytics, University of Turku, Turku, Finland.

出版信息

Appl Psychol Meas. 2025 Jun 24:01466216251350159. doi: 10.1177/01466216251350159.

DOI:10.1177/01466216251350159
PMID:40575449
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12187714/
Abstract

Communicating the factual meaning of a particular reliability estimate is sometimes difficult. What does a specific reliability estimate of 0.80 or 0.95 mean in common language? Deflation-corrected estimates of reliability (DCER) using Somers' or Goodman-Kruskal as the item-score correlations are transformed into forms where specific estimates from the family of common language effect sizes are visible. This makes it possible to communicate reliability estimates using a common language and to evaluate the magnitude of a particular reliability estimate in the same way and with the same metric as we do with effect size estimates. Using a DCER, we can say that with = 40 items, if the reliability is 0.95, in 80 out of 100 random pairs of test takers from different subpopulations on all items combined, those with a higher item response will also score higher on the test. In this case, using the thresholds familiar from effect sizes, we can say that the reliability is "very high." The transformation of the reliability estimate into a common language effect size depends on the size of the item-score association estimates and the number of items, so no closed-form equations for the transformations are given. However, relevant thresholds are provided for practical use.

摘要

传达特定可靠性估计的实际意义有时很困难。用通俗的语言来说,特定的可靠性估计值0.80或0.95意味着什么?使用Somers' 或Goodman-Kruskal 作为项目得分相关性的经通胀校正的可靠性估计值(DCER)被转换为可见通用语言效应量族中特定估计值的形式。这使得使用通用语言传达可靠性估计值成为可能,并能够以与效应量估计相同的方式和相同的度量标准来评估特定可靠性估计值的大小。使用DCER,我们可以说,对于有40个项目的情况,如果可靠性为0.95,在来自不同亚群体的100对随机测试者中,就所有项目综合来看,在80对中,项目反应较高的测试者在测试中得分也会更高。在这种情况下,使用效应量中熟悉的阈值,我们可以说可靠性“非常高”。可靠性估计值向通用语言效应量的转换取决于项目得分关联估计值的大小和项目数量,因此没有给出转换的封闭形式方程。然而,提供了相关阈值以供实际使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/12187714/708e5883415d/10.1177_01466216251350159-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/12187714/32867aa92988/10.1177_01466216251350159-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/12187714/708e5883415d/10.1177_01466216251350159-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/12187714/32867aa92988/10.1177_01466216251350159-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a89/12187714/708e5883415d/10.1177_01466216251350159-fig2.jpg

相似文献

1
How to Make Sense of Reliability? Common Language Interpretation of Reliability and the Relation of Reliability to Effect Size.如何理解信度?信度的通俗解释以及信度与效应量的关系。
Appl Psychol Meas. 2025 Jun 24:01466216251350159. doi: 10.1177/01466216251350159.
2
Stigma Management Strategies of Autistic Social Media Users.自闭症社交媒体用户的污名管理策略
Autism Adulthood. 2025 May 28;7(3):273-282. doi: 10.1089/aut.2023.0095. eCollection 2025 Jun.
3
Automated monitoring compared to standard care for the early detection of sepsis in critically ill patients.与标准护理相比,自动监测用于危重症患者脓毒症的早期检测
Cochrane Database Syst Rev. 2018 Jun 25;6(6):CD012404. doi: 10.1002/14651858.CD012404.pub2.
4
Interventions to reduce harm from continued tobacco use.减少持续吸烟危害的干预措施。
Cochrane Database Syst Rev. 2016 Oct 13;10(10):CD005231. doi: 10.1002/14651858.CD005231.pub3.
5
Adapting Safety Plans for Autistic Adults with Involvement from the Autism Community.在自闭症群体的参与下为成年自闭症患者调整安全计划。
Autism Adulthood. 2025 May 28;7(3):293-302. doi: 10.1089/aut.2023.0124. eCollection 2025 Jun.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
8
Drugs for preventing postoperative nausea and vomiting in adults after general anaesthesia: a network meta-analysis.成人全身麻醉后预防术后恶心呕吐的药物:网状Meta分析
Cochrane Database Syst Rev. 2020 Oct 19;10(10):CD012859. doi: 10.1002/14651858.CD012859.pub2.
9
Laser therapy for treating hypertrophic and keloid scars.激光疗法治疗增生性瘢痕和瘢痕疙瘩。
Cochrane Database Syst Rev. 2022 Sep 26;9(9):CD011642. doi: 10.1002/14651858.CD011642.pub2.
10
Deworming drugs for soil-transmitted intestinal worms in children: effects on nutritional indicators, haemoglobin and school performance.儿童肠道土源性蠕虫驱虫药物:对营养指标、血红蛋白及学业表现的影响
Cochrane Database Syst Rev. 2012 Jul 11(7):CD000371. doi: 10.1002/14651858.CD000371.pub4.

本文引用的文献

1
Directional nature of the product-moment correlation coefficient and some consequences.积差相关系数的方向性及一些结果。
Front Psychol. 2022 Oct 17;13:988660. doi: 10.3389/fpsyg.2022.988660. eCollection 2022.
2
Attenuation-Corrected Estimators of Reliability.可靠性的衰减校正估计器。
Appl Psychol Meas. 2022 Nov;46(8):720-737. doi: 10.1177/01466216221108131. Epub 2022 Sep 15.
3
Typology of Deflation-Corrected Estimators of Reliability.经通缩调整的可靠性估计量的类型学。
Front Psychol. 2022 Jul 18;13:891959. doi: 10.3389/fpsyg.2022.891959. eCollection 2022.
4
Deflation-Corrected Estimators of Reliability.经通货紧缩调整后的可靠性估计量
Front Psychol. 2022 Jan 4;12:748672. doi: 10.3389/fpsyg.2021.748672. eCollection 2021.
5
Avoid Cohen's 'Small', 'Medium', and 'Large' for Power Analysis.避免 Cohen 的“小”、“中”和“大”进行功效分析。
Trends Cogn Sci. 2020 Mar;24(3):200-207. doi: 10.1016/j.tics.2019.12.009. Epub 2020 Jan 15.
6
On the Use, the Misuse, and the Very Limited Usefulness of Cronbach's Alpha.论克朗巴哈α系数的使用、误用及非常有限的实用性。
Psychometrika. 2009 Mar;74(1):107-120. doi: 10.1007/s11336-008-9101-0. Epub 2008 Dec 11.
7
ALPHA FACTOR ANALYSIS.阿尔法因子分析
Psychometrika. 1965 Mar;30:1-14. doi: 10.1007/BF02289743.