• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

置信区间有助于(但不能保证)比统计显著性检验进行更好的推断。

Confidence intervals permit, but do not guarantee, better inference than statistical significance testing.

作者信息

Coulson Melissa, Healey Michelle, Fidler Fiona, Cumming Geoff

机构信息

Statistical Cognition Laboratory, School of Psychological Science, La Trobe University Melbourne, VIC, Australia.

出版信息

Front Psychol. 2010 Jul 2;1:26. doi: 10.3389/fpsyg.2010.00026. eCollection 2010.

DOI:10.3389/fpsyg.2010.00026
PMID:21607077
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3095378/
Abstract

A statistically significant result, and a non-significant result may differ little, although significance status may tempt an interpretation of difference. Two studies are reported that compared interpretation of such results presented using null hypothesis significance testing (NHST), or confidence intervals (CIs). Authors of articles published in psychology, behavioral neuroscience, and medical journals were asked, via email, to interpret two fictitious studies that found similar results, one statistically significant, and the other non-significant. Responses from 330 authors varied greatly, but interpretation was generally poor, whether results were presented as CIs or using NHST. However, when interpreting CIs respondents who mentioned NHST were 60% likely to conclude, unjustifiably, the two results conflicted, whereas those who interpreted CIs without reference to NHST were 95% likely to conclude, justifiably, the two results were consistent. Findings were generally similar for all three disciplines. An email survey of academic psychologists confirmed that CIs elicit better interpretations if NHST is not invoked. Improved statistical inference can result from encouragement of meta-analytic thinking and use of CIs but, for full benefit, such highly desirable statistical reform requires also that researchers interpret CIs without recourse to NHST.

摘要

一个具有统计学显著性的结果和一个无显著性的结果可能差异不大,尽管显著性状态可能会引发对差异的解读。本文报告了两项研究,它们比较了使用零假设显著性检验(NHST)或置信区间(CI)呈现的此类结果的解读情况。通过电子邮件,向心理学、行为神经科学和医学期刊上发表文章的作者们提出要求,请他们解读两项虚拟研究,这两项研究得出了相似的结果,一项具有统计学显著性,另一项无显著性。330位作者的回复差异很大,但无论结果是以置信区间还是使用零假设显著性检验呈现,解读通常都很糟糕。然而,在解读置信区间时,提及零假设显著性检验的受访者中有60%可能会不合理地得出这两个结果相互矛盾的结论,而那些在解读置信区间时未提及零假设显著性检验的受访者中有95%可能会合理地得出这两个结果一致的结论。所有三个学科的研究结果总体上相似。对学术心理学家的一项电子邮件调查证实,如果不引入零假设显著性检验,置信区间能引发更好的解读。鼓励进行元分析思维和使用置信区间可以带来更好的统计推断,但要获得充分的益处,这种非常理想的统计改革还要求研究人员在解读置信区间时不依赖零假设显著性检验。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/a5d807317ac3/fpsyg-01-00026-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/4c05f9a2715a/fpsyg-01-00026-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/544f6607b52c/fpsyg-01-00026-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/b57457e0292a/fpsyg-01-00026-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/b77d128ca253/fpsyg-01-00026-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/7c87630bd230/fpsyg-01-00026-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/88f85fc5a861/fpsyg-01-00026-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/f31790c3ae3c/fpsyg-01-00026-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/a5d807317ac3/fpsyg-01-00026-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/4c05f9a2715a/fpsyg-01-00026-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/544f6607b52c/fpsyg-01-00026-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/b57457e0292a/fpsyg-01-00026-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/b77d128ca253/fpsyg-01-00026-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/7c87630bd230/fpsyg-01-00026-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/88f85fc5a861/fpsyg-01-00026-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/f31790c3ae3c/fpsyg-01-00026-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0dbd/3095378/a5d807317ac3/fpsyg-01-00026-g008.jpg

相似文献

1
Confidence intervals permit, but do not guarantee, better inference than statistical significance testing.置信区间有助于(但不能保证)比统计显著性检验进行更好的推断。
Front Psychol. 2010 Jul 2;1:26. doi: 10.3389/fpsyg.2010.00026. eCollection 2010.
2
Robust misinterpretation of confidence intervals.对置信区间的严重误解。
Psychon Bull Rev. 2014 Oct;21(5):1157-64. doi: 10.3758/s13423-013-0572-3.
3
Statistical inference in abstracts of major medical and epidemiology journals 1975-2014: a systematic review.1975 - 2014年主要医学和流行病学杂志摘要中的统计推断:一项系统综述。
Eur J Epidemiol. 2017 Jan;32(1):21-29. doi: 10.1007/s10654-016-0211-1. Epub 2016 Nov 17.
4
Reform of statistical inference in psychology: the case of memory & cognition.心理学中统计推断的改革:以记忆与认知为例。
Behav Res Methods Instrum Comput. 2004 May;36(2):312-24. doi: 10.3758/bf03195577.
5
Bayesian inference of population prevalence.贝叶斯推断种群流行率。
Elife. 2021 Oct 6;10:e62461. doi: 10.7554/eLife.62461.
6
Perceived Statistical Knowledge Level and Self-Reported Statistical Practice Among Academic Psychologists.学术心理学家的统计学知识认知水平与自我报告的统计实践情况
Front Psychol. 2018 Jun 22;9:996. doi: 10.3389/fpsyg.2018.00996. eCollection 2018.
7
The reporting of values, confidence intervals and statistical significance in Preventive Veterinary Medicine (1997-2017).《预防兽医学》(1997 - 2017年)中数值、置信区间及统计学显著性的报告
PeerJ. 2021 Nov 24;9:e12453. doi: 10.7717/peerj.12453. eCollection 2021.
8
Neuroscience Needs to Test Both Statistical and Scientific Hypotheses.神经科学需要同时检验统计假设和科学假设。
J Neurosci. 2022 Nov 9;42(45):8432-8438. doi: 10.1523/JNEUROSCI.1134-22.2022.
9
On confidence intervals for within-subjects designs.关于被试内设计的置信区间
Psychol Methods. 2005 Dec;10(4):397-412. doi: 10.1037/1082-989X.10.4.397.
10
A logical analysis of null hypothesis significance testing using popular terminology.使用通俗术语对零假设显著性检验进行逻辑分析。
BMC Med Res Methodol. 2022 Sep 19;22(1):244. doi: 10.1186/s12874-022-01696-5.

引用本文的文献

1
Within-subject confidence intervals for pairwise differences in scatter plots.散点图中两两差异的受试者内置信区间。
Psychon Bull Rev. 2025 Aug 28. doi: 10.3758/s13423-025-02750-1.
2
Investigation of choroid plexus variability in schizophrenia-spectrum disorders-insights from a multimodal study.精神分裂症谱系障碍中脉络丛变异性的研究——来自多模态研究的见解
Schizophrenia (Heidelb). 2024 Dec 20;10(1):121. doi: 10.1038/s41537-024-00543-4.
3
The psychological reality of the learned "p < .05" boundary.所学的“p <.05”界限的心理现实。

本文引用的文献

1
Statistical reform in psychology: is anything changing?心理学中的统计改革:有什么正在发生变化吗?
Psychol Sci. 2007 Mar;18(3):230-2. doi: 10.1111/j.1467-9280.2007.01881.x.
2
Researchers misunderstand confidence intervals and standard error bars.研究人员对置信区间和标准误差线存在误解。
Psychol Methods. 2005 Dec;10(4):389-96. doi: 10.1037/1082-989X.10.4.389.
3
Inference by eye: confidence intervals and how to read pictures of data.直观推断:置信区间以及如何解读数据图表
Cogn Res Princ Implic. 2024 May 3;9(1):27. doi: 10.1186/s41235-024-00553-x.
4
Scale-free dynamics in the core-periphery topography and task alignment decline from conscious to unconscious states.无标度动态在核心-边缘拓扑和任务一致性上从有意识状态下降到无意识状态。
Commun Biol. 2023 May 9;6(1):499. doi: 10.1038/s42003-023-04879-y.
5
Power analysis for idiographic (within-subject) clinical trials: Implications for treatments of rare conditions and precision medicine.个体(受试者内)临床试验的功效分析:对罕见病治疗和精准医学的启示。
Behav Res Methods. 2023 Dec;55(8):4175-4199. doi: 10.3758/s13428-022-02012-1. Epub 2022 Dec 16.
6
P value and Bayesian analysis in randomized-controlled trials in child health research published over 10 years, 2007 to 2017: a methodological review protocol.P 值和贝叶斯分析在过去 10 年(2007 年至 2017 年)发表的儿童健康研究随机对照试验中的应用:方法学综述方案。
Syst Rev. 2021 Mar 10;10(1):71. doi: 10.1186/s13643-021-01622-8.
7
Informal versus formal judgment of statistical models: The case of normality assumptions.非形式化与形式化判断统计模型:正态性假设案例。
Psychon Bull Rev. 2021 Aug;28(4):1164-1182. doi: 10.3758/s13423-021-01879-z. Epub 2021 Mar 3.
8
Perceived Statistical Knowledge Level and Self-Reported Statistical Practice Among Academic Psychologists.学术心理学家的统计学知识认知水平与自我报告的统计实践情况
Front Psychol. 2018 Jun 22;9:996. doi: 10.3389/fpsyg.2018.00996. eCollection 2018.
9
-Value, Confidence Intervals, and Statistical Inference: A New Dataset of Misinterpretation.- 数值、置信区间与统计推断:一个关于误解的新数据集。
Front Psychol. 2018 Jun 8;9:868. doi: 10.3389/fpsyg.2018.00868. eCollection 2018.
10
Can a workplace leadership intervention reduce job insecurity and improve health? Results from a field study.工作场所领导力干预能否减少工作不安全感并改善健康?一项现场研究的结果。
Int Arch Occup Environ Health. 2018 Jul;91(5):547-557. doi: 10.1007/s00420-018-1302-y. Epub 2018 Mar 22.
Am Psychol. 2005 Feb-Mar;60(2):170-80. doi: 10.1037/0003-066X.60.2.170.
4
The persistence of underpowered studies in psychological research: causes, consequences, and remedies.心理学研究中功效不足研究的持续存在:原因、后果及补救措施。
Psychol Methods. 2004 Jun;9(2):147-63. doi: 10.1037/1082-989X.9.2.147.