• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

推断统计学中的显著性谬误。

The significance fallacy in inferential statistics.

作者信息

Kühberger Anton, Fritz Astrid, Lermer Eva, Scherndl Thomas

机构信息

Department of Psychology and Centre of Cognitive Neuroscience, University of Salzburg, Hellbrunnerstr. 34, 5020, Salzburg, Austria.

Österreichisches Zentrum für Begabtenförderung und Begabungsforschung, Salzburg, Austria.

出版信息

BMC Res Notes. 2015 Mar 17;8:84. doi: 10.1186/s13104-015-1020-4.

DOI:10.1186/s13104-015-1020-4
PMID:25888971
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4377068/
Abstract

BACKGROUND

Statistical significance is an important concept in empirical science. However the meaning of the term varies widely. We investigate into the intuitive understanding of the notion of significance.

METHODS

We described the results of two different experiments published in a major psychological journal to a sample of students of psychology, labeling the findings as 'significant' versus 'non-significant.' Participants were asked to estimate the effect sizes and sample sizes of the original studies.

RESULTS

Labeling the results of a study as significant was associated with estimations of a big effect, but was largely unrelated to sample size. Similarly, non-significant results were estimated as near zero in effect size.

CONCLUSIONS

After considerable training in statistics, students largely equate statistical significance with medium to large effect sizes, rather than with large sample sizes. The data show that students assume that statistical significance is due to real effects, rather than to 'statistical tricks' (e.g., increasing sample size).

摘要

背景

统计显著性是实证科学中的一个重要概念。然而,该术语的含义差异很大。我们调查了对显著性概念的直观理解。

方法

我们向心理学专业的学生样本描述了发表在一本主要心理学杂志上的两项不同实验的结果,将这些发现标记为“显著”与“不显著”。参与者被要求估计原始研究的效应大小和样本大小。

结果

将一项研究的结果标记为显著与对大效应的估计相关,但在很大程度上与样本大小无关。同样,不显著的结果在效应大小方面被估计为接近零。

结论

经过大量的统计学训练后,学生们在很大程度上把统计显著性等同于中等至大的效应大小,而不是大的样本大小。数据表明,学生们认为统计显著性是由于真实效应,而不是“统计技巧”(例如,增加样本大小)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6682/4377068/bc045e07c9a9/13104_2015_1020_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6682/4377068/3356f074461b/13104_2015_1020_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6682/4377068/bc045e07c9a9/13104_2015_1020_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6682/4377068/3356f074461b/13104_2015_1020_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6682/4377068/bc045e07c9a9/13104_2015_1020_Fig2_HTML.jpg

相似文献

1
The significance fallacy in inferential statistics.推断统计学中的显著性谬误。
BMC Res Notes. 2015 Mar 17;8:84. doi: 10.1186/s13104-015-1020-4.
2
Out with .05, in with Replication and Measurement: Isolating and Working with the Particular Effect Sizes that are Troublesome for Inferential Statistics.摒弃0.05,引入重复与测量:分离并处理对推断统计造成麻烦的特定效应量。
J Gen Psychol. 2017 Oct-Dec;144(4):309-316. doi: 10.1080/00221309.2017.1381496. Epub 2017 Oct 12.
3
Power of mental health nursing research: a statistical analysis of studies in the International Journal of Mental Health Nursing.心理健康护理研究的力量:《国际精神健康护理杂志》研究的统计分析。
Int J Ment Health Nurs. 2013 Feb;22(1):69-75. doi: 10.1111/j.1447-0349.2012.00845.x. Epub 2012 Jun 27.
4
The large sample size fallacy.大样本量谬误。
Scand J Caring Sci. 2013 Jun;27(2):487-92. doi: 10.1111/j.1471-6712.2012.01052.x. Epub 2012 Jul 31.
5
A visitor's guide to effect sizes: statistical significance versus practical (clinical) importance of research findings.效应量指南:研究结果的统计学显著性与实际(临床)重要性
Adv Health Sci Educ Theory Pract. 2004;9(3):241-9. doi: 10.1023/B:AHSE.0000038173.00909.f6.
6
Redressing the power and effect of significance. A new approach to an old problem: teaching statistics to nursing students.纠正显著性的效力与影响。解决老问题的新方法:向护理专业学生教授统计学。
Nurse Educ Today. 2000 Jul;20(5):358-64. doi: 10.1054/nedt.2000.0429.
7
Alpha values as a function of sample size, effect size, and power: accuracy over inference.作为样本量、效应量和检验效能函数的α值:推断准确性。
Psychol Rep. 2013 Jun;112(3):835-44. doi: 10.2466/03.49.PR0.112.3.835-844.
8
Effect size estimates: current use, calculations, and interpretation.效应量估计:当前使用、计算和解释。
J Exp Psychol Gen. 2012 Feb;141(1):2-18. doi: 10.1037/a0024338. Epub 2011 Aug 8.
9
Power, effects, confidence, and significance: an investigation of statistical practices in nursing research.功效、效应、置信度与显著性:护理研究中的统计实践调查
Int J Nurs Stud. 2014 May;51(5):795-806. doi: 10.1016/j.ijnurstu.2013.09.014. Epub 2013 Oct 9.
10
Statistical power and effect sizes of clinical neuropsychology research.临床神经心理学研究的统计功效与效应量
J Clin Exp Neuropsychol. 2001 Jun;23(3):399-406. doi: 10.1076/jcen.23.3.399.1181.

引用本文的文献

1
Association of constipation with suicidal ideation among US adults and the partial mediating role of depression.美国成年人便秘与自杀意念的关联及抑郁的部分中介作用。
Sci Rep. 2025 Mar 29;15(1):10936. doi: 10.1038/s41598-025-95252-y.
2
Associations Between Daily-Use Products and Urinary Biomarkers of Endocrine-Disrupting Chemicals in Adults of Reproductive Age.日常用品与育龄期成年人内分泌干扰化学物尿液生物标志物之间的关联
Int J Environ Res Public Health. 2025 Jan 13;22(1):99. doi: 10.3390/ijerph22010099.
3
Multivariate variable selection in N-of-1 observational studies via additive Bayesian networks.

本文引用的文献

1
Sailing From the Seas of Chaos Into the Corridor of Stability: Practical Recommendations to Increase the Informational Value of Studies.从混沌之海驶向稳定之廊:提高研究信息价值的实用建议。
Perspect Psychol Sci. 2014 May;9(3):278-92. doi: 10.1177/1745691614528520.
2
Bayesian Versus Orthodox Statistics: Which Side Are You On?贝叶斯统计与经典统计:你站在哪一边?
Perspect Psychol Sci. 2011 May;6(3):274-90. doi: 10.1177/1745691611406920.
3
The Rules of the Game Called Psychological Science.名为“心理科学”的游戏规则。
基于加性贝叶斯网络的 N-of-1 观察性研究中的多变量选择。
PLoS One. 2024 Aug 26;19(8):e0305225. doi: 10.1371/journal.pone.0305225. eCollection 2024.
4
A Personalized Intervention to Increase Environmental Health Literacy and Readiness to Change in a Northern Nevada Population: Effects of Environmental Chemical Exposure Report-Back.以环境化学暴露报告反馈为基础的内华达州北部人群环境健康素养和改变准备度的个性化干预:效果报告
Int J Environ Res Public Health. 2024 Jul 11;21(7):905. doi: 10.3390/ijerph21070905.
5
Multiple Confidence Intervals and Surprisal Intervals to Avoid Significance Fallacy.避免显著性谬误的多个置信区间和意外区间。
Cureus. 2024 Jan 9;16(1):e51964. doi: 10.7759/cureus.51964. eCollection 2024 Jan.
6
Continuing Medical Education Outcomes are Much More Than Statistical Significance.继续医学教育的成果远不止于统计学意义。
J CME. 2023 Jul 21;12(1):2236893. doi: 10.1080/28338073.2023.2236893. eCollection 2023.
7
A Framework to Avoid Significance Fallacy.避免显著性谬误的框架。
Cureus. 2023 Jun 11;15(6):e40242. doi: 10.7759/cureus.40242. eCollection 2023 Jun.
8
Evaluating equity in performance of an electronic health record-based 6-month mortality risk model to trigger palliative care consultation: a retrospective model validation analysis.评估基于电子健康记录的 6 个月死亡率风险模型在触发姑息治疗咨询方面的表现公平性:回顾性模型验证分析。
BMJ Qual Saf. 2023 Sep;32(9):503-516. doi: 10.1136/bmjqs-2022-015173. Epub 2023 Mar 31.
9
ViLoN-a multi-layer network approach to data integration demonstrated for patient stratification.ViLoN——一种用于数据集成的多层网络方法,用于患者分层。
Nucleic Acids Res. 2023 Jan 11;51(1):e6. doi: 10.1093/nar/gkac988.
10
What possibly affects nighttime heart rate? Conclusions from N-of-1 observational data.什么可能影响夜间心率?基于单病例观察数据的结论。
Digit Health. 2022 Aug 24;8:20552076221120725. doi: 10.1177/20552076221120725. eCollection 2022 Jan-Dec.
Perspect Psychol Sci. 2012 Nov;7(6):543-54. doi: 10.1177/1745691612459060.
4
Editors' Introduction to the Special Section on Replicability in Psychological Science: A Crisis of Confidence?《心理科学中可重复性问题特刊编辑引言:信心危机?》
Perspect Psychol Sci. 2012 Nov;7(6):528-30. doi: 10.1177/1745691612465253.
5
Publication bias in psychology: a diagnosis based on the correlation between effect size and sample size.心理学中的发表偏倚:基于效应量与样本量之间相关性的诊断
PLoS One. 2014 Sep 5;9(9):e105825. doi: 10.1371/journal.pone.0105825. eCollection 2014.
6
The new statistics: why and how.新的统计数据:原因和方法。
Psychol Sci. 2014 Jan;25(1):7-29. doi: 10.1177/0956797613504966. Epub 2013 Nov 12.
7
Power failure: why small sample size undermines the reliability of neuroscience.停电:为什么小样本量会破坏神经科学的可靠性。
Nat Rev Neurosci. 2013 May;14(5):365-76. doi: 10.1038/nrn3475. Epub 2013 Apr 10.
8
Measuring the prevalence of questionable research practices with incentives for truth telling.用真话激励法来衡量可疑研究行为的发生率。
Psychol Sci. 2012 May 1;23(5):524-32. doi: 10.1177/0956797611430953. Epub 2012 Apr 16.
9
Why animal research needs to improve.为什么动物研究需要改进。
Nature. 2011 Sep 28;477(7366):511. doi: 10.1038/477511a.
10
The (mis)reporting of statistical results in psychology journals.心理学期刊中统计结果的(错误)报告。
Behav Res Methods. 2011 Sep;43(3):666-78. doi: 10.3758/s13428-011-0089-5.