• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

发表的研究结果多数是错误的吗?心理学中统计功效、发表偏倚和虚报发现率的趋势(1975-2017)。

Are most published research findings false? Trends in statistical power, publication selection bias, and the false discovery rate in psychology (1975-2017).

机构信息

Department of Sociology, Ludwig-Maximilians-University, Munich, Germany.

出版信息

PLoS One. 2023 Oct 17;18(10):e0292717. doi: 10.1371/journal.pone.0292717. eCollection 2023.

DOI:10.1371/journal.pone.0292717
PMID:37847689
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10581498/
Abstract

The validity of scientific findings may be challenged by the replicability crisis (or cases of fraud), which may result not only in a loss of trust within society but may also lead to wrong or even harmful policy or medical decisions. The question is: how reliable are scientific results that are reported as statistically significant, and how does this reliability develop over time? Based on 35,515 papers in psychology published between 1975 and 2017 containing 487,996 test values, this article empirically examines the statistical power, publication bias, and p-hacking, as well as the false discovery rate. Assuming constant true effects, the statistical power was found to be lower than the suggested 80% except for large underlying true effects (d = 0.8) and increased only slightly over time. Also, publication bias and p-hacking were found to be substantial. The share of false discoveries among all significant results was estimated at 17.7%, assuming a proportion θ = 50% of all hypotheses being true and assuming that p-hacking is the only mechanism generating a higher proportion of just significant results compared to just nonsignificant results. As the analyses rely on multiple assumptions that cannot be tested, alternative scenarios were laid out, again resulting in the rather optimistic result that although research results may suffer from low statistical power and publication selection bias, most of the results reported as statistically significant may contain substantial results, rather than statistical artifacts.

摘要

科学发现的有效性可能受到可重复性危机(或欺诈案例)的挑战,这不仅可能导致社会信任的丧失,还可能导致错误甚至有害的政策或医疗决策。问题是:报告为具有统计学意义的科学结果的可靠性如何,这种可靠性随着时间的推移如何发展?本文基于 1975 年至 2017 年间发表的包含 487996 个测试值的 35515 篇心理学论文,实证检验了统计功效、发表偏倚和 p 值操纵以及错误发现率。假设真实效应不变,除了较大的真实效应(d=0.8)外,统计功效都低于建议的 80%,而且随着时间的推移仅略有增加。此外,还发现发表偏倚和 p 值操纵现象非常严重。在所有显著结果中,虚假发现的比例估计为 17.7%,假设所有假设中有θ=50%的比例为真,并且假设 p 值操纵是产生比仅仅显著结果更高比例的唯一机制。由于分析依赖于无法检验的多个假设,因此还提出了替代方案,再次得出了相当乐观的结果,即尽管研究结果可能受到低统计功效和发表选择偏倚的影响,但报告为具有统计学意义的大多数结果可能包含实质性结果,而不是统计伪影。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/eaa084e4e498/pone.0292717.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/881a0e136c4a/pone.0292717.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/23312e4bbf72/pone.0292717.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/97e75d71e538/pone.0292717.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/095768db5c1d/pone.0292717.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/cb5152739f1b/pone.0292717.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/efabb8b2402c/pone.0292717.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/eaa084e4e498/pone.0292717.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/881a0e136c4a/pone.0292717.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/23312e4bbf72/pone.0292717.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/97e75d71e538/pone.0292717.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/095768db5c1d/pone.0292717.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/cb5152739f1b/pone.0292717.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/efabb8b2402c/pone.0292717.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab2b/10581498/eaa084e4e498/pone.0292717.g007.jpg

相似文献

1
Are most published research findings false? Trends in statistical power, publication selection bias, and the false discovery rate in psychology (1975-2017).发表的研究结果多数是错误的吗?心理学中统计功效、发表偏倚和虚报发现率的趋势(1975-2017)。
PLoS One. 2023 Oct 17;18(10):e0292717. doi: 10.1371/journal.pone.0292717. eCollection 2023.
2
Publication bias impacts on effect size, statistical power, and magnitude (Type M) and sign (Type S) errors in ecology and evolutionary biology.发表偏倚对生态学和进化生物学中的效应大小、统计功效和幅度(M 型)以及符号(S 型)错误有影响。
BMC Biol. 2023 Apr 3;21(1):71. doi: 10.1186/s12915-022-01485-y.
3
p-Hacking and publication bias interact to distort meta-analytic effect size estimates.p 值操纵和发表偏倚相互作用,扭曲了荟萃分析效应量的估计。
Psychol Methods. 2020 Aug;25(4):456-471. doi: 10.1037/met0000246. Epub 2019 Dec 2.
4
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
5
The earth is flat ( > 0.05): significance thresholds and the crisis of unreplicable research.地球是平的(p>0.05):显著性阈值与不可重复研究的危机。
PeerJ. 2017 Jul 7;5:e3544. doi: 10.7717/peerj.3544. eCollection 2017.
6
Publication bias and the canonization of false facts.发表性偏倚与虚假事实的公认化
Elife. 2016 Dec 20;5:e21451. doi: 10.7554/eLife.21451.
7
Examining publication bias-a simulation-based evaluation of statistical tests on publication bias.检验发表偏倚——基于模拟的发表偏倚统计检验评估
PeerJ. 2017 Nov 30;5:e4115. doi: 10.7717/peerj.4115. eCollection 2017.
8
Publication bias examined in meta-analyses from psychology and medicine: A meta-meta-analysis.元分析中的发表偏倚检验:一项元元分析。
PLoS One. 2019 Apr 12;14(4):e0215052. doi: 10.1371/journal.pone.0215052. eCollection 2019.
9
Publication bias, statistical power and reporting practices in the Journal of Sports Sciences: potential barriers to replicability.《运动科学杂志》中的发表偏倚、统计效力及报告规范:可重复性的潜在障碍
J Sports Sci. 2023 Sep;41(16):1507-1517. doi: 10.1080/02640414.2023.2269357. Epub 2023 Dec 11.
10
Are most published research findings false in a continuous universe?在一个连续的宇宙中,大多数已发表的研究结果都是错误的吗?
PLoS One. 2022 Dec 20;17(12):e0277935. doi: 10.1371/journal.pone.0277935. eCollection 2022.

本文引用的文献

1
An umbrella review of effect size, bias, and power across meta-analyses in emergency medicine.对急诊医学中荟萃分析的效应大小、偏差和功效进行伞式综述。
Acad Emerg Med. 2021 Dec;28(12):1379-1388. doi: 10.1111/acem.14312. Epub 2021 Jun 28.
2
p-Hacking and publication bias interact to distort meta-analytic effect size estimates.p 值操纵和发表偏倚相互作用,扭曲了荟萃分析效应量的估计。
Psychol Methods. 2020 Aug;25(4):456-471. doi: 10.1037/met0000246. Epub 2019 Dec 2.
3
Estimating statistical power, posterior probability and publication bias of psychological research using the observed replication rate.
利用观察到的重复率估计心理学研究的统计功效、后验概率和发表偏倚。
R Soc Open Sci. 2018 Sep 12;5(9):181190. doi: 10.1098/rsos.181190. eCollection 2018 Sep.
4
Low statistical power in biomedical science: a review of three human research domains.生物医学领域的统计效能低下:对三个人类研究领域的综述
R Soc Open Sci. 2017 Feb 1;4(2):160254. doi: 10.1098/rsos.160254. eCollection 2017 Feb.
5
Empirical assessment of published effect sizes and power in the recent cognitive neuroscience and psychology literature.对近期认知神经科学和心理学文献中已发表的效应量和检验效能的实证评估。
PLoS Biol. 2017 Mar 2;15(3):e2000797. doi: 10.1371/journal.pbio.2000797. eCollection 2017 Mar.
6
The natural selection of bad science.劣质科学的自然选择。
R Soc Open Sci. 2016 Sep 21;3(9):160384. doi: 10.1098/rsos.160384. eCollection 2016 Sep.
7
Distributions of p-values smaller than .05 in psychology: what is going on?心理学中小于0.05的p值分布:究竟是怎么回事?
PeerJ. 2016 Apr 11;4:e1935. doi: 10.7717/peerj.1935. eCollection 2016.
8
The prevalence of statistical reporting errors in psychology (1985-2013).心理学中统计报告错误的发生率(1985 - 2013年)
Behav Res Methods. 2016 Dec;48(4):1205-1226. doi: 10.3758/s13428-015-0664-2.
9
An investigation of the false discovery rate and the misinterpretation of p-values.对错误发现率和p值误读的调查。
R Soc Open Sci. 2014 Nov 19;1(3):140216. doi: 10.1098/rsos.140216. eCollection 2014 Nov.
10
The Search for Significance: A Few Peculiarities in the Distribution of P Values in Experimental Psychology Literature.寻找显著性:实验心理学文献中P值分布的一些奇特之处。
PLoS One. 2015 Jun 10;10(6):e0127872. doi: 10.1371/journal.pone.0127872. eCollection 2015.