• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

有证据表明,有时人们更倾向于选择非显著性结果:是反向 P 值操纵还是选择性报告?

Evidence that nonsignificant results are sometimes preferred: Reverse P-hacking or selective reporting?

机构信息

Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, Australia.

Department of Biological Sciences, Bishop's University, Sherbrooke, Canada.

出版信息

PLoS Biol. 2019 Jan 25;17(1):e3000127. doi: 10.1371/journal.pbio.3000127. eCollection 2019 Jan.

DOI:10.1371/journal.pbio.3000127
PMID:30682013
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6364929/
Abstract

There is increased concern about poor scientific practices arising from an excessive focus on P-values. Two particularly worrisome practices are selective reporting of significant results and 'P-hacking'. The latter is the manipulation of data collection, usage, or analyses to obtain statistically significant outcomes. Here, we introduce the novel, to our knowledge, concepts of selective reporting of nonsignificant results and 'reverse P-hacking' whereby researchers ensure that tests produce a nonsignificant result. We test whether these practices occur in experiments in which researchers randomly assign subjects to treatment and control groups to minimise differences in confounding variables that might affect the focal outcome. By chance alone, 5% of tests for a group difference in confounding variables should yield a significant result (P < 0.05). If researchers less often report significant findings and/or reverse P-hack to avoid significant outcomes that undermine the ethos that experimental and control groups only differ with respect to actively manipulated variables, we expect significant results from tests for group differences to be under-represented in the literature. We surveyed the behavioural ecology literature and found significantly more nonsignificant P-values reported for tests of group differences in potentially confounding variables than the expected 95% (P = 0.005; N = 250 studies). This novel, to our knowledge, publication bias could result from selective reporting of nonsignificant results and/or from reverse P-hacking. We encourage others to test for a bias toward publishing nonsignificant results in the equivalent context in their own research discipline.

摘要

人们越来越关注过度关注 P 值所带来的不良科学实践。两种特别令人担忧的做法是有选择地报告显著结果和“P 操纵”。后者是指操纵数据收集、使用或分析以获得统计学上显著的结果。在这里,我们引入了选择性报告无显著结果和“反向 P 操纵”的新概念,研究人员通过这些概念来确保测试产生无显著结果。我们测试了这些做法是否会出现在研究人员随机将受试者分配到处理组和对照组以最小化可能影响焦点结果的混杂变量差异的实验中。仅凭机会,5%的混杂变量组间差异测试应该会产生显著结果(P < 0.05)。如果研究人员较少报告显著发现,并且/或者为了避免显著结果破坏实验组和对照组仅在主动操纵变量方面存在差异的精神,我们预计组间差异测试的显著结果在文献中会被低估。我们调查了行为生态学文献,发现报告的潜在混杂变量组间差异测试的无显著 P 值明显多于预期的 95%(P = 0.005;N = 250 项研究)。这种新颖的、据我们所知的发表偏倚可能是由于有选择性地报告无显著结果和/或反向 P 操纵所致。我们鼓励其他人在自己的研究领域中测试在同等背景下发表无显著结果的偏向。

相似文献

1
Evidence that nonsignificant results are sometimes preferred: Reverse P-hacking or selective reporting?有证据表明,有时人们更倾向于选择非显著性结果:是反向 P 值操纵还是选择性报告?
PLoS Biol. 2019 Jan 25;17(1):e3000127. doi: 10.1371/journal.pbio.3000127. eCollection 2019 Jan.
2
The extent and consequences of p-hacking in science.科学中的 p-值操纵的程度和后果。
PLoS Biol. 2015 Mar 13;13(3):e1002106. doi: 10.1371/journal.pbio.1002106. eCollection 2015 Mar.
3
The distribution of P-values in medical research articles suggested selective reporting associated with statistical significance.医学研究文章中P值的分布表明存在与统计学显著性相关的选择性报告。
J Clin Epidemiol. 2017 Jul;87:70-77. doi: 10.1016/j.jclinepi.2017.04.003. Epub 2017 Apr 9.
4
P-Hacking in Orthopaedic Literature: A Twist to the Tail.骨科文献中的P值篡改:结局的反转
J Bone Joint Surg Am. 2016 Oct 19;98(20):e91. doi: 10.2106/JBJS.16.00479.
5
Is There Evidence of P-Hacking in Imaging Research?影像学研究中存在 P 操纵证据吗?
Can Assoc Radiol J. 2023 Aug;74(3):497-507. doi: 10.1177/08465371221139418. Epub 2022 Nov 22.
6
p-Curve and Effect Size: Correcting for Publication Bias Using Only Significant Results.p 值曲线和效应量:仅使用显著结果校正发表偏倚。
Perspect Psychol Sci. 2014 Nov;9(6):666-81. doi: 10.1177/1745691614553988.
7
A survey of publication bias within evolutionary ecology.进化生态学中发表偏倚的一项调查。
Proc Biol Sci. 2004 Dec 7;271 Suppl 6(Suppl 6):S451-4. doi: 10.1098/rsbl.2004.0218.
8
How to design a pre-specified statistical analysis approach to limit p-hacking in clinical trials: the Pre-SPEC framework.如何设计一种预先指定的统计分析方法来限制临床试验中的 p 值操纵:Pre-SPEC 框架。
BMC Med. 2020 Sep 7;18(1):253. doi: 10.1186/s12916-020-01706-7.
9
P-curve: a key to the file-drawer.P曲线:文件抽屉问题的关键。
J Exp Psychol Gen. 2014 Apr;143(2):534-47. doi: 10.1037/a0033242. Epub 2013 Jul 15.
10
The distribution of probability values in medical abstracts: an observational study.医学摘要中概率值的分布:一项观察性研究。
BMC Res Notes. 2015 Nov 26;8:721. doi: 10.1186/s13104-015-1691-x.

引用本文的文献

1
Methodological challenges using routine clinical care data for real-world evidence: a rapid review utilizing a systematic literature search and focus group discussion.利用常规临床护理数据获取真实世界证据的方法学挑战:一项采用系统文献检索和焦点小组讨论的快速综述
BMC Med Res Methodol. 2025 Jan 14;25(1):8. doi: 10.1186/s12874-024-02440-x.
2
Methods for assessing inverse publication bias of adverse events.评估不良反应逆发表偏倚的方法。
Contemp Clin Trials. 2024 Oct;145:107646. doi: 10.1016/j.cct.2024.107646. Epub 2024 Jul 30.
3
Sex recognition does not modulate aggression toward nest intruders in a paper wasp.

本文引用的文献

1
Imbalance values for baseline covariates in randomized controlled trials: a last resort for the use of values? A pro and contra debate.随机对照试验中基线协变量的失衡值:使用这些值的最后手段?正反观点辩论。
Clin Epidemiol. 2018 May 8;10:531-535. doi: 10.2147/CLEP.S161508. eCollection 2018.
2
Proper experimental design requires randomization/balancing of molecular ecology experiments.恰当的实验设计需要分子生态学实验的随机化/平衡处理。
Ecol Evol. 2018 Jan 10;8(3):1786-1793. doi: 10.1002/ece3.3687. eCollection 2018 Feb.
3
Modelling science trustworthiness under publish or perish pressure.
性别识别不会调节纸黄蜂对巢穴入侵者的攻击性。
Curr Zool. 2022 Jul 4;69(3):324-331. doi: 10.1093/cz/zoac051. eCollection 2023 Jun.
4
Null results of oxytocin and vasopressin administration on mentalizing in a large fMRI sample: evidence from a randomized controlled trial.大样本 fMRI 研究中催产素和血管加压素给药对心理化作用无影响:来自随机对照试验的证据。
Psychol Med. 2023 Apr;53(6):2285-2295. doi: 10.1017/S0033291721004104. Epub 2021 Oct 15.
5
Tempest in a teacup: An analysis of p-Hacking in organizational research.小题大做:组织研究中 p-值操纵的分析。
PLoS One. 2023 Feb 24;18(2):e0281938. doi: 10.1371/journal.pone.0281938. eCollection 2023.
6
Neurophysiological parameters of sensory perception and cognition among different modalities of learners.不同学习方式下感觉知觉和认知的神经生理参数。
J Educ Health Promot. 2020 Jun 30;9:162. doi: 10.4103/jehp.jehp_654_19. eCollection 2020.
7
Null results of oxytocin and vasopressin administration across a range of social cognitive and behavioral paradigms: Evidence from a randomized controlled trial.催产素和血管加压素给药在一系列社会认知和行为范式中的无效结果:来自随机对照试验的证据。
Psychoneuroendocrinology. 2019 Sep;107:124-132. doi: 10.1016/j.psyneuen.2019.04.019. Epub 2019 Apr 29.
在“不发表就出局”压力下对科学可信度进行建模。
R Soc Open Sci. 2018 Jan 10;5(1):171511. doi: 10.1098/rsos.171511. eCollection 2018 Jan.
4
Detecting and avoiding likely false-positive findings - a practical guide.检测和避免可能的假阳性发现——实用指南。
Biol Rev Camb Philos Soc. 2017 Nov;92(4):1941-1968. doi: 10.1111/brv.12315. Epub 2016 Nov 23.
5
Transparency in Ecology and Evolution: Real Problems, Real Solutions.透明度在生态学和进化生物学中的应用:真实问题,真实解决方案。
Trends Ecol Evol. 2016 Sep;31(9):711-719. doi: 10.1016/j.tree.2016.07.002. Epub 2016 Jul 25.
6
Marginally Significant Effects as Evidence for Hypotheses: Changing Attitudes Over Four Decades.边际显著效应作为假说的证据:四十年来态度的变化。
Psychol Sci. 2016 Jul;27(7):1036-42. doi: 10.1177/0956797616645672. Epub 2016 May 16.
7
Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value.使用p曲线分析和文本挖掘来检测p值操纵率和证据价值时存在的问题。
PeerJ. 2016 Feb 18;4:e1715. doi: 10.7717/peerj.1715. eCollection 2016.
8
PSYCHOLOGY. Estimating the reproducibility of psychological science.心理学. 心理科学可重复性的评估.
Science. 2015 Aug 28;349(6251):aac4716. doi: 10.1126/science.aac4716.
9
p-Curve and Effect Size: Correcting for Publication Bias Using Only Significant Results.p 值曲线和效应量:仅使用显著结果校正发表偏倚。
Perspect Psychol Sci. 2014 Nov;9(6):666-81. doi: 10.1177/1745691614553988.
10
Is the Replicability Crisis Overblown? Three Arguments Examined.复制危机是否被夸大了?三个论点的考察。
Perspect Psychol Sci. 2012 Nov;7(6):531-6. doi: 10.1177/1745691612463401.