有证据表明，有时人们更倾向于选择非显著性结果：是反向 P 值操纵还是选择性报告？

Evidence that nonsignificant results are sometimes preferred: Reverse P-hacking or selective reporting?

机构信息

Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, Australia.

Department of Biological Sciences, Bishop's University, Sherbrooke, Canada.

出版信息

PLoS Biol. 2019 Jan 25;17(1):e3000127. doi: 10.1371/journal.pbio.3000127. eCollection 2019 Jan.

DOI:10.1371/journal.pbio.3000127

PMID:30682013

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6364929/

Abstract

There is increased concern about poor scientific practices arising from an excessive focus on P-values. Two particularly worrisome practices are selective reporting of significant results and 'P-hacking'. The latter is the manipulation of data collection, usage, or analyses to obtain statistically significant outcomes. Here, we introduce the novel, to our knowledge, concepts of selective reporting of nonsignificant results and 'reverse P-hacking' whereby researchers ensure that tests produce a nonsignificant result. We test whether these practices occur in experiments in which researchers randomly assign subjects to treatment and control groups to minimise differences in confounding variables that might affect the focal outcome. By chance alone, 5% of tests for a group difference in confounding variables should yield a significant result (P < 0.05). If researchers less often report significant findings and/or reverse P-hack to avoid significant outcomes that undermine the ethos that experimental and control groups only differ with respect to actively manipulated variables, we expect significant results from tests for group differences to be under-represented in the literature. We surveyed the behavioural ecology literature and found significantly more nonsignificant P-values reported for tests of group differences in potentially confounding variables than the expected 95% (P = 0.005; N = 250 studies). This novel, to our knowledge, publication bias could result from selective reporting of nonsignificant results and/or from reverse P-hacking. We encourage others to test for a bias toward publishing nonsignificant results in the equivalent context in their own research discipline.

摘要

人们越来越关注过度关注 P 值所带来的不良科学实践。两种特别令人担忧的做法是有选择地报告显著结果和“P 操纵”。后者是指操纵数据收集、使用或分析以获得统计学上显著的结果。在这里，我们引入了选择性报告无显著结果和“反向 P 操纵”的新概念，研究人员通过这些概念来确保测试产生无显著结果。我们测试了这些做法是否会出现在研究人员随机将受试者分配到处理组和对照组以最小化可能影响焦点结果的混杂变量差异的实验中。仅凭机会，5%的混杂变量组间差异测试应该会产生显著结果（P < 0.05）。如果研究人员较少报告显著发现，并且/或者为了避免显著结果破坏实验组和对照组仅在主动操纵变量方面存在差异的精神，我们预计组间差异测试的显著结果在文献中会被低估。我们调查了行为生态学文献，发现报告的潜在混杂变量组间差异测试的无显著 P 值明显多于预期的 95%（P = 0.005；N = 250 项研究）。这种新颖的、据我们所知的发表偏倚可能是由于有选择性地报告无显著结果和/或反向 P 操纵所致。我们鼓励其他人在自己的研究领域中测试在同等背景下发表无显著结果的偏向。

相似文献

Evidence that nonsignificant results are sometimes preferred: Reverse P-hacking or selective reporting?

PLoS Biol. 2019 Jan 25;17(1):e3000127. doi: 10.1371/journal.pbio.3000127. eCollection 2019 Jan.

The extent and consequences of p-hacking in science.

PLoS Biol. 2015 Mar 13;13(3):e1002106. doi: 10.1371/journal.pbio.1002106. eCollection 2015 Mar.

The distribution of P-values in medical research articles suggested selective reporting associated with statistical significance.

J Clin Epidemiol. 2017 Jul;87:70-77. doi: 10.1016/j.jclinepi.2017.04.003. Epub 2017 Apr 9.

P-Hacking in Orthopaedic Literature: A Twist to the Tail.

J Bone Joint Surg Am. 2016 Oct 19;98(20):e91. doi: 10.2106/JBJS.16.00479.

Is There Evidence of P-Hacking in Imaging Research?

Can Assoc Radiol J. 2023 Aug;74(3):497-507. doi: 10.1177/08465371221139418. Epub 2022 Nov 22.

p-Curve and Effect Size: Correcting for Publication Bias Using Only Significant Results.

Perspect Psychol Sci. 2014 Nov;9(6):666-81. doi: 10.1177/1745691614553988.

A survey of publication bias within evolutionary ecology.

Proc Biol Sci. 2004 Dec 7;271 Suppl 6(Suppl 6):S451-4. doi: 10.1098/rsbl.2004.0218.

How to design a pre-specified statistical analysis approach to limit p-hacking in clinical trials: the Pre-SPEC framework.

BMC Med. 2020 Sep 7;18(1):253. doi: 10.1186/s12916-020-01706-7.

P-curve: a key to the file-drawer.

J Exp Psychol Gen. 2014 Apr;143(2):534-47. doi: 10.1037/a0033242. Epub 2013 Jul 15.

The distribution of probability values in medical abstracts: an observational study.

BMC Res Notes. 2015 Nov 26;8:721. doi: 10.1186/s13104-015-1691-x.

引用本文的文献

Methodological challenges using routine clinical care data for real-world evidence: a rapid review utilizing a systematic literature search and focus group discussion.

BMC Med Res Methodol. 2025 Jan 14;25(1):8. doi: 10.1186/s12874-024-02440-x.

Methods for assessing inverse publication bias of adverse events.

Contemp Clin Trials. 2024 Oct;145:107646. doi: 10.1016/j.cct.2024.107646. Epub 2024 Jul 30.

Sex recognition does not modulate aggression toward nest intruders in a paper wasp.

Curr Zool. 2022 Jul 4;69(3):324-331. doi: 10.1093/cz/zoac051. eCollection 2023 Jun.

Null results of oxytocin and vasopressin administration on mentalizing in a large fMRI sample: evidence from a randomized controlled trial.

Psychol Med. 2023 Apr;53(6):2285-2295. doi: 10.1017/S0033291721004104. Epub 2021 Oct 15.

Tempest in a teacup: An analysis of p-Hacking in organizational research.

PLoS One. 2023 Feb 24;18(2):e0281938. doi: 10.1371/journal.pone.0281938. eCollection 2023.

Neurophysiological parameters of sensory perception and cognition among different modalities of learners.

J Educ Health Promot. 2020 Jun 30;9:162. doi: 10.4103/jehp.jehp_654_19. eCollection 2020.

Null results of oxytocin and vasopressin administration across a range of social cognitive and behavioral paradigms: Evidence from a randomized controlled trial.

Psychoneuroendocrinology. 2019 Sep;107:124-132. doi: 10.1016/j.psyneuen.2019.04.019. Epub 2019 Apr 29.

本文引用的文献

Imbalance values for baseline covariates in randomized controlled trials: a last resort for the use of values? A pro and contra debate.

Clin Epidemiol. 2018 May 8;10:531-535. doi: 10.2147/CLEP.S161508. eCollection 2018.

Proper experimental design requires randomization/balancing of molecular ecology experiments.

Ecol Evol. 2018 Jan 10;8(3):1786-1793. doi: 10.1002/ece3.3687. eCollection 2018 Feb.

Modelling science trustworthiness under publish or perish pressure.

R Soc Open Sci. 2018 Jan 10;5(1):171511. doi: 10.1098/rsos.171511. eCollection 2018 Jan.

Detecting and avoiding likely false-positive findings - a practical guide.

Biol Rev Camb Philos Soc. 2017 Nov;92(4):1941-1968. doi: 10.1111/brv.12315. Epub 2016 Nov 23.

Transparency in Ecology and Evolution: Real Problems, Real Solutions.

Trends Ecol Evol. 2016 Sep;31(9):711-719. doi: 10.1016/j.tree.2016.07.002. Epub 2016 Jul 25.

Marginally Significant Effects as Evidence for Hypotheses: Changing Attitudes Over Four Decades.

Psychol Sci. 2016 Jul;27(7):1036-42. doi: 10.1177/0956797616645672. Epub 2016 May 16.

Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value.

PeerJ. 2016 Feb 18;4:e1715. doi: 10.7717/peerj.1715. eCollection 2016.

PSYCHOLOGY. Estimating the reproducibility of psychological science.

Science. 2015 Aug 28;349(6251):aac4716. doi: 10.1126/science.aac4716.

p-Curve and Effect Size: Correcting for Publication Bias Using Only Significant Results.

Perspect Psychol Sci. 2014 Nov;9(6):666-81. doi: 10.1177/1745691614553988.

Is the Replicability Crisis Overblown? Three Arguments Examined.

Perspect Psychol Sci. 2012 Nov;7(6):531-6. doi: 10.1177/1745691612463401.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

有证据表明，有时人们更倾向于选择非显著性结果：是反向 P 值操纵还是选择性报告？

Evidence that nonsignificant results are sometimes preferred: Reverse P-hacking or selective reporting?

机构信息

Division of Ecology and Evolution, Research School of Biology, The Australian National University, Canberra, Australia.

Department of Biological Sciences, Bishop's University, Sherbrooke, Canada.

出版信息

PLoS Biol. 2019 Jan 25;17(1):e3000127. doi: 10.1371/journal.pbio.3000127. eCollection 2019 Jan.

DOI:10.1371/journal.pbio.3000127

PMID:30682013

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6364929/

Abstract

摘要

有证据表明，有时人们更倾向于选择非显著性结果：是反向 P 值操纵还是选择性报告？

Evidence that nonsignificant results are sometimes preferred: Reverse P-hacking or selective reporting?

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

有证据表明，有时人们更倾向于选择非显著性结果：是反向 P 值操纵还是选择性报告？

Evidence that nonsignificant results are sometimes preferred: Reverse P-hacking or selective reporting?

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献