• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用p曲线分析和文本挖掘来检测p值操纵率和证据价值时存在的问题。

Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value.

作者信息

Bishop Dorothy V M, Thompson Paul A

机构信息

Department of Experimental Psychology, University of Oxford , Oxford , United Kingdom.

出版信息

PeerJ. 2016 Feb 18;4:e1715. doi: 10.7717/peerj.1715. eCollection 2016.

DOI:10.7717/peerj.1715
PMID:26925335
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4768688/
Abstract

Background. The p-curve is a plot of the distribution of p-values reported in a set of scientific studies. Comparisons between ranges of p-values have been used to evaluate fields of research in terms of the extent to which studies have genuine evidential value, and the extent to which they suffer from bias in the selection of variables and analyses for publication, p-hacking. Methods. p-hacking can take various forms. Here we used R code to simulate the use of ghost variables, where an experimenter gathers data on several dependent variables but reports only those with statistically significant effects. We also examined a text-mined dataset used by Head et al. (2015) and assessed its suitability for investigating p-hacking. Results. We show that when there is ghost p-hacking, the shape of the p-curve depends on whether dependent variables are intercorrelated. For uncorrelated variables, simulated p-hacked data do not give the "p-hacking bump" just below .05 that is regarded as evidence of p-hacking, though there is a negative skew when simulated variables are inter-correlated. The way p-curves vary according to features of underlying data poses problems when automated text mining is used to detect p-values in heterogeneous sets of published papers. Conclusions. The absence of a bump in the p-curve is not indicative of lack of p-hacking. Furthermore, while studies with evidential value will usually generate a right-skewed p-curve, we cannot treat a right-skewed p-curve as an indicator of the extent of evidential value, unless we have a model specific to the type of p-values entered into the analysis. We conclude that it is not feasible to use the p-curve to estimate the extent of p-hacking and evidential value unless there is considerable control over the type of data entered into the analysis. In particular, p-hacking with ghost variables is likely to be missed.

摘要

背景。p曲线是一组科学研究中报告的p值分布的曲线图。p值范围之间的比较已被用于评估研究领域,包括研究具有真实证据价值的程度,以及它们在变量选择和发表分析方面遭受偏差(p值操纵)的程度。方法。p值操纵可以有多种形式。在这里,我们使用R代码模拟幽灵变量的使用,即实验者收集多个因变量的数据,但只报告那些具有统计学显著效应的变量。我们还检查了Head等人(2015年)使用的一个文本挖掘数据集,并评估了其在调查p值操纵方面的适用性。结果。我们表明,当存在幽灵p值操纵时,p曲线的形状取决于因变量是否相互关联。对于不相关的变量,模拟的p值操纵数据不会在略低于0.05处出现被视为p值操纵证据的“p值操纵凸起”,尽管当模拟变量相互关联时会有负偏态。当使用自动文本挖掘来检测已发表论文的异质集合中的p值时,p曲线根据基础数据特征的变化方式会带来问题。结论。p曲线中没有凸起并不表明不存在p值操纵。此外,虽然具有证据价值的研究通常会产生右偏的p曲线,但除非我们有一个特定于分析中输入的p值类型的模型,否则我们不能将右偏的p曲线视为证据价值程度的指标。我们得出结论,除非对分析中输入的数据类型有相当的控制,否则使用p曲线来估计p值操纵和证据价值的程度是不可行的。特别是,幽灵变量的p值操纵很可能会被遗漏。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/9677f6ba12e4/peerj-04-1715-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/bcbb9c2973d6/peerj-04-1715-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/3203257e753f/peerj-04-1715-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/e348adc09c2b/peerj-04-1715-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/9677f6ba12e4/peerj-04-1715-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/bcbb9c2973d6/peerj-04-1715-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/3203257e753f/peerj-04-1715-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/e348adc09c2b/peerj-04-1715-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/cd47/4768688/9677f6ba12e4/peerj-04-1715-g004.jpg

相似文献

1
Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value.使用p曲线分析和文本挖掘来检测p值操纵率和证据价值时存在的问题。
PeerJ. 2016 Feb 18;4:e1715. doi: 10.7717/peerj.1715. eCollection 2016.
2
Is There Evidence of P-Hacking in Imaging Research?影像学研究中存在 P 操纵证据吗?
Can Assoc Radiol J. 2023 Aug;74(3):497-507. doi: 10.1177/08465371221139418. Epub 2022 Nov 22.
3
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
4
Reanalyzing Head et al. (2015): investigating the robustness of widespread -hacking.重新分析黑德等人(2015年)的研究:探究广泛黑客攻击的稳健性。
PeerJ. 2017 Mar 2;5:e3068. doi: 10.7717/peerj.3068. eCollection 2017.
5
P-Hacking in Orthopaedic Literature: A Twist to the Tail.骨科文献中的P值篡改:结局的反转
J Bone Joint Surg Am. 2016 Oct 19;98(20):e91. doi: 10.2106/JBJS.16.00479.
6
P-curve: a key to the file-drawer.P曲线:文件抽屉问题的关键。
J Exp Psychol Gen. 2014 Apr;143(2):534-47. doi: 10.1037/a0033242. Epub 2013 Jul 15.
7
P-Curve Analysis of the Köhler Motivation Gain Effect in Exercise Settings: A Demonstration of a Novel Technique to Estimate Evidential Value Across Multiple Studies.运动情境下 Köhler 动机增益效应的 P 曲线分析:一种跨多项研究评估证据价值的新技术演示。
Ann Behav Med. 2021 Jun 2;55(6):543-556. doi: 10.1093/abm/kaaa080.
8
"Meta-analyses and P-curves support robust cycle shifts in women's mate preferences: Reply to Wood and Carden (2014) and Harris, Pashler, and Mickes (2014)": Correction to Gildersleeve, Haselton, and Fales (2014)."元分析和 P 曲线支持女性配偶偏好的稳健周期转变:对伍德和卡登(2014)以及哈里斯、帕什勒和米克斯(2014)的回复":对吉尔德斯leeve、哈塞尔顿和法尔斯(2014)的更正。
Psychol Bull. 2017 Nov;143(11):iii. doi: 10.1037/bul0000129.
9
-curve accurately rejects evidence for homeopathic ultramolecular dilutions.曲线准确地排除了顺势疗法超分子稀释的证据。
PeerJ. 2019 Jan 23;7:e6318. doi: 10.7717/peerj.6318. eCollection 2019.
10
A p-curve analysis of the emotional Stroop effect among women with eating disorders.进食障碍女性情绪 Stroop 效应的 p 曲线分析。
Int J Eat Disord. 2022 Nov;55(11):1459-1483. doi: 10.1002/eat.23807. Epub 2022 Sep 20.

引用本文的文献

1
Analysis of indications for selectively missing results in comparative registry-based studies in medicine: a meta-research study.医学中基于比较登记处研究的选择性缺失结果的指征分析:一项元研究。
Res Integr Peer Rev. 2025 Mar 5;10(1):2. doi: 10.1186/s41073-025-00159-x.
2
Tempest in a teacup: An analysis of p-Hacking in organizational research.小题大做:组织研究中 p-值操纵的分析。
PLoS One. 2023 Feb 24;18(2):e0281938. doi: 10.1371/journal.pone.0281938. eCollection 2023.
3
Big little lies: a compendium and simulation of -hacking strategies.

本文引用的文献

1
Better P-curves: Making P-curve analysis more robust to errors, fraud, and ambitious P-hacking, a Reply to Ulrich and Miller (2015).更好的P曲线:使P曲线分析对错误、欺诈和激进的P值操纵更具稳健性,对乌尔里希和米勒(2015年)的回应
J Exp Psychol Gen. 2015 Dec;144(6):1146-52. doi: 10.1037/xge0000104.
2
On the challenges of drawing conclusions from p-values just below 0.05.关于在 p 值刚刚低于 0.05 时得出结论所面临的挑战。
PeerJ. 2015 Jul 30;3:e1142. doi: 10.7717/peerj.1142. eCollection 2015.
3
Puzzlingly High Correlations in fMRI Studies of Emotion, Personality, and Social Cognition.
弥天大谎:-黑客攻击策略汇编与模拟
R Soc Open Sci. 2023 Feb 8;10(2):220346. doi: 10.1098/rsos.220346. eCollection 2023 Feb.
4
Is There Evidence of P-Hacking in Imaging Research?影像学研究中存在 P 操纵证据吗?
Can Assoc Radiol J. 2023 Aug;74(3):497-507. doi: 10.1177/08465371221139418. Epub 2022 Nov 22.
5
A systematic review and meta-analysis of the impact of cash transfers on subjective well-being and mental health in low- and middle-income countries.对低收入和中等收入国家现金转移对主观幸福感和心理健康影响的系统评价与荟萃分析。
Nat Hum Behav. 2022 Mar;6(3):359-370. doi: 10.1038/s41562-021-01252-z. Epub 2022 Jan 20.
6
Statistical Significance Filtering Overestimates Effects and Impedes Falsification: A Critique of.统计显著性筛选高估效应并阻碍证伪:对……的批判
Front Psychol. 2020 Dec 22;11:609647. doi: 10.3389/fpsyg.2020.609647. eCollection 2020.
7
Comparing the Efficacy of Cancer Therapies between Subgroups in Basket Trials.比较篮子试验亚组间癌症疗法的疗效。
Cell Syst. 2020 Nov 18;11(5):449-460.e2. doi: 10.1016/j.cels.2020.09.003.
8
Publication and related biases in health services research: a systematic review of empirical evidence.卫生服务研究中的发表偏倚和相关偏倚:系统评价的实证证据。
BMC Med Res Methodol. 2020 Jun 1;20(1):137. doi: 10.1186/s12874-020-01010-1.
9
Reproducible research into human chemical communication by cues and pheromones: learning from psychology's renaissance.通过线索和信息素来研究人类化学通讯的可重复性:从心理学的复兴中学习。
Philos Trans R Soc Lond B Biol Sci. 2020 Jun 8;375(1800):20190262. doi: 10.1098/rstb.2019.0262. Epub 2020 Apr 20.
10
Moving Sport and Exercise Science Forward: A Call for the Adoption of More Transparent Research Practices.推动运动与锻炼科学的发展:呼吁采用更透明的研究实践。
Sports Med. 2020 Mar;50(3):449-459. doi: 10.1007/s40279-019-01227-1.
令人费解的 fMRI 研究中情绪、个性和社会认知的高度相关性。
Perspect Psychol Sci. 2009 May;4(3):274-90. doi: 10.1111/j.1745-6924.2009.01125.x.
4
The extent and consequences of p-hacking in science.科学中的 p-值操纵的程度和后果。
PLoS Biol. 2015 Mar 13;13(3):e1002106. doi: 10.1371/journal.pbio.1002106. eCollection 2015 Mar.
5
A surge of p-values between 0.041 and 0.049 in recent decades (but negative results are increasing rapidly too).近几十年来,p值在0.041至0.049之间出现激增(但阴性结果也在迅速增加)。
PeerJ. 2015 Jan 22;3:e733. doi: 10.7717/peerj.733. eCollection 2015.
6
What p-hacking really looks like: a comment on Masicampo and LaLande (2012).p值篡改真面目:对马西坎波和拉兰德(2012年)的评论
Q J Exp Psychol (Hove). 2015;68(4):829-32. doi: 10.1080/17470218.2014.982664. Epub 2014 Dec 6.
7
How to make more published research true.如何让更多已发表的研究成果真实可靠。
PLoS Med. 2014 Oct 21;11(10):e1001747. doi: 10.1371/journal.pmed.1001747. eCollection 2014 Oct.
8
Common misconceptions about data analysis and statistics.关于数据分析和统计学的常见误解。
Br J Pharmacol. 2015 Apr;172(8):2126-32. doi: 10.1111/bph.12884. Epub 2014 Sep 26.
9
Publication and other reporting biases in cognitive sciences: detection, prevalence, and prevention.认知科学中的出版及其他报告偏倚:检测、发生率及预防
Trends Cogn Sci. 2014 May;18(5):235-41. doi: 10.1016/j.tics.2014.02.010. Epub 2014 Mar 18.
10
The meaning of "significance" for different types of research [translated and annotated by Eric-Jan Wagenmakers, Denny Borsboom, Josine Verhagen, Rogier Kievit, Marjan Bakker, Angelique Cramer, Dora Matzke, Don Mellenbergh, and Han L. J. van der Maas]. 1969.不同类型研究中“显著性”的含义[由埃里克 - 扬·瓦根梅克斯、丹尼·博斯博姆、乔西娜·韦尔哈根、罗吉尔·基维特、马尔扬·巴克、安热利克·克拉默、多拉·马特兹克、唐·梅伦伯格和汉·L·J·范德马斯翻译并注释]。1969年。
Acta Psychol (Amst). 2014 May;148:188-94. doi: 10.1016/j.actpsy.2014.02.001. Epub 2014 Mar 3.