1976 年至 2019 年健康和医学期刊中置信区间的考察：一项观察性研究。

Examination of CIs in health and medical journals from 1976 to 2019: an observational study.

机构信息

Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove, Queensland, Australia

Arthritis and Clinical Immunology Research Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma, USA.

出版信息

BMJ Open. 2019 Nov 21;9(11):e032506. doi: 10.1136/bmjopen-2019-032506.

DOI:10.1136/bmjopen-2019-032506

PMID:31753893

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6887056/

Abstract

OBJECTIVES

Previous research has shown clear biases in the distribution of published p values, with an excess below the 0.05 threshold due to a combination of p-hacking and publication bias. We aimed to examine the bias for statistical significance using published confidence intervals.

DESIGN

Observational study.

SETTING

Papers published in since 1976.

PARTICIPANTS

Over 968 000 confidence intervals extracted from abstracts and over 350 000 intervals extracted from the full-text.

OUTCOME MEASURES

Cumulative distributions of lower and upper confidence interval limits for ratio estimates.

RESULTS

We found an excess of statistically significant results with a glut of lower intervals just above one and upper intervals just below 1. These excesses have not improved in recent years. The excesses did not appear in a set of over 100 000 confidence intervals that were not subject to p-hacking or publication bias.

CONCLUSIONS

The huge excesses of published confidence intervals that are just below the statistically significant threshold are not statistically plausible. Large improvements in research practice are needed to provide more results that better reflect the truth.

摘要

目的

先前的研究表明，已发表的 p 值分布存在明显的偏差，由于 p 值操纵和发表偏倚的综合作用，低于 0.05 阈值的 p 值过多。我们旨在使用已发表的置信区间来检验统计显著性的偏差。

设计

观察性研究。

设置

自 1976 年以来发表在《柳叶刀》上的论文。

参与者

从摘要中提取了超过 968000 个置信区间，从全文中提取了超过 350000 个置信区间。

主要观察指标

比值估计的置信区间下限和上限的累积分布。

结果

我们发现具有统计学意义的结果过多，且大量的下限刚好略高于 1，上限刚好略低于 1。近年来，这种过剩情况并没有改善。在一组不受 p 值操纵或发表偏倚影响的超过 100000 个置信区间中，没有出现这种过剩情况。

结论

大量略低于统计学显著阈值的发表置信区间的巨大过剩情况在统计学上是不合理的。需要在研究实践中做出重大改进，以提供更多更好地反映真实情况的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63f8/6887056/5ab29c65da9c/bmjopen-2019-032506f01.jpg

相似文献

Examination of CIs in health and medical journals from 1976 to 2019: an observational study.1976 年至 2019 年健康和医学期刊中置信区间的考察：一项观察性研究。

BMJ Open. 2019 Nov 21;9(11):e032506. doi: 10.1136/bmjopen-2019-032506.

The bias for statistical significance in sport and exercise medicine.运动医学中的统计学意义偏差。

J Sci Med Sport. 2023 Mar;26(3):164-168. doi: 10.1016/j.jsams.2023.03.002. Epub 2023 Mar 7.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

An unexpected influence of widely used significance thresholds on the distribution of reported P-values.广泛使用的显著性阈值对报告的P值分布产生的意外影响。

J Evol Biol. 2007 May;20(3):1082-9. doi: 10.1111/j.1420-9101.2006.01291.x.

Statistical significance and publication reporting bias in abstracts of reproductive medicine studies.生殖医学研究摘要中的统计学显著性与发表报告偏倚

Hum Reprod. 2023 Nov 28;39(3):548-558. doi: 10.1093/humrep/dead248.

The distribution of probability values in medical abstracts: an observational study.医学摘要中概率值的分布：一项观察性研究。

BMC Res Notes. 2015 Nov 26;8:721. doi: 10.1186/s13104-015-1691-x.

Is There Evidence of P-Hacking in Imaging Research?影像学研究中存在 P 操纵证据吗？

Can Assoc Radiol J. 2023 Aug;74(3):497-507. doi: 10.1177/08465371221139418. Epub 2022 Nov 22.

Evolution of Reporting P Values in the Biomedical Literature, 1990-2015.1990 年至 2015 年生物医学文献中报告 P 值的演变。

JAMA. 2016 Mar 15;315(11):1141-8. doi: 10.1001/jama.2016.1952.

Publication bias in the anesthesiology literature.麻醉学文献中的发表偏倚。

Anesth Analg. 2012 May;114(5):1042-8. doi: 10.1213/ANE.0b013e3182468fc6. Epub 2012 Feb 17.

Does direction of results of abstracts submitted to scientific conferences on drug addiction predict full publication?提交给药物成瘾科学会议的摘要结果方向能否预测全文发表？

BMC Med Res Methodol. 2009 Apr 8;9:23. doi: 10.1186/1471-2288-9-23.

引用本文的文献

Helping reviewers assess statistical analysis: A case study from analytic methods.协助评审人员评估统计分析：来自分析方法的案例研究

Anal Sci Adv. 2022 Jun 16;3(5-6):212-222. doi: 10.1002/ansa.202000159. eCollection 2022 Jun.

Evidence of questionable research practices in clinical prediction models.临床预测模型中存在可疑研究行为的证据。

BMC Med. 2023 Sep 4;21(1):339. doi: 10.1186/s12916-023-03048-6.

Climate change and infectious disease: a review of evidence and research trends.气候变化与传染病：证据与研究趋势综述。

Infect Dis Poverty. 2023 May 16;12(1):51. doi: 10.1186/s40249-023-01102-2.

本文引用的文献

Improving reproducibility by using high-throughput observational studies with empirical calibration.通过使用经实证校准的高通量观察性研究提高可重复性。

Philos Trans A Math Phys Eng Sci. 2018 Sep 13;376(2128). doi: 10.1098/rsta.2017.0356.

Statistical Significance and the Dichotomization of Evidence: The Relevance of the for Statisticians.统计显著性与证据的二分法：对统计学家的相关性。

J Am Stat Assoc. 2017;112(519):902-904. doi: 10.1080/01621459.2017.1311265. Epub 2017 Oct 30.

Algorithmic identification of discrepancies between published ratios and their reported confidence intervals and P-values.算法识别发表的比值与其报告的置信区间和 P 值之间的差异。

Bioinformatics. 2018 May 15;34(10):1758-1766. doi: 10.1093/bioinformatics/btx811.

Alternatives to P value: confidence interval and effect size.P值的替代方法：置信区间和效应量。

Korean J Anesthesiol. 2016 Dec;69(6):555-562. doi: 10.4097/kjae.2016.69.6.555. Epub 2016 Oct 25.

Evolution of Reporting P Values in the Biomedical Literature, 1990-2015.1990 年至 2015 年生物医学文献中报告 P 值的演变。

JAMA. 2016 Mar 15;315(11):1141-8. doi: 10.1001/jama.2016.1952.

Likelihood of Null Effects of Large NHLBI Clinical Trials Has Increased over Time.美国国立心肺血液研究所（NHLBI）大型临床试验出现无效结果的可能性随时间增加。

PLoS One. 2015 Aug 5;10(8):e0132382. doi: 10.1371/journal.pone.0132382. eCollection 2015.

p-Curve and Effect Size: Correcting for Publication Bias Using Only Significant Results.p 值曲线和效应量：仅使用显著结果校正发表偏倚。

Perspect Psychol Sci. 2014 Nov;9(6):666-81. doi: 10.1177/1745691614553988.

The extent and consequences of p-hacking in science.科学中的 p-值操纵的程度和后果。

PLoS Biol. 2015 Mar 13;13(3):e1002106. doi: 10.1371/journal.pbio.1002106. eCollection 2015 Mar.

Why the P-value culture is bad and confidence intervals a better alternative.为什么 P 值文化不好，而置信区间是更好的选择。

Osteoarthritis Cartilage. 2012 Aug;20(8):805-8. doi: 10.1016/j.joca.2012.04.001. Epub 2012 Apr 11.

Too many roads not taken.太多未选择的道路。

Nature. 2011 Feb 10;470(7333):163-5. doi: 10.1038/470163a.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

1976 年至 2019 年健康和医学期刊中置信区间的考察：一项观察性研究。

Examination of CIs in health and medical journals from 1976 to 2019: an observational study.

机构信息

出版信息

OBJECTIVES

DESIGN

SETTING

PARTICIPANTS

OUTCOME MEASURES

RESULTS

CONCLUSIONS

目的

设计

设置

参与者

主要观察指标

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献