Suppr超能文献

1976 年至 2019 年健康和医学期刊中置信区间的考察:一项观察性研究。

Examination of CIs in health and medical journals from 1976 to 2019: an observational study.

机构信息

Institute of Health and Biomedical Innovation, Queensland University of Technology, Kelvin Grove, Queensland, Australia

Arthritis and Clinical Immunology Research Program, Division of Genomics and Data Sciences, Oklahoma Medical Research Foundation, Oklahoma City, Oklahoma, USA.

出版信息

BMJ Open. 2019 Nov 21;9(11):e032506. doi: 10.1136/bmjopen-2019-032506.

Abstract

OBJECTIVES

Previous research has shown clear biases in the distribution of published p values, with an excess below the 0.05 threshold due to a combination of p-hacking and publication bias. We aimed to examine the bias for statistical significance using published confidence intervals.

DESIGN

Observational study.

SETTING

Papers published in since 1976.

PARTICIPANTS

Over 968 000 confidence intervals extracted from abstracts and over 350 000 intervals extracted from the full-text.

OUTCOME MEASURES

Cumulative distributions of lower and upper confidence interval limits for ratio estimates.

RESULTS

We found an excess of statistically significant results with a glut of lower intervals just above one and upper intervals just below 1. These excesses have not improved in recent years. The excesses did not appear in a set of over 100 000 confidence intervals that were not subject to p-hacking or publication bias.

CONCLUSIONS

The huge excesses of published confidence intervals that are just below the statistically significant threshold are not statistically plausible. Large improvements in research practice are needed to provide more results that better reflect the truth.

摘要

目的

先前的研究表明,已发表的 p 值分布存在明显的偏差,由于 p 值操纵和发表偏倚的综合作用,低于 0.05 阈值的 p 值过多。我们旨在使用已发表的置信区间来检验统计显著性的偏差。

设计

观察性研究。

设置

自 1976 年以来发表在《柳叶刀》上的论文。

参与者

从摘要中提取了超过 968000 个置信区间,从全文中提取了超过 350000 个置信区间。

主要观察指标

比值估计的置信区间下限和上限的累积分布。

结果

我们发现具有统计学意义的结果过多,且大量的下限刚好略高于 1,上限刚好略低于 1。近年来,这种过剩情况并没有改善。在一组不受 p 值操纵或发表偏倚影响的超过 100000 个置信区间中,没有出现这种过剩情况。

结论

大量略低于统计学显著阈值的发表置信区间的巨大过剩情况在统计学上是不合理的。需要在研究实践中做出重大改进,以提供更多更好地反映真实情况的结果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/63f8/6887056/5ab29c65da9c/bmjopen-2019-032506f01.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验