Australian Centre for Health Services Innovation and Centre for Healthcare Transformation, School of Public Health and Social Work, Queensland University of Technology, Australia.
Australian Centre for Health Services Innovation and Centre for Healthcare Transformation, School of Public Health and Social Work, Queensland University of Technology, Australia.
J Sci Med Sport. 2023 Mar;26(3):164-168. doi: 10.1016/j.jsams.2023.03.002. Epub 2023 Mar 7.
We aimed to examine the bias for statistical significance using published confidence intervals in sport and exercise medicine research.
Observational study.
The abstracts of 48,390 articles, published in 18 sports and exercise medicine journals between 2002 and 2022, were searched using a validated text-mining algorithm that identified and extracted ratio confidence intervals (odds, hazard, and risk ratios). The algorithm identified 1744 abstracts that included ratio confidence intervals, from which 4484 intervals were extracted. After excluding ineligible intervals, the analysis used 3819 intervals, reported as 95 % confidence intervals, from 1599 articles. The cumulative distributions of lower and upper confidence limits were plotted to identify any abnormal patterns, particularly around a ratio of 1 (the null hypothesis). The distributions were compared to those from unbiased reference data, which was not subjected to p-hacking or publication bias. A bias for statistical significance was further investigated using a histogram plot of z-values calculated from the extracted 95 % confidence intervals.
There was a marked change in the cumulative distribution of lower and upper bound intervals just over and just under a ratio of 1. The bias for statistical significance was also clear in a stark under-representation of z-values between -1.96 and +1.96, corresponding to p-values above 0.05.
There was an excess of published research with statistically significant results just below the standard significance threshold of 0.05, which is indicative of publication bias. Transparent research practices, including the use of registered reports, are needed to reduce the bias in published research.
本研究旨在通过审查运动医学研究中已发表的置信区间,探讨统计学意义的偏差。
观察性研究。
使用一种经过验证的文本挖掘算法,检索了 2002 年至 2022 年期间在 18 种运动与运动医学期刊上发表的 48390 篇文章的摘要,该算法可以识别和提取比值置信区间(比值比、风险比和危险比)。该算法从包含比值置信区间的 1744 篇摘要中提取了 4484 个区间,排除不符合条件的区间后,共纳入 1599 篇文章中的 3819 个区间。使用 95%置信区间报告这些区间,分析这些区间的下限和上限的累积分布,以确定是否存在异常模式,特别是在比值为 1(零假设)附近。将分布与不受 p 值操纵或发表偏倚影响的无偏参考数据进行比较。使用从提取的 95%置信区间计算得到的 z 值的直方图进一步研究统计学意义的偏差。
在比值为 1 左右的区间下限和上限的累积分布出现明显变化。从提取的 95%置信区间计算得到的 z 值明显偏态分布,在 -1.96 到 +1.96 之间(对应 p 值大于 0.05)的 z 值明显偏少,这也表明存在统计学意义的偏差。
存在大量发表的研究结果仅略低于 0.05 的标准显著性阈值,表明存在发表偏倚。需要采用透明的研究实践,包括使用预先注册报告,以减少发表研究中的偏差。