Ioannidis John P A
Clinical and Molecular Epidemiology Unit, Department of Hygiene and Epidemiology, University of Ioannina School of Medicine, Ioannina, Greece.
Am J Epidemiol. 2008 Aug 15;168(4):374-83; discussion 384-90. doi: 10.1093/aje/kwn156. Epub 2008 Jul 8.
The author evaluated the implications of nominal statistical significance for changing the credibility of null versus alternative hypotheses across a large number of observational associations for which formal statistical significance (p < 0.05) was claimed. Calculation of the Bayes factor (B) under different assumptions was performed on 272 observational associations published in 2004-2005 and a data set of 50 meta-analyses on gene-disease associations (752 studies) for which statistically significant associations had been claimed (p < 0.05). Depending on the formulation of the prior, statistically significant results offered less than strong support to the credibility (B > 0.10) for 54-77% of the 272 epidemiologic associations for diverse risk factors and 44-70% of the 50 associations from genetic meta-analyses. Sometimes nominally statistically significant results even decreased the credibility of the probed association in comparison with what was thought before the study was conducted. Five of six meta-analyses with less than substantial support (B > 0.032) lost their nominal statistical significance in a subsequent (more recent) meta-analysis, while this did not occur in any of seven meta-analyses with decisive support (B < 0.01). In these large data sets of observational associations, formal statistical significance alone failed to increase much the credibility of many postulated associations. Bayes factors may be used routinely to interpret "significant" associations.
作者评估了名义统计显著性对于改变大量观察性关联中零假设与备择假设可信度的影响,这些关联都声称具有形式上的统计显著性(p < 0.05)。在2004 - 2005年发表的272个观察性关联以及一个关于基因 - 疾病关联的50项荟萃分析数据集(752项研究)上进行了不同假设下的贝叶斯因子(B)计算,这些研究都声称具有统计学显著关联(p < 0.05)。根据先验的设定,对于272个不同风险因素的流行病学关联中的54 - 77%以及基因荟萃分析中的50个关联中的44 - 70%,具有统计学显著的结果对可信度(B > 0.10)的支持并不强烈。有时,名义上具有统计学显著的结果相较于研究开展前的预期,甚至降低了所探究关联的可信度。六项支持力度不足(B > 0.032)的荟萃分析中有五项在后续(更新的)荟萃分析中失去了其名义上的统计显著性,而在七项具有决定性支持(B < 0.01)的荟萃分析中均未出现这种情况。在这些大量的观察性关联数据集中,仅形式上的统计显著性并不能大幅提高许多假定关联的可信度。贝叶斯因子可常规用于解释“显著”关联。