Ecology. 2014 Mar;95(3):611-7. doi: 10.1890/13-0590.1.
Statistical hypothesis testing has been widely criticized by ecologists in recent years. I review some of the more persistent criticisms of P values and argue that most stem from misunderstandings or incorrect interpretations, rather than from intrinsic shortcomings of the P value. I show that P values are intimately linked to confidence intervals and to differences in Akaike's information criterion (deltaAIC), two metrics that have been advocated as replacements for the P value. The choice of a threshold value of deltaAIC that breaks ties among competing models is as arbitrary as the choice of the probability of a Type I error in hypothesis testing, and several other criticisms of the P value apply equally to deltaAIC. Since P values, confidence intervals, and deltaAIC are based on the same statistical information, all have their places in modern statistical practice. The choice of which to use should be stylistic, dictated by details of the application rather than by dogmatic, a priori considerations.
近年来,统计假设检验受到了生态学家的广泛批评。我回顾了一些对 P 值更持久的批评,并认为大多数批评源于误解或不正确的解释,而不是 P 值本身的固有缺陷。我表明 P 值与置信区间以及 Akaike 信息准则(deltaAIC)密切相关,这两个指标被认为是 P 值的替代品。选择打破竞争模型之间平局的 deltaAIC 的阈值值与在假设检验中选择一类错误的概率一样任意,并且 P 值的其他几个批评同样适用于 deltaAIC。由于 P 值、置信区间和 deltaAIC 基于相同的统计信息,因此它们都在现代统计实践中有其应用。选择使用哪种方法应该是风格上的,取决于应用的细节,而不是教条的、先验的考虑。