Muller K E, Benignus V A
Department of Biostatistics, University of North Carolina, Chapel Hill 27599.
Neurotoxicol Teratol. 1992 May-Jun;14(3):211-9. doi: 10.1016/0892-0362(92)90019-7.
A survey of basic ideas in statistical power analysis demonstrates the advantages and ease of using power analysis throughout the design, analysis, and interpretation of research. The power of a statistical test is the probability of rejecting the null hypothesis of the test. The traditional approach to power involves computation of only a single power value. The more general power curve allows examining the range of power determinants, which are sample size, population difference, and error variance, in traditional ANOVA. Power analysis can be useful not only in study planning, but also in the evaluation of existing research. An important application is in concluding that no scientifically important treatment difference exists. Choosing an appropriate power depends on: a) opportunity costs, b) ethical trade-offs, c) the size of effect considered important, d) the uncertainty of parameter estimates, and e) the analyst's preferences. Although precise rules seem inappropriate, several guidelines are defensible. First, the sensitivity of the power curve to particular characteristics of the study, such as the error variance, should be examined in any power analysis. Second, just as a small type I error rate should be demonstrated in order to declare a difference nonzero, a small type II error should be demonstrated in order to declare a difference zero. Third, when ethical and opportunity costs do not preclude it, power should be at least .84, and preferably greater than .90.
一项关于统计功效分析基本概念的调查表明,在研究的设计、分析和解释过程中使用功效分析具有诸多优势且操作简便。统计检验的功效是拒绝该检验原假设的概率。传统的功效分析方法仅涉及计算单个功效值。更通用的功效曲线则允许考察传统方差分析中功效决定因素的范围,这些因素包括样本量、总体差异和误差方差。功效分析不仅在研究规划中有用,在评估现有研究时也很有用。一个重要的应用是得出不存在具有科学重要性的治疗差异这一结论。选择合适的功效取决于:a)机会成本,b)伦理权衡,c)被认为重要的效应大小,d)参数估计的不确定性,以及e)分析师的偏好。尽管精确的规则似乎并不合适,但有几条指导原则是合理的。首先,在任何功效分析中都应考察功效曲线对研究特定特征(如误差方差)的敏感性。其次,正如为了宣称差异非零应证明具有较小的I类错误率一样,为了宣称差异为零应证明具有较小的II类错误率。第三,当伦理和机会成本不排除时,功效应至少为0.84,最好大于0.90。