Tryon W W
Department of Psychology, Fordham University, Bronx, New York 10458-5198, USA.
Psychol Methods. 2001 Dec;6(4):371-86.
Null hypothesis statistical testing (NHST) has been debated extensively but always successfully defended. The technical merits of NHST are not disputed in this article. The widespread misuse of NHST has created a human factors problem that this article intends to ameliorate. This article describes an integrated, alternative inferential confidence interval approach to testing for statistical difference, equivalence, and indeterminacy that is algebraically equivalent to standard NHST procedures and therefore exacts the same evidential standard. The combined numeric and graphic tests of statistical difference, equivalence, and indeterminacy are designed to avoid common interpretive problems associated with NHST procedures. Multiple comparisons, power, sample size, test reliability, effect size, and cause-effect ratio are discussed. A section on the proper interpretation of confidence intervals is followed by a decision rule summary and caveats.
零假设统计检验(NHST)一直备受广泛争议,但总能成功地得到辩护。本文并不质疑NHST的技术优点。NHST的广泛滥用造成了一个人为因素问题,而本文旨在改善这一问题。本文描述了一种综合的、替代性的推断置信区间方法,用于检验统计差异、等效性和不确定性,该方法在代数上等同于标准NHST程序,因此要求相同的证据标准。统计差异、等效性和不确定性的数值和图形组合检验旨在避免与NHST程序相关的常见解释问题。文中讨论了多重比较、功效、样本量、检验可靠性、效应大小和因果比。在关于置信区间正确解释的一节之后,是决策规则总结和注意事项。