Killeen Peter R
Department of Psychology, Arizona State University, Box 1104, Tempe, AZ 85287-1104, USA.
Psychon Bull Rev. 2006 Aug;13(4):549-62. doi: 10.3758/bf03193962.
Traditional null hypothesis significance testing does not yield the probability of the null or its alternative and, therefore, cannot logically ground scientific decisions. The decision theory proposed here calculates the expected utility of an effect on the basis of (1) the probability of replicating it and (2) a utility function on its size. It takes significance tests--which place all value on the replicability of an effect and none on its magnitude--as a special case, one in which the cost of a false positive is revealed to be an order of magnitude greater than the value of a true positive. More realistic utility functions credit both replicability and effect size, integrating them for a single index of merit. The analysis incorporates opportunity cost and is consistent with alternate measures of effect size, such as r2 and information transmission, and with Bayesian model selection criteria. An alternate formulation is functionally equivalent to the formal theory, transparent, and easy to compute.
传统的零假设显著性检验无法得出零假设或其备择假设的概率,因此,无法从逻辑上为科学决策提供依据。这里提出的决策理论基于以下两点来计算效应的预期效用:(1)效应可重复性的概率;(2)效应大小的效用函数。它将显著性检验——只关注效应的可重复性而不考虑效应大小——视为一种特殊情况,即假阳性的代价被揭示比真阳性的价值大一个数量级。更现实的效用函数会同时考虑可重复性和效应大小,并将它们整合为一个单一的优劣指标。该分析纳入了机会成本,并且与效应大小的其他度量方法(如r2和信息传递)以及贝叶斯模型选择标准相一致。另一种表述在功能上等同于形式理论,具有透明度高且易于计算的特点。