Woolson R F, Kleinman J C
University of Iowa, Department of Preventive Medicine, Iowa City 52242.
Annu Rev Public Health. 1989;10:423-40. doi: 10.1146/annurev.pu.10.050189.002231.
The question of whether statistical significance testing should be used for the analysis of public health and epidemiologic data has received considerable attention in recent years. In this paper we have described some of the arguments for and against the use of hypothesis testing for the analysis of biomedical data. In addition, we have reviewed the literature from related fields, in particular sociology and psychology, in which similar discussions have taken place within the last 30 years. Many of the significance testing criticisms in these scientific fields have been raised in the more recent discussions taking place in the biomedical field. We present an example that emphasizes the use of both confidence interval estimation and significance testing. The example is particularly pertinent because it represents a more complex problem than has generally been discussed by critics of significance testing. Much of the discussion on this topic has focused on simple data analysis, such as the analysis of a 2 x 2 table or problems involving simple linear regression. Most epidemiologic data are far more complicated and warrant the use of both confidence interval estimation and significance testing for statistical analysis. Both of these techniques have no doubt been misused in the analysis of data. These misuses may have arisen from a lack of understanding of the role of statistical methods in data analysis and the choice of such methods for data analysis. If used prudently and judiciously, significance testing can help reduce the number of variables involved in a statistical analysis, thereby resulting in shorter confidence intervals for the models presented. Both significance testing and confidence interval estimation can serve and have served very useful functions for the analysis of public health and biomedical data.
近年来,统计显著性检验是否应用于公共卫生和流行病学数据的分析这一问题受到了相当多的关注。在本文中,我们描述了支持和反对使用假设检验来分析生物医学数据的一些论据。此外,我们回顾了相关领域的文献,特别是社会学和心理学领域,在过去30年里这些领域也进行了类似的讨论。这些科学领域中许多对显著性检验的批评在生物医学领域最近的讨论中也被提出。我们给出一个例子,强调同时使用置信区间估计和显著性检验。这个例子特别贴切,因为它代表了一个比显著性检验的批评者通常讨论的更为复杂的问题。关于这个主题的许多讨论都集中在简单的数据分析上,比如2×2表格的分析或涉及简单线性回归的问题。大多数流行病学数据要复杂得多,在统计分析中需要同时使用置信区间估计和显著性检验。毫无疑问,这两种技术在数据分析中都被滥用过。这些滥用可能源于对统计方法在数据分析中的作用以及数据分析方法选择的理解不足。如果谨慎明智地使用,显著性检验可以帮助减少统计分析中涉及的变量数量,从而使所呈现模型的置信区间更短。显著性检验和置信区间估计都能够并且已经在公共卫生和生物医学数据的分析中发挥了非常有用的作用。