Suppr超能文献

卡方检验:它们在医学研究数据分析中有多大用处?

Chi 2 tests: how useful are they in the analysis of medical research data?

作者信息

Osborn J F

出版信息

Ann Ig. 1989 May-Aug;1(3-4):417-32.

PMID:2483622
Abstract

This paper has outlined analyses of data for which the chi 2 test is commonly applied and has shown that on its own, a chi 2 test provides very little information about the interpretation of the data. In goodness to fit tests, at best, the chi 2 test is a mere first step in the interpretation of the data and much more information can be gleaned from the deviations between the observed and theoretical distributions, and graphical methods are more sensitive for assessing Normality. Similarly, the chi 2 test as the analysis of a 2 x 2 contingency table misses the most important features of the data. A chi 2 value and associated p value should not be presented without an estimation of the effect and its confidence interval. The use of NS, (not significant), after a chi 2 value (or any other test statistic) is particularly meaningless as it does not even specify a level of significance. The analysis of 2 x 2 contingency tables should specify a measure of the effect such as the difference between two proportions, their ratio or the odds ratio. Confidence intervals should be calculated and least importantly, statistical significance assessed by the Standard Normal Deviate. Chi 2 has no useful role to play in the analysis of 2 x 2 contingency tables. In fact, even with larger two dimensional tables, although chi 2 can be used to test the significance of associations between the rows and columns, such results are seldom of much interest as they give no indication of where in the table associations may exist. They are insensitive particularly if the categories of the row or column variables are quantitative since chi 2 takes no account of the ordering of the rows and columns. Although simple analyses of such tables are inefficient the advent of desk-top computers means that more sophisticated techniques such as logistic regression for proportions and Poisson regression for larger or multidimensional contingency tables can be applied. Although these methods may involve chi 2 tests for testing the significance of including or excluding variables, they are always associated with estimates of effects as measured by regression coefficients. The use of chi 2 for the simple analysis of contingency tables has no place in the interpretation of medical research data.

摘要

本文概述了通常应用卡方检验的数据的分析情况,并表明仅靠卡方检验本身,对于数据的解读提供的信息非常少。在拟合优度检验中,卡方检验充其量只是数据解读的第一步,从观察到的分布与理论分布之间的偏差中可以收集到更多信息,而且图形方法在评估正态性方面更为敏感。同样,作为对2×2列联表的分析,卡方检验忽略了数据最重要的特征。在没有估计效应及其置信区间的情况下,不应给出卡方值和相关的p值。在卡方值(或任何其他检验统计量)之后使用“NS”(无显著性)尤其没有意义,因为它甚至没有指定显著性水平。对2×2列联表的分析应指定效应的度量,例如两个比例之间的差异、它们的比率或比值比。应计算置信区间,最不重要的是,通过标准正态偏差评估统计显著性。卡方检验在2×2列联表的分析中没有有用的作用。事实上,即使对于更大的二维表,虽然卡方检验可用于检验行与列之间关联的显著性,但这样的结果很少有太大意义,因为它们没有表明表中可能存在关联的位置。如果行或列变量的类别是定量的,它们就不敏感,因为卡方检验不考虑行和列的顺序。虽然对这种表的简单分析效率不高,但台式计算机的出现意味着可以应用更复杂的技术,如用于比例的逻辑回归和用于更大或多维列联表的泊松回归。虽然这些方法可能涉及用于检验包含或排除变量显著性的卡方检验,但它们总是与通过回归系数测量的效应估计相关联。在医学研究数据的解读中,使用卡方检验对列联表进行简单分析毫无用处。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验