Moses L E, Emerson J D, Hosseini H
N Engl J Med. 1984 Aug 16;311(7):442-8. doi: 10.1056/NEJM198408163110705.
Clinical investigations often involve data in the form of ordered categories--e.g., "worse," "unchanged," "improved," "much improved." Comparison of two groups when the data are of this kind should not be done by the chi-square test, which wastes information and is insensitive in this context. The Wilcoxon-Mann-Whitney test provides a proper analysis. Alternatively, scores may be assigned to the categories in order, and the t-test applied. We demonstrate both approaches here. Sometimes data in ordered categories are reduced to a two-by-two table by the collapsing of the high categories into one category and the low categories into another. This practice is inefficient; moreover, it entails avoidable subjectivity in the choice of the cutting point that defines the two super-categories. The Wilcoxon-Mann-Whitney procedure (or the t-test with use of ordered scores) is preferable. A survey of research articles in Volume 306 of the New England Journal of Medicine shows many instances of ordered-category data (about 20 per cent of the articles had such data) and no instance of analysis by the preferred methods presented here. We suggest that investigators who are unfamiliar with these methods should seek the assistance of a professional statistician when they must deal with such data.
临床研究常常涉及有序分类形式的数据,例如“恶化”“未改变”“改善”“显著改善”。当数据属于此类时,两组之间的比较不应通过卡方检验进行,因为卡方检验会浪费信息且在此种情况下不够灵敏。 Wilcoxon-Mann-Whitney检验可提供恰当的分析。或者,可以按顺序为类别赋值,然后应用t检验。我们在此展示这两种方法。有时,通过将高类别合并为一个类别,低类别合并为另一个类别,有序分类数据会被简化为一个二乘二表格。这种做法效率低下;此外,在定义两个超级类别的切点选择上存在不可避免的主观性。 Wilcoxon-Mann-Whitney方法(或使用有序分数的t检验)更可取。对《新英格兰医学杂志》第306卷中的研究文章进行的一项调查显示,有序分类数据的实例很多(约20%的文章有此类数据),但没有一篇文章采用此处介绍的首选方法进行分析。我们建议,不熟悉这些方法的研究人员在必须处理此类数据时应寻求专业统计学家的帮助。