Reiser Mark
School of Social and Family Dynamics, Arizona State University, Box 873701, Tempe, AZ 85287-3701, USA.
Br J Math Stat Psychol. 2008 Nov;61(Pt 2):331-60. doi: 10.1348/000711007X204215. Epub 2007 Apr 21.
The goodness-of-fit test based on Pearson's chi-squared statistic is sometimes considered to be an omnibus test that gives little guidance to the source of poor fit when the null hypothesis is rejected. It has also been recognized that the omnibus test can often be outperformed by focused or directional tests of lower order. In this paper, a test is considered for a model on a data table formed by the cross-classification of q dichotomous variables, and a score statistic on overlapping cells that correspond to the first- through qth-order marginal frequencies is presented. Then orthogonal components of the Pearson-Fisher statistic are defined on marginal frequencies. The orthogonal components may be used to form test statistics, and a log-linear version of an item response model is used to investigate the order and dilution of a test based on these components, as well as the projection of components onto the space of lower-order marginals. The advantage of the components in terms of power and detection of the source of poor fit is demonstrated. Overcoming the adverse effects of sparseness provides another motive for using components based on marginal frequencies because an asymptotic chi-squared distribution will be more reliable for a statistic formed on overlapping cells if expected frequencies in the joint distribution are small.
基于皮尔逊卡方统计量的拟合优度检验有时被视为一种综合检验,当原假设被拒绝时,它几乎无法为拟合不佳的根源提供指导。人们也已经认识到,这种综合检验往往会被低阶的聚焦或定向检验超越。在本文中,我们考虑对由q个二分变量交叉分类形成的数据表上的一个模型进行检验,并给出了一个对应于一阶到q阶边际频率的重叠单元格上的得分统计量。然后在边际频率上定义了皮尔逊 - 费希尔统计量的正交分量。这些正交分量可用于形成检验统计量,并且使用项目反应模型的对数线性版本来研究基于这些分量的检验的阶数和稀释情况,以及分量在低阶边际空间上的投影。展示了这些分量在功效和识别拟合不佳根源方面的优势。克服稀疏性的不利影响为使用基于边际频率的分量提供了另一个动机,因为如果联合分布中的期望频率较小,对于在重叠单元格上形成的统计量,渐近卡方分布将更可靠。