Yuan Ke-Hai
a Department of Psychology , University of Notre Dame .
Multivariate Behav Res. 2008 Jan-Mar;43(1):109-36. doi: 10.1080/00273170701836729.
In the literature of mean and covariance structure analysis, noncentral chi-square distribution is commonly used to describe the behavior of the likelihood ratio (LR) statistic under alternative hypothesis. Due to the inaccessibility of the rather technical literature for the distribution of the LR statistic, it is widely believed that the noncentral chi-square distribution is justified by statistical theory. Actually, when the null hypothesis is not trivially violated, the noncentral chi-square distribution cannot describe the LR statistic well even when data are normally distributed and the sample size is large. Using the one-dimensional case, this article provides the details showing that the LR statistic asymptotically follows a normal distribution, which also leads to an asymptotically correct confidence interval for the discrepancy between the null hypothesis/model and the population. For each one-dimensional result, the corresponding results in the higher dimensional case are pointed out and references are provided. Examples with real data illustrate the difference between the noncentral chi-square distribution and the normal distribution. Monte Carlo results compare the strength of the normal distribution against that of the noncentral chi-square distribution. The implication to data analysis is discussed whenever relevant. The development is built upon the concepts of basic calculous, linear algebra, and introductory probability and statistics. The aim is to provide the least technical material for quantitative graduate students in social science to understand the condition and limitation of the noncentral chi-square distribution.
在均值和协方差结构分析的文献中,非中心卡方分布通常用于描述备择假设下似然比(LR)统计量的行为。由于关于LR统计量分布的技术文献难以获取,人们普遍认为非中心卡方分布有统计理论作为支撑。实际上,当原假设未被轻易违背时,即使数据呈正态分布且样本量很大,非中心卡方分布也不能很好地描述LR统计量。本文通过一维情形详细说明了LR统计量渐近服从正态分布,这也为原假设/模型与总体之间的差异得出了渐近正确的置信区间。对于每个一维结果,都指出了高维情形下的相应结果并提供了参考文献。实际数据的例子说明了非中心卡方分布与正态分布之间的差异。蒙特卡罗结果比较了正态分布与非中心卡方分布的优势。只要相关,就会讨论对数据分析的影响。本文的推导基于基础微积分、线性代数以及概率与统计入门的概念。目的是为社会科学领域的定量研究生提供最少的技术材料,以理解非中心卡方分布的条件和局限性。