Chang Shuhua, Li Deli, Qi Yongcheng
Coordinated Innovation Center for Computable Modeling in Management Science, Yango University, Fuzhou, Fujian, People's Republic of China.
Coordinated Innovation Center for Computable Modeling in Management Science, Tianjin University of Finance and Economics, Tianjin, People's Republic of China.
J Appl Stat. 2021 Dec 30;50(5):1078-1093. doi: 10.1080/02664763.2021.2017413. eCollection 2023.
Pearson's chi-squared test is widely used to test the goodness of fit between categorical data and a given discrete distribution function. When the number of sets of the categorical data, say , is a fixed integer, Pearson's chi-squared test statistic converges in distribution to a chi-squared distribution with -1 degrees of freedom when the sample size goes to infinity. In real applications, the number often changes with and may be even much larger than . By using the martingale techniques, we prove that Pearson's chi-squared test statistic converges to the normal under quite general conditions. We also propose a new test statistic which is more powerful than chi-squared test statistic based on our simulation study. A real application to lottery data is provided to illustrate our methodology.
皮尔逊卡方检验被广泛用于检验分类数据与给定离散分布函数之间的拟合优度。当分类数据的组数(比如说(k))为固定整数时,当样本量(n)趋于无穷大时,皮尔逊卡方检验统计量依分布收敛到自由度为(k - 1)的卡方分布。在实际应用中,组数(k)常常随(n)变化,甚至可能比(n)大得多。通过使用鞅技术,我们证明了在相当一般的条件下,皮尔逊卡方检验统计量收敛到正态分布。基于我们的模拟研究,我们还提出了一种比卡方检验统计量更具功效的新检验统计量。提供了一个彩票数据的实际应用示例来说明我们的方法。