Verma Surendra P, Díaz-González Lorena, Rosales-Rivera Mauricio, Quiroz-Ruiz Alfredo
Departamento de Sistemas Energéticos, Instituto de Energías Renovables, Universidad Nacional Autónoma de México, 62580 Temixco, MOR, Mexico.
Facultad de Ciencias, Universidad Autónoma del Estado de Morelos, 62209 Cuernavaca, MOR, Mexico.
ScientificWorldJournal. 2014 Mar 11;2014:746451. doi: 10.1155/2014/746451. eCollection 2014.
Using highly precise and accurate Monte Carlo simulations with 20,000,000 replications and 102 independent simulation experiments, which gave extremely low simulation errors and total uncertainties, we evaluated the performance of four single-outlier discordancy tests (Grubbs test N2, Dixon test N8, skewness test N14, and kurtosis test N15) for normal samples of sizes 5 to 20. We simulated the statistical contamination of a single observation, controlled either by the parameter δ (from ±0.1 up to ±20), which models slippage of central tendency, or by the parameter ε (from ±1.1 up to ±200), which models slippage of dispersion, as well as the uncontaminated case (δ = 0 and ε = ±1). Because it relies on precise and accurate normally distributed random data, a very large number of replications, and a large number of independent experiments, this paper presents a novel approach for precise and accurate estimation of the power functions of four popular discordancy tests and, therefore, should not be considered a simple simulation exercise unrelated to probability and statistics. Under both the Power of Test criterion proposed by Hayes and Kinsella and the Test Performance Criterion of Barnett and Lewis, Dixon test N8 performs less well than the other three tests. The overall performance of the four tests can be summarized as N2 ≅ N15 > N14 > N8.
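The kind of simulation summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code and makes several assumptions: the function names `grubbs_n2` and `simulate` are hypothetical; N2 is taken to be the two-sided Grubbs-type statistic max|x_i − mean|/s; the contaminant is assumed to be drawn from N(δ, ε²) while the remaining observations are N(0, 1); and the critical value is itself estimated by simulation rather than taken from published tables. The two rates it returns are simple stand-ins (any rejection, and rejection with the contaminant flagged), not exact implementations of the Hayes and Kinsella or Barnett and Lewis criteria.

```python
import numpy as np


def grubbs_n2(samples):
    """Row-wise max|x_i - mean| / s and the index of the most extreme value.

    Assumption: this two-sided form is what the abstract calls test N2.
    """
    mean = samples.mean(axis=1, keepdims=True)
    sd = samples.std(axis=1, ddof=1, keepdims=True)
    dev = np.abs(samples - mean) / sd
    return dev.max(axis=1), dev.argmax(axis=1)


def simulate(n, delta=0.0, epsilon=1.0, reps=100_000, alpha=0.05, seed=0):
    """Estimate rejection rates for one contaminated observation.

    Assumed model: observation 0 ~ N(delta, epsilon^2), the rest ~ N(0, 1).
    Returns (critical value, rejection rate, rate of rejections in which
    the contaminant is the flagged observation).
    """
    rng = np.random.default_rng(seed)

    # Critical value estimated from uncontaminated samples.
    null = rng.standard_normal((reps, n))
    crit = np.quantile(grubbs_n2(null)[0], 1.0 - alpha)

    # Independent samples with a single contaminant in column 0.
    data = rng.standard_normal((reps, n))
    data[:, 0] = delta + epsilon * rng.standard_normal(reps)

    stat, idx = grubbs_n2(data)
    rejected = stat > crit
    power = rejected.mean()
    correct = (rejected & (idx == 0)).mean()
    return crit, power, correct


if __name__ == "__main__":
    for delta in (0.0, 2.0, 5.0, 10.0):
        crit, power, correct = simulate(n=10, delta=delta)
        print(f"delta={delta:5.1f}  crit={crit:.3f}  "
              f"power={power:.4f}  contaminant flagged={correct:.4f}")
```

With δ = 0 and ε = 1 the rejection rate should stay close to the chosen significance level, which serves as a basic sanity check. The sketch uses far fewer replications than the 20,000,000 replications and 102 independent experiments reported in the abstract, so its estimates carry much larger simulation error.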