Bishara Anthony J, Li Jiexiang, Nash Thomas
Department of Psychology, College of Charleston, South Carolina, USA.
Department of Mathematics, College of Charleston, South Carolina, USA.
Br J Math Stat Psychol. 2018 Feb;71(1):167-185. doi: 10.1111/bmsp.12113. Epub 2017 Sep 4.
When bivariate normality is violated, the default confidence interval of the Pearson correlation can be inaccurate. Two new methods were developed based on the asymptotic sampling distribution of Fisher's z' under the general case where bivariate normality need not be assumed. In Monte Carlo simulations, the most successful of these methods relied on the (Vale & Maurelli, 1983, Psychometrika, 48, 465) family to approximate a distribution via the marginal skewness and kurtosis of the sample data. In Simulation 1, this method provided more accurate confidence intervals of the correlation in non-normal data, at least as compared to no adjustment of the Fisher z' interval, or to adjustment via the sample joint moments. In Simulation 2, this approximate distribution method performed favourably relative to common non-parametric bootstrap methods, but its performance was mixed relative to an observed imposed bootstrap and two other robust methods (PM1 and HC4). No method was completely satisfactory. An advantage of the approximate distribution method, though, is that it can be implemented even without access to raw data if sample skewness and kurtosis are reported, making the method particularly useful for meta-analysis. Supporting information includes R code.
当违背二元正态性时,皮尔逊相关系数的默认置信区间可能不准确。在无需假定二元正态性的一般情况下,基于费希尔z'的渐近抽样分布开发了两种新方法。在蒙特卡罗模拟中,这些方法中最成功的一种依赖于(Vale和Maurelli,1983年,《心理测量学》,48卷,465页)的方法族,通过样本数据的边际偏度和峰度来近似分布。在模拟1中,该方法在非正态数据中提供了更准确的相关系数置信区间,至少与不调整费希尔z'区间或通过样本联合矩进行调整相比是这样。在模拟2中,这种近似分布方法相对于常见的非参数自助法表现良好,但其相对于观察施加自助法和其他两种稳健方法(PM1和HC4)的表现好坏参半。没有一种方法是完全令人满意的。不过,近似分布方法的一个优点是,如果报告了样本偏度和峰度,即使无法获取原始数据也可以实施,这使得该方法对元分析特别有用。支持信息包括R代码。