Hunsberger Sally, Long Lori, Reese Sarah E, Hong Gloria H, Myles Ian A, Zerbe Christa S, Chetchotisakd Pleonchan, Shih Joanna H
National Institute of Allergy and Infectious Diseases, Biostatistics Research Branch, 5601 Fishers Lane, Bethesda, Maryland.
Emmes Corporation, Rockville, Maryland.
Stat Neerl. 2022 Aug;76(3):309-330. doi: 10.1111/stan.12261. Epub 2022 Jan 12.
This paper develops methods to test for associations between two variables with clustered data using a -Statistic approach with a second-order approximation to the variance of the parameter estimate for the test statistic. The tests that are presented are for clustered versions of: Pearsons test, the Spearman rank correlation and Kendall's for continuous data or ordinal data and for alternative measures of Kendall's that allow for ties in the data. Shih and Fay use the -Statistic approach but only consider a first-order approximation. The first-order approximation has inflated significance level in scenarios with small sample sizes. We derive the test statistics using the second-order approximations aiming to improve the type I error rates. The method applies to data where clusters have the same number of measurements for each variable or where one of the variables may be measured once per cluster while the other variable may be measured multiple times. We evaluate the performance of the test statistics through simulation with small sample sizes. The methods are all available in the R package cluscor.
本文开发了一些方法,用于使用一种 - 统计量方法来检验具有聚类数据的两个变量之间的关联,该方法对检验统计量的参数估计方差采用二阶近似。所提出的检验适用于以下聚类版本:用于连续数据或有序数据的皮尔逊 检验、斯皮尔曼等级相关和肯德尔 ,以及允许数据中存在 ties 的肯德尔 的替代度量。施和费伊使用 - 统计量方法,但只考虑一阶近似。在小样本量的情况下,一阶近似会使显著性水平膨胀。我们使用二阶近似来推导检验统计量,旨在提高第一类错误率。该方法适用于每个变量在聚类中有相同测量次数的数据,或者一个变量在每个聚类中可能只测量一次而另一个变量可能测量多次的数据。我们通过小样本量的模拟来评估检验统计量的性能。这些方法在 R 包cluscor中均可用。