Xu Zhenyu, Fine Jason P, Song Wenling, Yan Jun
Department of Statistics, University of Connecticut, Storrs, Connecticut.
Department of Statistics, University of Pittsburgh, Pittsburgh, Pennsylvania.
Stat Med. 2025 Jan 15;44(1-2):e10271. doi: 10.1002/sim.10271. Epub 2024 Dec 12.
Generalized estimating equations (GEE) are of great importance in analyzing clustered data without full specification of multivariate distributions. A recent approach by Luo and Pan jointly models the mean, variance, and correlation coefficients of clustered data through three sets of regressions. We note that it represents a specific case of the more general estimating equations proposed by Yan and Fine which further allow the variance to depend on the mean through a variance function. In certain scenarios, the proposed variance estimators for the variance and correlation parameters in Luo and Pan may face challenges due to the subtle dependence induced by the nested structure of the estimating equations. We characterize specific model settings where their variance estimation approach may encounter limitations and illustrate how the variance estimators in Yan and Fine can correctly account for such dependencies. In addition, we introduce a novel model selection criterion that enables the simultaneous selection of the mean-scale-correlation model. The sandwich variance estimator and the proposed model selection criterion are tested by several simulation studies and real data analysis, which validate its effectiveness in variance estimation and model selection. Our work also extends the R package geepack with the flexibility to apply different working covariance matrices for the variance and correlation structures.
广义估计方程(GEE)在分析聚类数据时非常重要,无需完全指定多元分布。罗和潘最近提出的一种方法通过三组回归对聚类数据的均值、方差和相关系数进行联合建模。我们注意到,它代表了严和法恩提出的更一般估计方程的一个具体情况,该方程进一步允许方差通过方差函数依赖于均值。在某些情况下,罗和潘对方差和相关参数提出的方差估计器可能会面临挑战,因为估计方程的嵌套结构会导致微妙的依赖性。我们刻画了其方差估计方法可能遇到局限性的特定模型设置,并说明了严和法恩的方差估计器如何能够正确地考虑这种依赖性。此外,我们引入了一种新颖的模型选择标准,能够同时选择均值 - 尺度 - 相关模型。通过几个模拟研究和实际数据分析对方差估计器和提出的模型选择标准进行了测试,验证了其在方差估计和模型选择方面的有效性。我们的工作还扩展了R包geepack,使其能够灵活地应用不同的工作协方差矩阵来处理方差和相关结构。