1 Department of Psychiatry, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
2 Department of Biostatistics and Informatics, School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
Stat Methods Med Res. 2019 May;28(5):1399-1411. doi: 10.1177/0962280218758357. Epub 2018 Feb 28.
Medical research is often designed to investigate changes in a collection of response variables that are measured repeatedly on the same subjects. The multivariate generalized linear mixed model (MGLMM) can be used to evaluate random coefficient associations (e.g. simple correlations, partial regression coefficients) among outcomes that may be non-normal and differently distributed by specifying a multivariate normal distribution for their random effects and then evaluating the latent relationship between them. Empirical Bayes predictors are readily available for each subject from any mixed model and are observable and hence, plotable. Here, we evaluate whether second-stage association analyses of empirical Bayes predictors from a MGLMM, provide a good approximation and visual representation of these latent association analyses using medical examples and simulations. Additionally, we compare these results with association analyses of empirical Bayes predictors generated from separate mixed models for each outcome, a procedure that could circumvent computational problems that arise when the dimension of the joint covariance matrix of random effects is large and prohibits estimation of latent associations. As has been shown in other analytic contexts, the p-values for all second-stage coefficients that were determined by naively assuming normality of empirical Bayes predictors provide a good approximation to p-values determined via permutation analysis. Analyzing outcomes that are interrelated with separate models in the first stage and then associating the resulting empirical Bayes predictors in a second stage results in different mean and covariance parameter estimates from the maximum likelihood estimates generated by a MGLMM. The potential for erroneous inference from using results from these separate models increases as the magnitude of the association among the outcomes increases. Thus if computable, scatterplots of the conditionally independent empirical Bayes predictors from a MGLMM are always preferable to scatterplots of empirical Bayes predictors generated by separate models, unless the true association between outcomes is zero.
医学研究通常旨在研究同一组受试者反复测量的一组反应变量的变化。多变量广义线性混合模型 (MGLMM) 可用于评估可能是非正态且分布不同的结果之间的随机系数关联(例如,简单相关、偏回归系数),方法是为其随机效应指定多元正态分布,然后评估它们之间的潜在关系。从任何混合模型中,每个受试者都可以方便地获得经验贝叶斯预测值,这些预测值是可观察的,因此是可绘制的。在这里,我们使用医学实例和模拟评估 MGLMM 中经验贝叶斯预测值的第二阶段关联分析是否为这些潜在关联分析提供了良好的近似值和直观表示。此外,我们将这些结果与针对每个结果生成的单独混合模型的经验贝叶斯预测值的关联分析进行了比较,这种方法可以避免由于随机效应联合协方差矩阵的维度较大而导致的计算问题,从而无法估计潜在关联。正如在其他分析环境中所表明的那样,通过简单地假设经验贝叶斯预测值的正态性来确定的所有第二阶段系数的 p 值为通过置换分析确定的 p 值提供了很好的近似值。在第一阶段将相关的结果分析为独立模型,然后在第二阶段关联得到的经验贝叶斯预测值,会导致来自 MGLMM 的最大似然估计的均值和协方差参数估计与来自最大似然估计的均值和协方差参数估计不同。如果可以计算,则使用这些单独模型的结果进行推断的潜在错误会随着结果之间关联的大小而增加。因此,除非结果之间的真实关联为零,否则从 MGLMM 获得的条件独立经验贝叶斯预测值的散点图始终优于单独模型生成的经验贝叶斯预测值的散点图。