Eliot Melissa, Ferguson Jane, Reilly Muredach P, Foulkes Andrea S
University of Massachusetts Amherst, USA.
Int J Biostat. 2011;7(1):Article 37. doi: 10.2202/1557-4679.1353. Epub 2011 Sep 27.
Technological advances facilitating the acquisition of large arrays of biomarker data have led to new opportunities to understand and characterize disease progression over time. This creates an analytical challenge, however, due to the large numbers of potentially informative markers, the high degrees of correlation among them, and the time-dependent trajectories of association. We propose a mixed ridge estimator, which integrates ridge regression into the mixed effects modeling framework in order to account for both the correlation induced by repeatedly measuring an outcome on each individual over time, as well as the potentially high degree of correlation among possible predictor variables. An expectation-maximization algorithm is described to account for unknown variance and covariance parameters. Model performance is demonstrated through a simulation study and an application of the mixed ridge approach to data arising from a study of cardiometabolic biomarker responses to evoked inflammation induced by experimental low-dose endotoxemia.
技术进步促进了大量生物标志物数据的获取,为随时间推移理解和表征疾病进展带来了新机遇。然而,这也带来了分析挑战,因为存在大量潜在的信息性标志物、它们之间的高度相关性以及关联的时间依赖性轨迹。我们提出了一种混合岭估计器,它将岭回归整合到混合效应建模框架中,以便既考虑因随时间对每个个体重复测量结果而产生的相关性,又考虑可能的预测变量之间潜在的高度相关性。描述了一种期望最大化算法来处理未知的方差和协方差参数。通过模拟研究以及将混合岭方法应用于一项关于心脏代谢生物标志物对实验性低剂量内毒素血症诱发的炎症反应的数据研究,展示了模型性能。