Dunson David B, Herring Amy, Siega-Riz Anna Maria
Biostatistics Branch, MD A3-03, National Institute of Environmental Health Sciences, P.O. Box 12233, RTP, NC 27709.
Department of Biostatistics, The University of North Carolina at Chapel Hill.
J Am Stat Assoc. 2008;103(484):1508-1517. doi: 10.1198/016214508000001039. Epub 2012 Jan 1.
In epidemiology, it is often of interest to assess how individuals with different trajectories over time in an environmental exposure or biomarker differ with respect to a continuous response. For ease in interpretation and presentation of results, epidemiologists typically categorize predictors prior to analysis. To extend this approach to time-varying predictors, one can cluster individuals by their predictor trajectory, with the cluster index included as a predictor in a regression model for the response. This article develops a semiparametric Bayes approach, which avoids assuming a pre-specified number of clusters and allows the response to vary nonparametrically over predictor clusters. This methodology is motivated by interest in relating trajectories in weight gain during pregnancy to the distribution of birth weight adjusted for gestational age at delivery. In this setting, the proposed approach allows the tails of the birth weight density to vary flexibly over weight gain clusters.
在流行病学中,评估环境暴露或生物标志物随时间具有不同变化轨迹的个体在连续反应方面如何不同通常很有意义。为便于结果的解释和呈现,流行病学家通常在分析之前对预测变量进行分类。为将此方法扩展到随时间变化的预测变量,可根据其预测变量轨迹对个体进行聚类,并将聚类指标作为反应回归模型中的一个预测变量。本文提出一种半参数贝叶斯方法,该方法避免假设预先指定的聚类数量,并允许反应在预测变量聚类上非参数地变化。这种方法的动机源于对将孕期体重增加轨迹与根据分娩时胎龄调整后的出生体重分布相关联的兴趣。在这种情况下,所提出的方法允许出生体重密度的尾部在体重增加聚类上灵活变化。