Canale Antonio, Durante Daniele, Dunson David B
Department of Statistical Sciences, University of Padua, Padua, Italy.
Department of Decision Sciences, Bocconi University and Bocconi Institute for Data Science and Analytics, Milan, Italy.
Biometrics. 2018 Dec;74(4):1331-1340. doi: 10.1111/biom.12917. Epub 2018 Jun 12.
There is wide interest in studying how the distribution of a continuous response changes with a predictor. We are motivated by environmental applications in which the predictor is the dose of an exposure and the response is a health outcome. A main focus in these studies is inference on dose levels associated with a given increase in risk relative to a baseline. In addressing this goal, popular methods either dichotomize the continuous response or focus on modeling changes with the dose in the expectation of the outcome. Such choices may lead to information loss and provide inaccurate inference on dose-response relationships. We instead propose a Bayesian convex mixture regression model that allows the entire distribution of the health outcome to be unknown and changing with the dose. To balance flexibility and parsimony, we rely on a mixture model for the density at the extreme doses, and express the conditional density at each intermediate dose via a convex combination of these extremal densities. This representation generalizes classical dose-response models for quantitative outcomes, and provides a more parsimonious, but still powerful, formulation compared to nonparametric methods, thereby improving interpretability and efficiency in inference on risk functions. A Markov chain Monte Carlo algorithm for posterior inference is developed, and the benefits of our methods are outlined in simulations, along with a study on the impact of dde exposure on gestational age.
研究连续响应的分布如何随预测变量变化引起了广泛关注。我们的动机来自于环境应用,其中预测变量是暴露剂量,响应是健康结果。这些研究的一个主要重点是推断相对于基线风险给定增加所对应的剂量水平。在实现这一目标时,常用方法要么将连续响应二分,要么专注于对结果期望中随剂量的变化进行建模。这些选择可能导致信息丢失,并对剂量 - 反应关系提供不准确的推断。相反,我们提出了一种贝叶斯凸混合回归模型,该模型允许健康结果的整个分布未知且随剂量变化。为了平衡灵活性和简约性,我们对极端剂量处的密度使用混合模型,并通过这些极端密度的凸组合来表示每个中间剂量处的条件密度。这种表示推广了用于定量结果的经典剂量 - 反应模型,并且与非参数方法相比提供了一种更简约但仍然强大的公式,从而提高了风险函数推断的可解释性和效率。开发了一种用于后验推断的马尔可夫链蒙特卡罗算法,并在模拟中概述了我们方法的优点,以及一项关于滴滴涕暴露对孕周影响的研究。