Fakultät Statistik, Technische Universität Dortmund, Dortmund, Germany.
Biostatistics. 2010 Jul;11(3):419-31. doi: 10.1093/biostatistics/kxq001. Epub 2010 Feb 11.
In various application areas, prior information is available about the direction of the effects of multiple predictors on the conditional response distribution. For example, in epidemiology studies of potentially adverse exposures and continuous health responses, one can typically assume a priori that increasing the level of an exposure does not lead to an improvement in the health response. Such an assumption can be formalized through a stochastic ordering assumption in each of the exposures, leading to a potentially large improvement in efficiency in nonparametric modeling of the conditional response distribution. This article proposes a Bayesian nonparametric approach to this problem based on characterizing the conditional response density as a Gaussian mixture, with the locations of the Gaussian means varying flexibly with predictors subject to minimal constraints to ensure stochastic ordering. Theoretical properties are considered and Markov chain Monte Carlo methods are developed for posterior computation. The methods are illustrated using simulation examples and a reproductive epidemiology application.
在各种应用领域中,通常可以获得关于多个预测变量对条件响应分布的影响方向的先验信息。例如,在潜在不良暴露和连续健康反应的流行病学研究中,人们通常可以先验地假设增加暴露水平不会导致健康反应的改善。这种假设可以通过在每个暴露中进行随机排序假设来形式化,从而可以有效地提高非参数建模条件响应分布的效率。本文提出了一种基于条件响应密度作为高斯混合的贝叶斯非参数方法,其中高斯均值的位置随预测变量灵活变化,受到最小约束以确保随机排序。考虑了理论性质,并开发了用于后验计算的马尔可夫链蒙特卡罗方法。该方法通过模拟示例和生殖流行病学应用进行了说明。