Zhu Bin, Dunson David B
Tenure-Track Principal Investigator, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20852.
Arts & Sciences Distinguished Professor, Department of Statistical Science, Duke University, Durham, NC 27708.
J Am Stat Assoc. 2013;108(504). doi: 10.1080/01621459.2013.838568.
We propose a nested Gaussian process (nGP) as a locally adaptive prior for Bayesian nonparametric regression. Specified through a set of stochastic differential equations (SDEs), the nGP imposes a Gaussian process prior for the function's th-order derivative. The nesting comes in through including a local instantaneous mean function, which is drawn from another Gaussian process inducing adaptivity to locally-varying smoothness. We discuss the support of the nGP prior in terms of the closure of a reproducing kernel Hilbert space, and consider theoretical properties of the posterior. The posterior mean under the nGP prior is shown to be equivalent to the minimizer of a nested penalized sum-of-squares involving penalties for both the global and local roughness of the function. Using highly-efficient Markov chain Monte Carlo for posterior inference, the proposed method performs well in simulation studies compared to several alternatives, and is scalable to massive data, illustrated through a proteomics application.
我们提出一种嵌套高斯过程(nGP)作为贝叶斯非参数回归的局部自适应先验。通过一组随机微分方程(SDE)指定,nGP对函数的 阶导数施加高斯过程先验。嵌套是通过包含一个局部瞬时均值函数实现的,该函数来自另一个高斯过程,从而对局部变化的平滑度产生适应性。我们根据再生核希尔伯特空间的闭包讨论 nGP 先验的支撑,并考虑后验的理论性质。结果表明,在 nGP 先验下的后验均值等同于一个嵌套惩罚平方和的极小值,该平方和涉及对函数全局和局部粗糙度的惩罚。通过使用高效的马尔可夫链蒙特卡罗进行后验推断,与几种替代方法相比,所提出的方法在模拟研究中表现良好,并且可扩展到海量数据,蛋白质组学应用对此进行了说明。