Locally Adaptive Bayes Nonparametric Regression via Nested Gaussian Processes

Authors

Zhu Bin, Dunson David B

Affiliations

Tenure-Track Principal Investigator, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD 20852.

Arts & Sciences Distinguished Professor, Department of Statistical Science, Duke University, Durham, NC 27708.

Publication information

J Am Stat Assoc. 2013;108(504). doi: 10.1080/01621459.2013.838568.

DOI:10.1080/01621459.2013.838568

PMID:25328260

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4196220/

Abstract

We propose a nested Gaussian process (nGP) as a locally adaptive prior for Bayesian nonparametric regression. Specified through a set of stochastic differential equations (SDEs), the nGP imposes a Gaussian process prior for the function's mth-order derivative. The nesting comes in through including a local instantaneous mean function, which is drawn from another Gaussian process, inducing adaptivity to locally varying smoothness. We discuss the support of the nGP prior in terms of the closure of a reproducing kernel Hilbert space, and consider theoretical properties of the posterior. The posterior mean under the nGP prior is shown to be equivalent to the minimizer of a nested penalized sum-of-squares involving penalties for both the global and local roughness of the function. Using highly efficient Markov chain Monte Carlo for posterior inference, the proposed method performs well in simulation studies compared to several alternatives, and is scalable to massive data, as illustrated through a proteomics application.
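To make the nesting idea concrete, the following is a minimal sketch of one prior draw, not the authors' implementation. The abstract does not state the SDEs, so the system used here is an assumption: the function's second derivative equals a local mean A(t) plus white noise, and A(t) is itself a Brownian motion (derivative orders 2 and 1). The function name `simulate_ngp_prior`, the parameters `sigma_u` and `sigma_a`, and the Euler-Maruyama discretization are all illustrative choices.

```python
import numpy as np


def simulate_ngp_prior(n_steps=500, t_max=1.0, sigma_u=1.0, sigma_a=5.0, seed=0):
    """Euler-Maruyama draw from an assumed nested-GP-style SDE system.

    Assumed (hypothetical) system, all states started at zero:
        dU  = DU dt
        dDU = A dt + sigma_u dW_u   # second derivative of U has local mean A
        dA  = sigma_a dW_a          # local mean A is itself a Gaussian process
    """
    rng = np.random.default_rng(seed)
    dt = t_max / n_steps
    t = np.linspace(0.0, t_max, n_steps + 1)

    u = np.zeros(n_steps + 1)   # the regression function U(t)
    du = np.zeros(n_steps + 1)  # its first derivative
    a = np.zeros(n_steps + 1)   # the local instantaneous mean A(t)

    for k in range(n_steps):
        # Independent Brownian increments, each with variance dt.
        dw_u, dw_a = rng.normal(0.0, np.sqrt(dt), size=2)
        u[k + 1] = u[k] + du[k] * dt
        du[k + 1] = du[k] + a[k] * dt + sigma_u * dw_u
        a[k + 1] = a[k] + sigma_a * dw_a

    return t, u, du, a


if __name__ == "__main__":
    t, u, du, a = simulate_ngp_prior()
    print(u[-1], a[-1])
```

In this sketch, a larger `sigma_a` lets the local mean wander more, so the curvature of the sampled function varies more strongly across `t`; with `sigma_a = 0` the local mean stays at zero and the draw reduces to a twice-integrated white noise path.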

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8129/4196220/d7886b7ad634/nihms529826f1.jpg
