用于机器学习的高斯过程

Gaussian processes for machine learning.

作者信息

Seeger Matthias

机构信息

Department of EECS, University of California at Berkeley, 485 Soda Hall, Berkeley, CA 94720-1776, USA.

出版信息

Int J Neural Syst. 2004 Apr;14(2):69-106. doi: 10.1142/S0129065704001899.

DOI:10.1142/S0129065704001899

PMID:15112367

Abstract

Gaussian processes (GPs) are natural generalisations of multivariate Gaussian random variables to infinite (countably or continuous) index sets. GPs have been applied in a large number of fields to a diverse range of ends, and very many deep theoretical analyses of various properties are available. This paper gives an introduction to Gaussian processes on a fairly elementary level with special emphasis on characteristics relevant in machine learning. It draws explicit connections to branches such as spline smoothing models and support vector machines in which similar ideas have been investigated. Gaussian process models are routinely used to solve hard machine learning problems. They are attractive because of their flexible non-parametric nature and computational simplicity. Treated within a Bayesian framework, very powerful statistical methods can be implemented which offer valid estimates of uncertainties in our predictions and generic model selection procedures cast as nonlinear optimization problems. Their main drawback of heavy computational scaling has recently been alleviated by the introduction of generic sparse approximations.13,78,31 The mathematical literature on GPs is large and often uses deep concepts which are not required to fully understand most machine learning applications. In this tutorial paper, we aim to present characteristics of GPs relevant to machine learning and to show up precise connections to other "kernel machines" popular in the community. Our focus is on a simple presentation, but references to more detailed sources are provided.

摘要

高斯过程（GPs）是多元高斯随机变量到无限（可数或连续）索引集的自然推广。高斯过程已被应用于大量领域，以实现各种各样的目的，并且有许多关于其各种性质的深入理论分析。本文在相当基础的层面上介绍高斯过程，特别强调与机器学习相关的特征。它明确地与诸如样条平滑模型和支持向量机等分支建立联系，在这些分支中已经研究了类似的思想。高斯过程模型经常用于解决困难的机器学习问题。它们具有吸引力，因为其具有灵活的非参数性质和计算简单性。在贝叶斯框架内进行处理，可以实现非常强大的统计方法，这些方法能够对我们预测中的不确定性提供有效的估计，并将通用的模型选择过程转化为非线性优化问题。最近，通过引入通用的稀疏近似，它们计算量过大的主要缺点得到了缓解。关于高斯过程的数学文献很多，并且经常使用一些深奥的概念，而大多数机器学习应用并不需要完全理解这些概念。在本教程论文中，我们旨在介绍与机器学习相关的高斯过程的特征，并展示其与该领域中其他流行的“核机器”的确切联系。我们的重点是进行简单的阐述，但也会提供指向更详细资料来源的参考文献。

相似文献

Gaussian processes for machine learning.用于机器学习的高斯过程

Int J Neural Syst. 2004 Apr;14(2):69-106. doi: 10.1142/S0129065704001899.

Bayesian framework for least-squares support vector machine classifiers, gaussian processes, and kernel Fisher discriminant analysis.用于最小二乘支持向量机分类器、高斯过程和核Fisher判别分析的贝叶斯框架。

Neural Comput. 2002 May;14(5):1115-47. doi: 10.1162/089976602753633411.

Constructing Bayesian formulations of sparse kernel learning methods.构建稀疏核学习方法的贝叶斯公式。

Neural Netw. 2005 Jun-Jul;18(5-6):674-83. doi: 10.1016/j.neunet.2005.06.002.

Bayesian multitask classification with Gaussian process priors.具有高斯过程先验的贝叶斯多任务分类

IEEE Trans Neural Netw. 2011 Dec;22(12):2011-21. doi: 10.1109/TNN.2011.2168568. Epub 2011 Oct 10.

A practical approach to model selection for support vector machines with a Gaussian kernel.一种用于具有高斯核的支持向量机的模型选择实用方法。

IEEE Trans Syst Man Cybern B Cybern. 2011 Apr;41(2):330-40. doi: 10.1109/TSMCB.2010.2053026. Epub 2010 Aug 9.

Sparse kernel learning with LASSO and Bayesian inference algorithm.基于 LASSO 和贝叶斯推断算法的稀疏核学习。

Neural Netw. 2010 Mar;23(2):257-64. doi: 10.1016/j.neunet.2009.07.001. Epub 2009 Jul 9.

Penalized gaussian process regression and classification for high-dimensional nonlinear data.用于高维非线性数据的惩罚高斯过程回归与分类

Biometrics. 2011 Dec;67(4):1285-94. doi: 10.1111/j.1541-0420.2011.01576.x. Epub 2011 Mar 8.

Best harmony, unified RPCL and automated model selection for unsupervised and supervised learning on Gaussian mixtures, three-layer nets and ME-RBF-SVM models.高斯混合模型、三层网络和ME-RBF-SVM模型上无监督和监督学习的最佳协调、统一RPCL与自动模型选择。

Int J Neural Syst. 2001 Feb;11(1):43-69. doi: 10.1142/S0129065701000497.

Probabilistic machine learning and artificial intelligence.概率机器学习和人工智能。

Nature. 2015 May 28;521(7553):452-9. doi: 10.1038/nature14541.

Real-time model learning using Incremental Sparse Spectrum Gaussian Process Regression.使用增量稀疏谱高斯过程回归进行实时模型学习。

Neural Netw. 2013 May;41:59-69. doi: 10.1016/j.neunet.2012.08.011. Epub 2012 Sep 6.

引用本文的文献

Bio-inspired acoustic metamaterials for traffic noise control: bridging the gap with machine learning.用于交通噪声控制的生物启发式声学超材料：与机器学习接轨

Commun Eng. 2025 Jul 29;4(1):136. doi: 10.1038/s44172-025-00470-x.

Utilizing Circadian Heart Rate Variability Features and Machine Learning for Estimating Left Ventricular Ejection Fraction Levels in Hypertensive Patients: A Composite Multiscale Entropy Analysis.利用昼夜心率变异性特征和机器学习估计高血压患者左心室射血分数水平：一种复合多尺度熵分析

Biosensors (Basel). 2025 Jul 10;15(7):442. doi: 10.3390/bios15070442.

Following the robot's lead: Predicting human and robot movement from EEG in a motor learning HRI task.跟随机器人的引导：在运动学习人机交互任务中通过脑电图预测人类和机器人的运动

iScience. 2025 Jun 18;28(7):112914. doi: 10.1016/j.isci.2025.112914. eCollection 2025 Jul 18.

Advancing genetic engineering with active learning: theory, implementations and potential opportunities.通过主动学习推进基因工程：理论、实现与潜在机遇

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf286.

Predicting Quality of Life in People Living with HIV: A Machine Learning Model Integrating Multidimensional Determinants.预测艾滋病毒感染者的生活质量：一种整合多维决定因素的机器学习模型。

Health Qual Life Outcomes. 2025 Jul 4;23(1):68. doi: 10.1186/s12955-025-02398-4.

NKAPL suppresses NSCLC progression by enhancing the protein stability of TRIM21 and further inhibiting the NF-κB signaling pathway.NKAPL通过增强TRIM21的蛋白质稳定性并进一步抑制NF-κB信号通路来抑制非小细胞肺癌的进展。

Genes Dis. 2025 Mar 11;12(5):101598. doi: 10.1016/j.gendis.2025.101598. eCollection 2025 Sep.

Unlocking gene regulatory networks for crop resilience and sustainable agriculture.解锁作物抗逆性和可持续农业的基因调控网络。

Nat Biotechnol. 2025 Jul 2. doi: 10.1038/s41587-025-02727-4.

New insight into viscosity prediction of imidazolium-based ionic liquids and their mixtures with machine learning models.基于机器学习模型对咪唑基离子液体及其混合物粘度预测的新见解。

Sci Rep. 2025 Jul 2;15(1):22672. doi: 10.1038/s41598-025-08947-7.

Deciphering the performance of different surface models for corneal topography.解读不同角膜地形图表面模型的性能

Ophthalmic Physiol Opt. 2025 Sep;45(6):1270-1281. doi: 10.1111/opo.13539. Epub 2025 Jun 19.

Predicting high-fitness viral protein variants with Bayesian active learning and biophysics.利用贝叶斯主动学习和生物物理学预测高适应性病毒蛋白变体

Proc Natl Acad Sci U S A. 2025 Jun 17;122(24):e2503742122. doi: 10.1073/pnas.2503742122. Epub 2025 Jun 9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于机器学习的高斯过程

Gaussian processes for machine learning.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献