Ton Jean-Francois, Flaxman Seth, Sejdinovic Dino, Bhatt Samir
Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.
Department of Mathematics and Data Science Institute, Imperial College London, London, SW7 2AZ, UK.
Spat Stat. 2018 Dec;28:59-78. doi: 10.1016/j.spasta.2018.02.002.
The use of covariance kernels is ubiquitous in the field of spatial statistics. Kernels allow data to be mapped into high-dimensional feature spaces and can thus extend simple linear additive methods to nonlinear methods with higher order interactions. However, until recently, there has been a strong reliance on a limited class of stationary kernels such as the Matérn or squared exponential, limiting the expressiveness of these modelling approaches. Recent machine learning research has focused on spectral representations to model arbitrary stationary kernels and introduced more general representations that include classes of nonstationary kernels. In this paper, we exploit the connections between Fourier feature representations, Gaussian processes and neural networks to generalise previous approaches and develop a simple and efficient framework to learn arbitrarily complex nonstationary kernel functions directly from the data, while taking care to avoid overfitting using state-of-the-art methods from deep learning. We highlight the very broad array of kernel classes that could be created within this framework. We apply this to a time series dataset and a remote sensing problem involving land surface temperature in Eastern Africa. We show that without increasing the computational or storage complexity, nonstationary kernels can be used to improve generalisation performance and provide more interpretable results.
协方差核在空间统计领域中无处不在。核函数允许将数据映射到高维特征空间,从而能够将简单的线性加法方法扩展到具有高阶相互作用的非线性方法。然而,直到最近,人们一直严重依赖于有限类别的平稳核,如马特恩核或平方指数核,这限制了这些建模方法的表现力。最近的机器学习研究集中在频谱表示上,以对任意平稳核进行建模,并引入了更通用的表示形式,其中包括非平稳核类别。在本文中,我们利用傅里叶特征表示、高斯过程和神经网络之间的联系,对先前的方法进行推广,并开发了一个简单高效的框架,直接从数据中学习任意复杂的非平稳核函数,同时注意使用深度学习的最新方法避免过拟合。我们强调了在此框架内可以创建的非常广泛的核类别。我们将此应用于一个时间序列数据集和一个涉及东非地表温度的遥感问题。我们表明,在不增加计算或存储复杂度的情况下,非平稳核可用于提高泛化性能并提供更具可解释性的结果。