School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an Shaanxi, 710049, China.
School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an Shaanxi, 710049, China.
Neural Netw. 2024 May;173:106201. doi: 10.1016/j.neunet.2024.106201. Epub 2024 Feb 28.
Spatial prediction tasks are challenging when observed samples are sparse and prediction samples are abundant. Gaussian processes (GPs) are commonly used in spatial prediction tasks and have the advantage of measuring the uncertainty of the interpolation result. However, as the sample size increases, GPs suffer from significant overhead. Standard neural networks (NNs) provide a powerful and scalable solution for modeling spatial data, but they often overfit small sample data. Based on conditional neural processes (CNPs), which combine the advantages of GPs and NNs, we propose a new framework called Spatial Multi-Attention Conditional Neural Processes (SMACNPs) for spatial small sample prediction tasks. SMACNPs are a modular model that can predict targets by employing different attention mechanisms to extract relevant information from different forms of sample data. The task representation is inferred by measuring the spatial correlation contained in different sample points and the relationship contained in attribute variables, respectively. The distribution of the target variable is predicted by GPs parameterized by NNs. SMACNPs allow us to obtain accurate predictions of the target value while quantifying the prediction uncertainty. Experiments on spatial prediction tasks on simulated and real-world datasets demonstrate that this framework flexibly incorporates spatial context and correlation into the model, achieving state-of-the-art results in spatial small sample prediction tasks in terms of both predictive performance and reliability. For example, on the California housing dataset, our method reduces MAE by 8% and MSE by 7% compared to the second-best method. In addition, a spatiotemporal prediction task to forecast traffic speed further confirms the effectiveness and generality of our method.
当观测样本稀疏而预测样本丰富时,空间预测任务具有挑战性。高斯过程(Gaussian processes,简称 GPs)常用于空间预测任务,具有测量插值结果不确定性的优势。然而,随着样本数量的增加,GPs 会面临显著的开销。标准神经网络(neural networks,简称 NNs)为建模空间数据提供了强大且可扩展的解决方案,但它们经常对小样本数据过度拟合。基于条件神经过程(conditional neural processes,简称 CNP),结合了 GPs 和 NNs 的优点,我们提出了一种新的框架,称为空间多注意条件神经过程(Spatial Multi-Attention Conditional Neural Processes,简称 SMACNPs),用于空间小样本预测任务。SMACNPs 是一种模块化模型,可以通过采用不同的注意力机制从不同形式的样本数据中提取相关信息来预测目标。任务表示是通过分别测量不同样本点之间包含的空间相关性和属性变量之间包含的关系来推断的。目标变量的分布是由 NN 参数化的 GPs 预测的。SMACNPs 允许我们在量化预测不确定性的同时,对目标值进行准确预测。在模拟和真实世界数据集上的空间预测任务实验表明,该框架能够灵活地将空间上下文和相关性纳入模型,在空间小样本预测任务的预测性能和可靠性方面均达到了最新水平。例如,在加利福尼亚住房数据集上,与排名第二的方法相比,我们的方法将 MAE 降低了 8%,MSE 降低了 7%。此外,对交通速度进行时空预测的任务进一步证实了我们方法的有效性和通用性。