空间多注意条件神经过程。

Spatial multi-attention conditional neural processes.

机构信息

School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an Shaanxi, 710049, China.

出版信息

Neural Netw. 2024 May;173:106201. doi: 10.1016/j.neunet.2024.106201. Epub 2024 Feb 28.

DOI:10.1016/j.neunet.2024.106201

Abstract

Spatial prediction tasks are challenging when observed samples are sparse and prediction samples are abundant. Gaussian processes (GPs) are commonly used in spatial prediction tasks and have the advantage of measuring the uncertainty of the interpolation result. However, as the sample size increases, GPs suffer from significant overhead. Standard neural networks (NNs) provide a powerful and scalable solution for modeling spatial data, but they often overfit small sample data. Based on conditional neural processes (CNPs), which combine the advantages of GPs and NNs, we propose a new framework called Spatial Multi-Attention Conditional Neural Processes (SMACNPs) for spatial small sample prediction tasks. SMACNPs are a modular model that can predict targets by employing different attention mechanisms to extract relevant information from different forms of sample data. The task representation is inferred by measuring the spatial correlation contained in different sample points and the relationship contained in attribute variables, respectively. The distribution of the target variable is predicted by GPs parameterized by NNs. SMACNPs allow us to obtain accurate predictions of the target value while quantifying the prediction uncertainty. Experiments on spatial prediction tasks on simulated and real-world datasets demonstrate that this framework flexibly incorporates spatial context and correlation into the model, achieving state-of-the-art results in spatial small sample prediction tasks in terms of both predictive performance and reliability. For example, on the California housing dataset, our method reduces MAE by 8% and MSE by 7% compared to the second-best method. In addition, a spatiotemporal prediction task to forecast traffic speed further confirms the effectiveness and generality of our method.

摘要

当观测样本稀疏而预测样本丰富时，空间预测任务具有挑战性。高斯过程（Gaussian processes，简称 GPs）常用于空间预测任务，具有测量插值结果不确定性的优势。然而，随着样本数量的增加，GPs 会面临显著的开销。标准神经网络（neural networks，简称 NNs）为建模空间数据提供了强大且可扩展的解决方案，但它们经常对小样本数据过度拟合。基于条件神经过程（conditional neural processes，简称 CNP），结合了 GPs 和 NNs 的优点，我们提出了一种新的框架，称为空间多注意条件神经过程（Spatial Multi-Attention Conditional Neural Processes，简称 SMACNPs），用于空间小样本预测任务。SMACNPs 是一种模块化模型，可以通过采用不同的注意力机制从不同形式的样本数据中提取相关信息来预测目标。任务表示是通过分别测量不同样本点之间包含的空间相关性和属性变量之间包含的关系来推断的。目标变量的分布是由 NN 参数化的 GPs 预测的。SMACNPs 允许我们在量化预测不确定性的同时，对目标值进行准确预测。在模拟和真实世界数据集上的空间预测任务实验表明，该框架能够灵活地将空间上下文和相关性纳入模型，在空间小样本预测任务的预测性能和可靠性方面均达到了最新水平。例如，在加利福尼亚住房数据集上，与排名第二的方法相比，我们的方法将 MAE 降低了 8%，MSE 降低了 7%。此外，对交通速度进行时空预测的任务进一步证实了我们方法的有效性和通用性。

相似文献

Spatial multi-attention conditional neural processes.空间多注意条件神经过程。

Neural Netw. 2024 May;173:106201. doi: 10.1016/j.neunet.2024.106201. Epub 2024 Feb 28.

Robust Traffic Prediction From Spatial-Temporal Data Based on Conditional Distribution Learning.基于条件分布学习的时空数据稳健交通预测。

IEEE Trans Cybern. 2022 Dec;52(12):13458-13471. doi: 10.1109/TCYB.2021.3131285. Epub 2022 Nov 18.

Multicomponent Spatial-Temporal Graph Attention Convolution Networks for Traffic Prediction with Spatially Sparse Data.具有空间稀疏数据的交通预测的多分量时空图注意卷积网络。

Comput Intell Neurosci. 2021 Dec 23;2021:9134942. doi: 10.1155/2021/9134942. eCollection 2021.

Uncertainty propagation for dropout-based Bayesian neural networks.基于 dropout 的贝叶斯神经网络的不确定性传播。

Neural Netw. 2021 Dec;144:394-406. doi: 10.1016/j.neunet.2021.09.005. Epub 2021 Sep 9.

Medical multivariate time series imputation and forecasting based on a recurrent conditional Wasserstein GAN and attention.基于循环条件瓦瑟斯坦生成对抗网络和注意力机制的医学多变量时间序列插补与预测

J Biomed Inform. 2023 Mar;139:104320. doi: 10.1016/j.jbi.2023.104320. Epub 2023 Feb 13.

Deep Kernel learning for reaction outcome prediction and optimization.用于反应结果预测与优化的深度核学习

Commun Chem. 2024 Jun 14;7(1):136. doi: 10.1038/s42004-024-01219-x.

Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.美国东部地区遥感气溶胶光学厚度与PM2.5之间关系的评估及统计建模

Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.

Uncertainty-Gated Stochastic Sequential Model for EHR Mortality Prediction.基于不确定性门控的电子病历死亡率预测随机序贯模型。

IEEE Trans Neural Netw Learn Syst. 2021 Sep;32(9):4052-4062. doi: 10.1109/TNNLS.2020.3016670. Epub 2021 Aug 31.

Estimation with Uncertainty via Conditional Generative Adversarial Networks.基于条件生成对抗网络的不确定性估计。

Sensors (Basel). 2021 Sep 15;21(18):6194. doi: 10.3390/s21186194.

DM-CNN: Dynamic Multi-scale Convolutional Neural Network with uncertainty quantification for medical image classification.DM-CNN：具有不确定性量化的动态多尺度卷积神经网络，用于医学图像分类。

Comput Biol Med. 2024 Jan;168:107758. doi: 10.1016/j.compbiomed.2023.107758. Epub 2023 Nov 29.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

空间多注意条件神经过程。

Spatial multi-attention conditional neural processes.

机构信息

出版信息

相似文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献