Saad Feras, Burnim Jacob, Carroll Colin, Patton Brian, Köster Urs, A Saurous Rif, Hoffman Matthew
Computer Science Department, Carnegie Mellon University, Pittsburgh, PA, USA.
Google Research, Mountain View, CA, USA.
Nat Commun. 2024 Sep 11;15(1):7942. doi: 10.1038/s41467-024-51477-5.
Spatiotemporal datasets, which consist of spatially-referenced time series, are ubiquitous in diverse applications, such as air pollution monitoring, disease tracking, and cloud-demand forecasting. As the scale of modern datasets increases, there is a growing need for statistical methods that are flexible enough to capture complex spatiotemporal dynamics and scalable enough to handle many observations. This article introduces the Bayesian Neural Field (BAYESNF), a domain-general statistical model that infers rich spatiotemporal probability distributions for data-analysis tasks including forecasting, interpolation, and variography. BAYESNF integrates a deep neural network architecture for high-capacity function estimation with hierarchical Bayesian inference for robust predictive uncertainty quantification. Evaluations against prominent baselines show that BAYESNF delivers improvements on prediction problems from climate and public health data containing tens to hundreds of thousands of measurements. Accompanying the paper is an open-source software package ( https://github.com/google/bayesnf ) that runs on GPU and TPU accelerators through the JAX machine learning platform.
时空数据集由空间参考时间序列组成,在空气污染监测、疾病跟踪和云需求预测等各种应用中无处不在。随着现代数据集规模的增加,对统计方法的需求也日益增长,这些方法需要足够灵活以捕捉复杂的时空动态,并且具有足够的可扩展性以处理大量观测数据。本文介绍了贝叶斯神经场(BAYESNF),这是一种通用领域的统计模型,可推断用于包括预测、插值和变异函数分析等数据分析任务的丰富时空概率分布。BAYESNF将用于高容量函数估计的深度神经网络架构与用于稳健预测不确定性量化的分层贝叶斯推理相结合。与突出基线的评估表明,BAYESNF在包含数万到数十万测量值的气候和公共卫生数据的预测问题上有改进。本文还附带了一个开源软件包(https://github.com/google/bayesnf),该软件包通过JAX机器学习平台在GPU和TPU加速器上运行。