• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于大数据贝叶斯回归的空间多元树

Spatial Multivariate Trees for Big Data Bayesian Regression.

作者信息

Peruzzi Michele, Dunson David B

机构信息

Department of Statistical Science, Duke University, Durham, NC 27708-0251, USA.

出版信息

J Mach Learn Res. 2022;23.

PMID:35891979
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9311452/
Abstract

High resolution geospatial data are challenging because standard geostatistical models based on Gaussian processes are known to not scale to large data sizes. While progress has been made towards methods that can be computed more efficiently, considerably less attention has been devoted to methods for large scale data that allow the description of complex relationships between several outcomes recorded at high resolutions by different sensors. Our Bayesian multivariate regression models based on spatial multivariate trees (SpamTrees) achieve scalability via conditional independence assumptions on latent random effects following a treed directed acyclic graph. Information-theoretic arguments and considerations on computational efficiency guide the construction of the tree and the related efficient sampling algorithms in imbalanced multivariate settings. In addition to simulated data examples, we illustrate SpamTrees using a large climate data set which combines satellite data with land-based station data. Software and source code are available on CRAN at https://CRAN.R-project.org/package=spamtree.

摘要

高分辨率地理空间数据具有挑战性,因为基于高斯过程的标准地理统计模型已知无法扩展到大数据规模。虽然在可更高效计算的方法方面已取得进展,但对于大规模数据的方法关注较少,这些方法要能描述由不同传感器在高分辨率下记录的多个结果之间的复杂关系。我们基于空间多元树(SpamTrees)的贝叶斯多元回归模型通过在遵循树形有向无环图的潜在随机效应上的条件独立性假设来实现可扩展性。信息论观点和对计算效率的考虑指导了树的构建以及在不平衡多元设置中的相关高效采样算法。除了模拟数据示例外,我们还使用一个将卫星数据与地面站数据相结合的大型气候数据集来说明SpamTrees。软件和源代码可在CRAN上获取,网址为https://CRAN.R-project.org/package=spamtree。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/74b59c87565b/nihms-1815550-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/1b9a4a216b4f/nihms-1815550-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/b8847447dbfd/nihms-1815550-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/5b0c50022b73/nihms-1815550-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/1642bf394e92/nihms-1815550-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/94c48f124620/nihms-1815550-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/2d6f784a4366/nihms-1815550-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/0bc36dee50ae/nihms-1815550-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/74b59c87565b/nihms-1815550-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/1b9a4a216b4f/nihms-1815550-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/b8847447dbfd/nihms-1815550-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/5b0c50022b73/nihms-1815550-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/1642bf394e92/nihms-1815550-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/94c48f124620/nihms-1815550-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/2d6f784a4366/nihms-1815550-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/0bc36dee50ae/nihms-1815550-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/de3e/9311452/74b59c87565b/nihms-1815550-f0013.jpg

相似文献

1
Spatial Multivariate Trees for Big Data Bayesian Regression.用于大数据贝叶斯回归的空间多元树
J Mach Learn Res. 2022;23.
2
Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains.通过分区域上的网格化高斯过程实现的高度可扩展贝叶斯地理统计建模
J Am Stat Assoc. 2022;117(538):969-982. doi: 10.1080/01621459.2020.1833889. Epub 2020 Nov 24.
3
An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees.一种用于在系统发育树的贝叶斯马尔可夫链蒙特卡罗采样中更新分支的高效独立采样器。
Syst Biol. 2016 Jan;65(1):161-76. doi: 10.1093/sysbio/syv051. Epub 2015 Jul 30.
4
Bayesian spatial modelling of geostatistical data using INLA and SPDE methods: A case study predicting malaria risk in Mozambique.使用INLA和SPDE方法对地质统计数据进行贝叶斯空间建模:以预测莫桑比克疟疾风险为例的研究。
Spat Spatiotemporal Epidemiol. 2021 Nov;39:100440. doi: 10.1016/j.sste.2021.100440. Epub 2021 Aug 3.
5
parallelMCMCcombine: an R package for bayesian methods for big data and analytics.parallelMCMCcombine:一个用于大数据和分析的贝叶斯方法的R包。
PLoS One. 2014 Sep 26;9(9):e108425. doi: 10.1371/journal.pone.0108425. eCollection 2014.
6
Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets.用于大型地理统计数据集的分层最近邻高斯过程模型。
J Am Stat Assoc. 2016;111(514):800-812. doi: 10.1080/01621459.2015.1044091. Epub 2016 Aug 18.
7
spBayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models.spBayes:一个用于单变量和多变量分层点参照空间模型的R软件包。
J Stat Softw. 2007 Apr;19(4):1-24. doi: 10.18637/jss.v019.i04.
8
Semiparametric Bayesian latent variable regression for skewed multivariate data.用于偏态多元数据的半参数贝叶斯潜变量回归
Biometrics. 2019 Jun;75(2):528-538. doi: 10.1111/biom.12989. Epub 2019 Mar 29.
9
A Gibbs Sampler for Learning DAGs.一种用于学习有向无环图的吉布斯采样器。
J Mach Learn Res. 2016 Apr;17(30):1-39.
10
Bayesian Modeling and Analysis of Geostatistical Data.贝叶斯地理统计数据建模与分析
Annu Rev Stat Appl. 2017 Mar;4:245-266. doi: 10.1146/annurev-statistics-060116-054155. Epub 2016 Nov 28.

引用本文的文献

1
Spatial meshing for general Bayesian multivariate models.一般贝叶斯多元模型的空间网格划分
J Mach Learn Res. 2024 Mar;25.
2
Radial Neighbors for Provably Accurate Scalable Approximations of Gaussian Processes.用于高斯过程可证精确可扩展近似的径向邻域
Biometrika. 2024 Dec;111(4):1151-1167. doi: 10.1093/biomet/asae029. Epub 2024 Jun 14.
3
Powering Research through Innovative Methods for Mixtures in Epidemiology (PRIME) Program: Novel and Expanded Statistical Methods.通过创新方法研究混合物在流行病学中的应用(PRIME)计划:新颖和扩展的统计方法。

本文引用的文献

1
Fast increased fidelity samplers for approximate Bayesian Gaussian process regression.用于近似贝叶斯高斯过程回归的快速提高保真度采样器。
J R Stat Soc Series B Stat Methodol. 2022 Sep;84(4):1198-1228. doi: 10.1111/rssb.12494. Epub 2022 Apr 19.
2
Highly Scalable Bayesian Geostatistical Modeling via Meshed Gaussian Processes on Partitioned Domains.通过分区域上的网格化高斯过程实现的高度可扩展贝叶斯地理统计建模
J Am Stat Assoc. 2022;117(538):969-982. doi: 10.1080/01621459.2020.1833889. Epub 2020 Nov 24.
3
Modeling Massive Spatial Datasets Using a Conjugate Bayesian Linear Modeling Framework.
Int J Environ Res Public Health. 2022 Jan 26;19(3):1378. doi: 10.3390/ijerph19031378.
使用共轭贝叶斯线性建模框架对大规模空间数据集进行建模。
Spat Stat. 2020 Jun;37. doi: 10.1016/j.spasta.2020.100417. Epub 2020 Feb 7.
4
Efficient algorithms for Bayesian Nearest Neighbor Gaussian Processes.用于贝叶斯最近邻高斯过程的高效算法。
J Comput Graph Stat. 2019;28(2):401-414. doi: 10.1080/10618600.2018.1537924. Epub 2019 Apr 1.
5
A Case Study Competition Among Methods for Analyzing Large Spatial Data.大型空间数据分析方法的案例研究竞赛
J Agric Biol Environ Stat. 2019;24(3):398-425. doi: 10.1007/s13253-018-00348-w. Epub 2018 Dec 14.
6
Permutation and Grouping Methods for Sharpening Gaussian Process Approximations.用于锐化高斯过程近似的排列与分组方法
Technometrics. 2018;60(4):415-429. doi: 10.1080/00401706.2018.1437476. Epub 2018 Jun 18.
7
Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geostatistical Datasets.用于大型地理统计数据集的分层最近邻高斯过程模型。
J Am Stat Assoc. 2016;111(514):800-812. doi: 10.1080/01621459.2015.1044091. Epub 2016 Aug 18.
8
NONSEPARABLE DYNAMIC NEAREST NEIGHBOR GAUSSIAN PROCESS MODELS FOR LARGE SPATIO-TEMPORAL DATA WITH AN APPLICATION TO PARTICULATE MATTER ANALYSIS.用于大时空数据的不可分离动态最近邻高斯过程模型及其在颗粒物分析中的应用
Ann Appl Stat. 2016 Sep;10(3):1286-1316. doi: 10.1214/16-AOAS931. Epub 2016 Sep 28.
9
High-Dimensional Bayesian Geostatistics.高维贝叶斯地质统计学
Bayesian Anal. 2017 Jun;12(2):583-614. doi: 10.1214/17-BA1056R. Epub 2017 May 16.
10
Fast Direct Methods for Gaussian Processes.快速直接高斯过程方法。
IEEE Trans Pattern Anal Mach Intell. 2016 Feb;38(2):252-65. doi: 10.1109/TPAMI.2015.2448083.