Suppr超能文献

高维多元二元响应的贝叶斯推断。

Bayesian inference on high-dimensional multivariate binary responses.

作者信息

Chakraborty Antik, Ou Rihui, Dunson David B

机构信息

Department of Statistics, Purdue University.

Department of Statistical Science, Duke University.

出版信息

J Am Stat Assoc. 2024;119(548):2560-2571. doi: 10.1080/01621459.2023.2260053. Epub 2023 Nov 9.

Abstract

It has become increasingly common to collect high-dimensional binary response data; for example, with the emergence of new sampling techniques in ecology. In smaller dimensions, multivariate probit (MVP) models are routinely used for inferences. However, algorithms for fitting such models face issues in scaling up to high dimensions due to the intractability of the likelihood, involving an integral over a multivariate normal distribution having no analytic form. Although a variety of algorithms have been proposed to approximate this intractable integral, these approaches are difficult to implement and/or inaccurate in high dimensions. Our main focus is in accommodating high-dimensional binary response data with a small-to-moderate number of covariates. We propose a two-stage approach for inference on model parameters while taking care of uncertainty propagation between the stages. We use the special structure of latent Gaussian models to reduce the highly expensive computation involved in joint parameter estimation to focus inference on marginal distributions of model parameters. This essentially makes the method embarrassingly parallel for both stages. We illustrate performance in simulations and applications to joint species distribution modeling in ecology.

摘要

收集高维二元响应数据变得越来越普遍;例如,随着生态学中新采样技术的出现。在较小维度中,多变量概率单位(MVP)模型通常用于进行推断。然而,由于似然性难以处理,涉及对没有解析形式的多元正态分布进行积分,用于拟合此类模型的算法在扩展到高维度时面临问题。尽管已经提出了各种算法来近似这个难以处理的积分,但这些方法在高维度中难以实现和/或不准确。我们的主要重点是处理具有少量到中等数量协变量的高维二元响应数据。我们提出了一种两阶段方法来推断模型参数,同时处理阶段之间的不确定性传播。我们利用潜在高斯模型的特殊结构来减少联合参数估计中涉及的高成本计算,将推断重点放在模型参数的边际分布上。这本质上使该方法在两个阶段都易于并行处理。我们在模拟和生态学中联合物种分布建模的应用中展示了性能。

相似文献

1
Bayesian inference on high-dimensional multivariate binary responses.高维多元二元响应的贝叶斯推断。
J Am Stat Assoc. 2024;119(548):2560-2571. doi: 10.1080/01621459.2023.2260053. Epub 2023 Nov 9.

本文引用的文献

1
SiGMoiD: A super-statistical generative model for binary data.SiGMoiD:一种用于二值数据的超统计生成模型。
PLoS Comput Biol. 2021 Aug 6;17(8):e1009275. doi: 10.1371/journal.pcbi.1009275. eCollection 2021 Aug.
4
So Many Variables: Joint Modeling in Community Ecology.如此多的变量:群落生态学中的联合建模。
Trends Ecol Evol. 2015 Dec;30(12):766-779. doi: 10.1016/j.tree.2015.09.007. Epub 2015 Oct 28.
5
SPARSE LOGISTIC PRINCIPAL COMPONENTS ANALYSIS FOR BINARY DATA.二元数据的稀疏逻辑主成分分析
Ann Appl Stat. 2010 Sep 1;4(3):1579-1601. doi: 10.1214/10-AOAS327SUPP.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验