Suppr超能文献

用于混合数据的贝叶斯高斯Copula因子模型

Bayesian Gaussian Copula Factor Models for Mixed Data.

作者信息

Murray Jared S, Dunson David B, Carin Lawrence, Lucas Joseph E

机构信息

Dept. of Statistical Science, Duke University, Durham, NC 27708 (

出版信息

J Am Stat Assoc. 2013 Jun 1;108(502):656-665. doi: 10.1080/01621459.2012.762328.

Abstract

Gaussian factor models have proven widely useful for parsimoniously characterizing dependence in multivariate data. There is a rich literature on their extension to mixed categorical and continuous variables, using latent Gaussian variables or through generalized latent trait models acommodating measurements in the exponential family. However, when generalizing to non-Gaussian measured variables the latent variables typically influence both the dependence structure and the form of the marginal distributions, complicating interpretation and introducing artifacts. To address this problem we propose a novel class of Bayesian Gaussian copula factor models which decouple the latent factors from the marginal distributions. A semiparametric specification for the marginals based on the extended rank likelihood yields straightforward implementation and substantial computational gains. We provide new theoretical and empirical justifications for using this likelihood in Bayesian inference. We propose new default priors for the factor loadings and develop efficient parameter-expanded Gibbs sampling for posterior computation. The methods are evaluated through simulations and applied to a dataset in political science. The models in this paper are implemented in the R package bfa.

摘要

高斯因子模型已被证明在简洁地刻画多元数据中的相关性方面非常有用。关于将其扩展到混合分类变量和连续变量,有大量文献,这些文献使用潜在高斯变量或通过广义潜在特质模型来处理指数族中的测量值。然而,当推广到非高斯测量变量时,潜在变量通常会同时影响依赖结构和边际分布的形式,这使得解释变得复杂并引入了人为因素。为了解决这个问题,我们提出了一类新颖的贝叶斯高斯copula因子模型,该模型将潜在因子与边际分布解耦。基于扩展秩似然的边际半参数规范实现起来很直接,并且在计算上有很大的优势。我们为在贝叶斯推断中使用这种似然提供了新的理论和实证依据。我们为因子载荷提出了新的默认先验,并开发了用于后验计算的高效参数扩展吉布斯抽样。通过模拟对这些方法进行了评估,并将其应用于一个政治学数据集。本文中的模型在R包bfa中实现。

相似文献

1
Bayesian Gaussian Copula Factor Models for Mixed Data.用于混合数据的贝叶斯高斯Copula因子模型
J Am Stat Assoc. 2013 Jun 1;108(502):656-665. doi: 10.1080/01621459.2012.762328.
2
Information bounds for Gaussian copulas.高斯Copula的信息界。
Bernoulli (Andover). 2014;20(2):604-622. doi: 10.3150/12-BEJ499.
3
4
Simplex Factor Models for Multivariate Unordered Categorical Data.多元无序分类数据的单纯形因子模型
J Am Stat Assoc. 2012 Mar 1;107(497):362-377. doi: 10.1080/01621459.2011.646934.
5
Bayesian Variable Selection for Gaussian copula regression models.高斯Copula回归模型的贝叶斯变量选择
J Comput Graph Stat. 2020 Dec 10;30(3):578-593. doi: 10.1080/10618600.2020.1840997.
8
Fast computation of latent correlations.潜在相关性的快速计算。
J Comput Graph Stat. 2021;30(4):1249-1256. doi: 10.1080/10618600.2021.1882468. Epub 2021 Mar 29.

引用本文的文献

3
Bayesian Variable Selection for Gaussian copula regression models.高斯Copula回归模型的贝叶斯变量选择
J Comput Graph Stat. 2020 Dec 10;30(3):578-593. doi: 10.1080/10618600.2020.1840997.
4
Generalized infinite factorization models.广义无限分解模型。
Biometrika. 2022 Sep;109(3):817-835. doi: 10.1093/biomet/asab056. Epub 2022 Jan 19.
5
Fast Moment Estimation for Generalized Latent Dirichlet Models.广义潜在狄利克雷模型的快速矩估计
J Am Stat Assoc. 2018;113(524):1528-1540. doi: 10.1080/01621459.2017.1341839. Epub 2018 Nov 13.
6
Bayesian Factor Analysis for Inference on Interactions.用于交互作用推断的贝叶斯因子分析
J Am Stat Assoc. 2021;116(535):1521-1532. doi: 10.1080/01621459.2020.1745813. Epub 2020 Apr 20.
7
A statistical model for describing and simulating microbial community profiles.用于描述和模拟微生物群落分布的统计模型。
PLoS Comput Biol. 2021 Sep 13;17(9):e1008913. doi: 10.1371/journal.pcbi.1008913. eCollection 2021 Sep.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验