• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于异构数据的协变量辅助贝叶斯图学习

Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data.

作者信息

Niu Yabo, Ni Yang, Pati Debdeep, Mallick Bani K

机构信息

Department of Mathematics, University of Houston.

Department of Statistics, Texas A&M University.

出版信息

J Am Stat Assoc. 2024;119(547):1985-1999. doi: 10.1080/01621459.2023.2233744. Epub 2023 Sep 6.

DOI:10.1080/01621459.2023.2233744
PMID:39507103
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11536292/
Abstract

In a traditional Gaussian graphical model, data homogeneity is routinely assumed with no extra variables affecting the conditional independence. In modern genomic datasets, there is an abundance of auxiliary information, which often gets under-utilized in determining the joint dependency structure. In this article, we consider a Bayesian approach to model undirected graphs underlying heterogeneous multivariate observations with additional assistance from covariates. Building on product partition models, we propose a novel covariate-dependent Gaussian graphical model that allows graphs to vary with covariates so that observations whose covariates are similar share a similar undirected graph. To efficiently embed Gaussian graphical models into our proposed framework, we explore both Gaussian likelihood and pseudo-likelihood functions. For Gaussian likelihood, a G-Wishart distribution is used as a natural conjugate prior, and for the pseudo-likelihood, a product of Gaussianconditionals is used. Moreover, the proposed model has large prior support and is flexible to approximate any -Hölder conditional variance-covariance matrices with . We further show that based on the theory of fractional likelihood, the rate of posterior contraction is minimax optimal assuming the true density to be a Gaussian mixture with a known number of components. The efficacy of the approach is demonstrated via simulation studies and an analysis of a protein network for a breast cancer dataset assisted by mRNA gene expression as covariates.

摘要

在传统的高斯图形模型中,通常假定数据具有同质性,不存在影响条件独立性的额外变量。在现代基因组数据集中,存在大量辅助信息,而这些信息在确定联合依赖结构时往往未得到充分利用。在本文中,我们考虑一种贝叶斯方法,在协变量的额外辅助下,对异构多变量观测值背后的无向图进行建模。基于乘积划分模型,我们提出了一种新颖的依赖协变量的高斯图形模型,该模型允许图随协变量变化,使得协变量相似的观测值共享一个相似的无向图。为了将高斯图形模型有效地嵌入到我们提出的框架中,我们探索了高斯似然函数和伪似然函数。对于高斯似然,使用G-Wishart分布作为自然共轭先验,对于伪似然,使用高斯条件的乘积。此外,所提出的模型具有较大的先验支持,并且能够灵活地近似任何具有的-Hölder条件方差-协方差矩阵。我们进一步表明,基于分数似然理论,假设真实密度为具有已知成分数量的高斯混合模型,则后验收缩率是极小极大最优的。通过模拟研究以及对以mRNA基因表达作为协变量辅助的乳腺癌数据集的蛋白质网络分析,证明了该方法的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/79bc71a643c6/nihms-1917957-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/7c3eed881bb0/nihms-1917957-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/d321e53f7e85/nihms-1917957-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/589f16399e96/nihms-1917957-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/6c08ff9dfba9/nihms-1917957-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/f90ea16fe1c2/nihms-1917957-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/9bd804f2ebec/nihms-1917957-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/059defd4389f/nihms-1917957-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/601d57f6c961/nihms-1917957-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/8bda3d53d5bd/nihms-1917957-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/4b27c5cc383f/nihms-1917957-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/cf34cfc40b89/nihms-1917957-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/79bc71a643c6/nihms-1917957-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/7c3eed881bb0/nihms-1917957-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/d321e53f7e85/nihms-1917957-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/589f16399e96/nihms-1917957-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/6c08ff9dfba9/nihms-1917957-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/f90ea16fe1c2/nihms-1917957-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/9bd804f2ebec/nihms-1917957-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/059defd4389f/nihms-1917957-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/601d57f6c961/nihms-1917957-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/8bda3d53d5bd/nihms-1917957-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/4b27c5cc383f/nihms-1917957-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/cf34cfc40b89/nihms-1917957-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a71/11536292/79bc71a643c6/nihms-1917957-f0012.jpg

相似文献

1
Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data.用于异构数据的协变量辅助贝叶斯图学习
J Am Stat Assoc. 2024;119(547):1985-1999. doi: 10.1080/01621459.2023.2233744. Epub 2023 Sep 6.
2
Bayesian Covariate-Dependent Gaussian Graphical Models with Varying Structure.具有可变结构的贝叶斯协变量相关高斯图形模型
J Mach Learn Res. 2022;23(242).
3
Bayesian graph selection consistency under model misspecification.模型误设下的贝叶斯图选择一致性
Bernoulli (Andover). 2021 Feb;27(1):637-672. doi: 10.3150/20-BEJ1253. Epub 2020 Nov 20.
4
Robust Bayesian graphical regression models for assessing tumor heterogeneity in proteomic networks.用于评估蛋白质组学网络中肿瘤异质性的稳健贝叶斯图形回归模型。
Biometrics. 2025 Jan 7;81(1). doi: 10.1093/biomtc/ujae160.
5
Bayesian Graphical Regression.贝叶斯图形回归
J Am Stat Assoc. 2019;114(525):184-197. doi: 10.1080/01621459.2017.1389739. Epub 2018 Jun 28.
6
Bayesian Inference of Multiple Gaussian Graphical Models.多个高斯图形模型的贝叶斯推断
J Am Stat Assoc. 2015 Mar 1;110(509):159-174. doi: 10.1080/01621459.2014.896806.
7
Bayesian multiple Gaussian graphical models for multilevel variables from unknown classes.用于来自未知类别的多级变量的贝叶斯多重高斯图形模型。
Stat Methods Med Res. 2022 Apr;31(4):594-611. doi: 10.1177/09622802211022405. Epub 2022 Feb 15.
8
Graph Estimation From Multi-Attribute Data.基于多属性数据的图估计
J Mach Learn Res. 2014 May;15(May):1713-1750.
9
Graphical Gaussian Process Models for Highly Multivariate Spatial Data.用于高度多元空间数据的图形高斯过程模型
Biometrika. 2022 Dec;109(4):993-1014. doi: 10.1093/biomet/asab061. Epub 2021 Dec 4.
10
Regression-Based Bayesian Estimation and Structure Learning for Nonparanormal Graphical Models.基于回归的非正态图形模型的贝叶斯估计与结构学习
Stat Anal Data Min. 2022 Oct;15(5):611-629. doi: 10.1002/sam.11576. Epub 2022 Feb 28.

本文引用的文献

1
Bayesian Graphical Regression.贝叶斯图形回归
J Am Stat Assoc. 2019;114(525):184-197. doi: 10.1080/01621459.2017.1389739. Epub 2018 Jun 28.
2
A generalized likelihood-based Bayesian approach for scalable joint regression and covariance selection in high dimensions.一种基于广义似然的贝叶斯方法,用于高维数据中可扩展的联合回归和协方差选择。
Stat Comput. 2022 Jun;32(3). doi: 10.1007/s11222-022-10102-5. Epub 2022 Jun 3.
3
Hierarchical Normalized Completely Random Measures for Robust Graphical Modeling.用于稳健图形建模的分层归一化完全随机测度
Bayesian Anal. 2019 Dec;14(4):1271-1301. doi: 10.1214/19-ba1153. Epub 2019 Mar 28.
4
TCGA-assembler 2: software pipeline for retrieval and processing of TCGA/CPTAC data.TCGA汇编器2:用于检索和处理TCGA/CPTAC数据的软件管道
Bioinformatics. 2018 May 1;34(9):1615-1617. doi: 10.1093/bioinformatics/btx812.
5
Covariate-Adjusted Precision Matrix Estimation with an Application in Genetical Genomics.协变量调整的精度矩阵估计及其在遗传基因组学中的应用
Biometrika. 2013 Mar;100(1):139-156. doi: 10.1093/biomet/ass058. Epub 2012 Nov 30.
6
On joint estimation of Gaussian graphical models for spatial and temporal data.关于空间和时间数据的高斯图形模型的联合估计
Biometrics. 2017 Sep;73(3):769-779. doi: 10.1111/biom.12650. Epub 2017 Jan 18.
7
Sparse covariance estimation in heterogeneous samples.异质样本中的稀疏协方差估计
Electron J Stat. 2011;5:981-1014. doi: 10.1214/11-EJS634. Epub 2011 Sep 15.
8
Bayesian Inference for General Gaussian Graphical Models With Application to Multivariate Lattice Data.具有多元格点数据应用的一般高斯图形模型的贝叶斯推断。
J Am Stat Assoc. 2011;106(496):1418-1433. doi: 10.1198/jasa.2011.tm10465. Epub 2012 Dec 24.
9
Modeling Protein Expression and Protein Signaling Pathways.蛋白质表达与蛋白质信号通路建模
J Am Stat Assoc. 2011;107(500):1372-1384. doi: 10.1080/01621459.2012.706121.
10
DINGO: differential network analysis in genomics.DINGO:基因组学中的差异网络分析
Bioinformatics. 2015 Nov 1;31(21):3413-20. doi: 10.1093/bioinformatics/btv406. Epub 2015 Jul 6.