• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

协变量调整的精度矩阵估计及其在遗传基因组学中的应用

Covariate-Adjusted Precision Matrix Estimation with an Application in Genetical Genomics.

作者信息

Cai T Tony, Li Hongzhe, Liu Weidong, Xie Jichun

机构信息

Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA.

Department of Biostatistics and Epidemiology, University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania 19104, USA.

出版信息

Biometrika. 2013 Mar;100(1):139-156. doi: 10.1093/biomet/ass058. Epub 2012 Nov 30.

DOI:10.1093/biomet/ass058
PMID:28316337
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5351557/
Abstract

Motivated by analysis of genetical genomics data, we introduce a sparse high dimensional multivariate regression model for studying conditional independence relationships among a set of genes adjusting for possible genetic effects. The precision matrix in the model specifies a covariate-adjusted Gaussian graph, which presents the conditional dependence structure of gene expression after the confounding genetic effects on gene expression are taken into account. We present a covariate-adjusted precision matrix estimation method using a constrained ℓ minimization, which can be easily implemented by linear programming. Asymptotic convergence rates in various matrix norms and sign consistency are established for the estimators of the regression coefficients and the precision matrix, allowing both the number of genes and the number of the genetic variants to diverge. Simulation shows that the proposed method results in significant improvements in both precision matrix estimation and graphical structure selection when compared to the standard Gaussian graphical model assuming constant means. The proposed method is also applied to analyze a yeast genetical genomics data for the identification of the gene network among a set of genes in the mitogen-activated protein kinase pathway.

摘要

受遗传基因组学数据分析的启发,我们引入了一种稀疏高维多元回归模型,用于研究一组基因之间的条件独立关系,并对可能的遗传效应进行调整。模型中的精度矩阵指定了一个协变量调整后的高斯图,它呈现了在考虑基因表达上的混杂遗传效应后基因表达的条件依赖结构。我们提出了一种使用约束ℓ最小化的协变量调整精度矩阵估计方法,该方法可以通过线性规划轻松实现。为回归系数和精度矩阵的估计量建立了各种矩阵范数下的渐近收敛速度和符号一致性,允许基因数量和遗传变异数量都发散。模拟表明,与假设均值恒定的标准高斯图形模型相比,所提出的方法在精度矩阵估计和图形结构选择方面都有显著改进。所提出的方法还应用于分析酵母遗传基因组学数据,以识别丝裂原活化蛋白激酶途径中一组基因之间的基因网络。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7418/5351557/923f2a8ad2fa/nihms852665f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7418/5351557/f3b5ed99438f/nihms852665f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7418/5351557/ed72ce9cadac/nihms852665f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7418/5351557/923f2a8ad2fa/nihms852665f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7418/5351557/f3b5ed99438f/nihms852665f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7418/5351557/ed72ce9cadac/nihms852665f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7418/5351557/923f2a8ad2fa/nihms852665f3.jpg

相似文献

1
Covariate-Adjusted Precision Matrix Estimation with an Application in Genetical Genomics.协变量调整的精度矩阵估计及其在遗传基因组学中的应用
Biometrika. 2013 Mar;100(1):139-156. doi: 10.1093/biomet/ass058. Epub 2012 Nov 30.
2
A SPARSE CONDITIONAL GAUSSIAN GRAPHICAL MODEL FOR ANALYSIS OF GENETICAL GENOMICS DATA.一种用于遗传基因组数据分析的稀疏条件高斯图形模型。
Ann Appl Stat. 2011 Dec;5(4):2630-2650. doi: 10.1214/11-AOAS494.
3
Adjusting for High-dimensional Covariates in Sparse Precision Matrix Estimation by ℓ-Penalization.通过ℓ惩罚在稀疏精度矩阵估计中对高维协变量进行调整。
J Multivar Anal. 2013 Apr 1;116:365-381. doi: 10.1016/j.jmva.2013.01.005.
4
Learning directed acyclic graphical structures with genetical genomics data.利用遗传基因组学数据学习有向无环图结构
Bioinformatics. 2015 Dec 15;31(24):3953-60. doi: 10.1093/bioinformatics/btv513. Epub 2015 Sep 2.
5
Sparse Estimation of Conditional Graphical Models With Application to Gene Networks.条件图形模型的稀疏估计及其在基因网络中的应用
J Am Stat Assoc. 2012 Jan 1;107(497):152-167. doi: 10.1080/01621459.2011.644498.
6
Regularization Methods for High-Dimensional Instrumental Variables Regression With an Application to Genetical Genomics.高维工具变量回归的正则化方法及其在遗传基因组学中的应用
J Am Stat Assoc. 2015;110(509):270-288. doi: 10.1080/01621459.2014.908125.
7
High-Dimensional Gaussian Graphical Regression Models with Covariates.具有协变量的高维高斯图形回归模型
J Am Stat Assoc. 2023;118(543):2088-2100. doi: 10.1080/01621459.2022.2034632. Epub 2022 Mar 14.
8
Joint Learning of Multiple Sparse Matrix Gaussian Graphical Models.联合学习多个稀疏矩阵高斯图模型。
IEEE Trans Neural Netw Learn Syst. 2015 Nov;26(11):2606-20. doi: 10.1109/TNNLS.2014.2384201. Epub 2015 Mar 4.
9
Joint Estimation of Multiple High-dimensional Precision Matrices.多个高维精度矩阵的联合估计
Stat Sin. 2016 Apr;26(2):445-464. doi: 10.5705/ss.2014.256.
10
Model Selection and Estimation in the Matrix Normal Graphical Model.矩阵正态图形模型中的模型选择与估计
J Multivar Anal. 2012 May 1;107:119-140. doi: 10.1016/j.jmva.2012.01.005.

引用本文的文献

1
Multi-task Learning for Gaussian Graphical Regressions with High Dimensional Covariates.用于高维协变量高斯图形回归的多任务学习
J Comput Graph Stat. 2024 Dec 20. doi: 10.1080/10618600.2024.2421246.
2
Connectivity Regression.连通性回归
Biostatistics. 2024 Dec 31;26(1). doi: 10.1093/biostatistics/kxaf002.
3
Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data.用于异构数据的协变量辅助贝叶斯图学习

本文引用的文献

1
Sparse Multivariate Regression With Covariance Estimation.带协方差估计的稀疏多元回归
J Comput Graph Stat. 2010 Fall;19(4):947-962. doi: 10.1198/jcgs.2010.09188.
2
Sparse Estimation of Conditional Graphical Models With Application to Gene Networks.条件图形模型的稀疏估计及其在基因网络中的应用
J Am Stat Assoc. 2012 Jan 1;107(497):152-167. doi: 10.1080/01621459.2011.644498.
3
Regularized Multivariate Regression for Identifying Master Predictors with Application to Integrative Genomics Study of Breast Cancer.用于识别主要预测因子的正则化多元回归及其在乳腺癌综合基因组学研究中的应用
J Am Stat Assoc. 2024;119(547):1985-1999. doi: 10.1080/01621459.2023.2233744. Epub 2023 Sep 6.
4
Multi-response Regression for Block-missing Multi-modal Data without Imputation.无插补的块缺失多模态数据的多响应回归
Stat Sin. 2024 Apr;34(2):527-546. doi: 10.5705/ss.202021.0170.
5
A generalized likelihood-based Bayesian approach for scalable joint regression and covariance selection in high dimensions.一种基于广义似然的贝叶斯方法,用于高维数据中可扩展的联合回归和协方差选择。
Stat Comput. 2022 Jun;32(3). doi: 10.1007/s11222-022-10102-5. Epub 2022 Jun 3.
6
Bayesian Edge Regression in Undirected Graphical Models to Characterize Interpatient Heterogeneity in Cancer.无向图模型中的贝叶斯边缘回归用于刻画癌症患者间的异质性
J Am Stat Assoc. 2022;117(538):533-546. doi: 10.1080/01621459.2021.2000866. Epub 2022 Jan 5.
7
A Bayesian two-step integrative procedure incorporating prior knowledge for the identification of miRNA-mRNAs involved in hepatocellular carcinoma.贝叶斯两步整合程序,纳入先验知识,用于鉴定肝癌中涉及的 miRNA-mRNAs。
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:81-86. doi: 10.1109/EMBC48229.2022.9871330.
8
LARGE-SCALE MULTIPLE INFERENCE OF COLLECTIVE DEPENDENCE WITH APPLICATIONS TO PROTEIN FUNCTION.集体依赖性的大规模多重推断及其在蛋白质功能中的应用
Ann Appl Stat. 2021 Jun;15(2):902-924. doi: 10.1214/20-aoas1431. Epub 2021 Jul 12.
9
Conditional Gaussian graphical model for estimating personalized disease symptom networks.用于估计个性化疾病症状网络的条件高斯图形模型。
Stat Med. 2022 Feb 10;41(3):543-553. doi: 10.1002/sim.9274. Epub 2021 Dec 5.
10
Fused Lasso Regression for Identifying Differential Correlations in Brain Connectome Graphs.用于识别脑连接组图中差异相关性的融合套索回归
Stat Anal Data Min. 2018 Oct;11(5):203-226. doi: 10.1002/sam.11382. Epub 2018 Jul 11.
Ann Appl Stat. 2010 Mar;4(1):53-77. doi: 10.1214/09-AOAS271SUPP.
4
A SPARSE CONDITIONAL GAUSSIAN GRAPHICAL MODEL FOR ANALYSIS OF GENETICAL GENOMICS DATA.一种用于遗传基因组数据分析的稀疏条件高斯图形模型。
Ann Appl Stat. 2011 Dec;5(4):2630-2650. doi: 10.1214/11-AOAS494.
5
Partial Correlation Estimation by Joint Sparse Regression Models.基于联合稀疏回归模型的偏相关估计
J Am Stat Assoc. 2009 Jun 1;104(486):735-746. doi: 10.1198/jasa.2009.0126.
6
KEGG for representation and analysis of molecular networks involving diseases and drugs.KEGG 用于表示和分析涉及疾病和药物的分子网络。
Nucleic Acids Res. 2010 Jan;38(Database issue):D355-60. doi: 10.1093/nar/gkp896. Epub 2009 Oct 30.
7
Sparse inverse covariance estimation with the graphical lasso.使用图模型选择法进行稀疏逆协方差估计。
Biostatistics. 2008 Jul;9(3):432-41. doi: 10.1093/biostatistics/kxm045. Epub 2007 Dec 12.
8
Gradient directed regularization for sparse Gaussian concentration graphs, with applications to inference of genetic networks.用于稀疏高斯浓度图的梯度导向正则化及其在遗传网络推断中的应用
Biostatistics. 2006 Apr;7(2):302-17. doi: 10.1093/biostatistics/kxj008. Epub 2005 Dec 2.
9
From signatures to models: understanding cancer using microarrays.从特征到模型:利用微阵列技术理解癌症
Nat Genet. 2005 Jun;37 Suppl:S38-45. doi: 10.1038/ng1561.
10
The landscape of genetic complexity across 5,700 gene expression traits in yeast.酵母中5700个基因表达性状的遗传复杂性全景。
Proc Natl Acad Sci U S A. 2005 Feb 1;102(5):1572-7. doi: 10.1073/pnas.0408709102. Epub 2005 Jan 19.