通过惩罚高斯最大似然法进行同时多重响应回归和逆协方差矩阵估计

Simultaneous Multiple Response Regression and Inverse Covariance Matrix Estimation via Penalized Gaussian Maximum Likelihood.

作者信息

Lee Wonyul, Liu Yufeng

机构信息

Department of Statistics and Operations Research, Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.

出版信息

J Multivar Anal. 2012 Oct 1;111:241-255. doi: 10.1016/j.jmva.2012.03.013. Epub 2012 Apr 27.

DOI:10.1016/j.jmva.2012.03.013

PMID:22791925

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3392174/

Abstract

Multivariate regression is a common statistical tool for practical problems. Many multivariate regression techniques are designed for univariate response cases. For problems with multiple response variables available, one common approach is to apply the univariate response regression technique separately on each response variable. Although it is simple and popular, the univariate response approach ignores the joint information among response variables. In this paper, we propose three new methods for utilizing joint information among response variables. All methods are in a penalized likelihood framework with weighted L(1) regularization. The proposed methods provide sparse estimators of conditional inverse co-variance matrix of response vector given explanatory variables as well as sparse estimators of regression parameters. Our first approach is to estimate the regression coefficients with plug-in estimated inverse covariance matrices, and our second approach is to estimate the inverse covariance matrix with plug-in estimated regression parameters. Our third approach is to estimate both simultaneously. Asymptotic properties of these methods are explored. Our numerical examples demonstrate that the proposed methods perform competitively in terms of prediction, variable selection, as well as inverse covariance matrix estimation.

摘要

多元回归是解决实际问题常用的统计工具。许多多元回归技术是针对单变量响应情形设计的。对于有多个响应变量的问题，一种常见方法是对每个响应变量分别应用单变量响应回归技术。尽管这种方法简单且常用，但单变量响应方法忽略了响应变量之间的联合信息。在本文中，我们提出了三种利用响应变量之间联合信息的新方法。所有方法都在惩罚似然框架下，采用加权(L(1))正则化。所提出的方法提供了给定解释变量时响应向量的条件逆协方差矩阵的稀疏估计以及回归参数的稀疏估计。我们的第一种方法是用代入估计的逆协方差矩阵来估计回归系数，第二种方法是用代入估计的回归参数来估计逆协方差矩阵。我们的第三种方法是同时进行估计。探索了这些方法的渐近性质。我们的数值例子表明，所提出的方法在预测、变量选择以及逆协方差矩阵估计方面具有竞争力。

相似文献

Simultaneous Multiple Response Regression and Inverse Covariance Matrix Estimation via Penalized Gaussian Maximum Likelihood.通过惩罚高斯最大似然法进行同时多重响应回归和逆协方差矩阵估计

J Multivar Anal. 2012 Oct 1;111:241-255. doi: 10.1016/j.jmva.2012.03.013. Epub 2012 Apr 27.

Multiple Response Regression for Gaussian Mixture Models with Known Labels.具有已知标签的高斯混合模型的多响应回归

Stat Anal Data Min. 2012 Dec 1;5(6). doi: 10.1002/sam.11158.

Sparse estimation of a covariance matrix.协方差矩阵的稀疏估计。

Biometrika. 2011 Dec;98(4):807-820. doi: 10.1093/biomet/asr054.

Fast Component Pursuit for Large-Scale Inverse Covariance Estimation.用于大规模逆协方差估计的快速成分追踪

KDD. 2016 Aug;2016:1585-1594. doi: 10.1145/2939672.2939851.

Joint Estimation of Multiple Precision Matrices with Common Structures.具有共同结构的多个精度矩阵的联合估计

J Mach Learn Res. 2015;16:1035-1062.

Multi-response Regression for Block-missing Multi-modal Data without Imputation.无插补的块缺失多模态数据的多响应回归

Stat Sin. 2024 Apr;34(2):527-546. doi: 10.5705/ss.202021.0170.

Estimation of Large-Dimensional Covariance Matrices via Second-Order Stein-Type Regularization.通过二阶斯坦因型正则化估计大维度协方差矩阵

Entropy (Basel). 2022 Dec 27;25(1):53. doi: 10.3390/e25010053.

Penalized Estimating Functions and Variable Selection in Semiparametric Regression Models.半参数回归模型中的惩罚估计函数与变量选择

J Am Stat Assoc. 2008 Jun 1;103(482):672-680. doi: 10.1198/016214508000000184.

Sparse Multivariate Regression With Covariance Estimation.带协方差估计的稀疏多元回归

J Comput Graph Stat. 2010 Fall;19(4):947-962. doi: 10.1198/jcgs.2010.09188.

Shrinkage estimators for covariance matrices.协方差矩阵的收缩估计量。

Biometrics. 2001 Dec;57(4):1173-84. doi: 10.1111/j.0006-341x.2001.01173.x.

引用本文的文献

Connectivity Regression.连通性回归

Biostatistics. 2024 Dec 31;26(1). doi: 10.1093/biostatistics/kxaf002.

Covariate-Assisted Bayesian Graph Learning for Heterogeneous Data.用于异构数据的协变量辅助贝叶斯图学习

J Am Stat Assoc. 2024;119(547):1985-1999. doi: 10.1080/01621459.2023.2233744. Epub 2023 Sep 6.

On the Use of Minimum Penalties in Statistical Learning.关于统计学习中最小惩罚的使用

J Comput Graph Stat. 2024;33(1):138-151. doi: 10.1080/10618600.2023.2210174. Epub 2023 Jun 20.

Multi-response Regression for Block-missing Multi-modal Data without Imputation.无插补的块缺失多模态数据的多响应回归

Stat Sin. 2024 Apr;34(2):527-546. doi: 10.5705/ss.202021.0170.

A generalized likelihood-based Bayesian approach for scalable joint regression and covariance selection in high dimensions.一种基于广义似然的贝叶斯方法，用于高维数据中可扩展的联合回归和协方差选择。

Stat Comput. 2022 Jun;32(3). doi: 10.1007/s11222-022-10102-5. Epub 2022 Jun 3.

Spatiotemporal variable selection and air quality impact assessment of COVID-19 lockdown.新冠疫情封锁措施的时空变量选择与空气质量影响评估

Spat Stat. 2022 Jun;49:100549. doi: 10.1016/j.spasta.2021.100549. Epub 2021 Oct 29.

A COVARIANCE-ENHANCED APPROACH TO MULTI-TISSUE JOINT EQTL MAPPING WITH APPLICATION TO TRANSCRIPTOME-WIDE ASSOCIATION STUDIES.一种用于多组织联合表达数量性状位点定位的协方差增强方法及其在全转录组关联研究中的应用

Ann Appl Stat. 2021 Jun;15(2):998-1016. doi: 10.1214/20-aoas1432. Epub 2021 Jul 12.

Bayesian Structure Learning in Multi-layered Genomic Networks.多层基因组网络中的贝叶斯结构学习

J Am Stat Assoc. 2021;116(534):605-618. doi: 10.1080/01621459.2020.1775611. Epub 2020 Jul 24.

Sparse Single Index Models for Multivariate Responses.用于多变量响应的稀疏单指标模型

J Comput Graph Stat. 2021;30(1):115-124. doi: 10.1080/10618600.2020.1779080. Epub 2020 Jul 28.

L2,1-norm regularized multivariate regression model with applications to genomic prediction.基于 L2,1-范数正则化的多元回归模型及其在基因组预测中的应用。

Bioinformatics. 2021 Sep 29;37(18):2896-2904. doi: 10.1093/bioinformatics/btab212.

本文引用的文献

Sparse Multivariate Regression With Covariance Estimation.带协方差估计的稀疏多元回归

J Comput Graph Stat. 2010 Fall;19(4):947-962. doi: 10.1198/jcgs.2010.09188.

Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1.整合基因组分析确定了具有 PDGFRA、IDH1、EGFR 和 NF1 异常的胶质母细胞瘤的临床相关亚型。

Cancer Cell. 2010 Jan 19;17(1):98-110. doi: 10.1016/j.ccr.2009.12.020.

Partial Correlation Estimation by Joint Sparse Regression Models.基于联合稀疏回归模型的偏相关估计

J Am Stat Assoc. 2009 Jun 1;104(486):735-746. doi: 10.1198/jasa.2009.0126.

Comprehensive genomic characterization defines human glioblastoma genes and core pathways.全面的基因组特征分析确定了人类胶质母细胞瘤的基因和核心通路。

Nature. 2008 Oct 23;455(7216):1061-8. doi: 10.1038/nature07385. Epub 2008 Sep 4.

Sparse inverse covariance estimation with the graphical lasso.使用图模型选择法进行稀疏逆协方差估计。

Biostatistics. 2008 Jul;9(3):432-41. doi: 10.1093/biostatistics/kxm045. Epub 2007 Dec 12.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验