Suppr超能文献

具有已知标签的高斯混合模型的多响应回归

Multiple Response Regression for Gaussian Mixture Models with Known Labels.

作者信息

Lee Wonyul, Du Ying, Sun Wei, Hayes D Neil, Liu Yufeng

机构信息

University of North Carolina at Chapel Hill.

出版信息

Stat Anal Data Min. 2012 Dec 1;5(6). doi: 10.1002/sam.11158.

Abstract

Multiple response regression is a useful regression technique to model multiple response variables using the same set of predictor variables. Most existing methods for multiple response regression are designed for modeling homogeneous data. In many applications, however, one may have heterogeneous data where the samples are divided into multiple groups. Our motivating example is a cancer dataset where the samples belong to multiple cancer subtypes. In this paper, we consider modeling the data coming from a mixture of several Gaussian distributions with known group labels. A naive approach is to split the data into several groups according to the labels and model each group separately. Although it is simple, this approach ignores potential common structures across different groups. We propose new penalized methods to model all groups jointly in which the common and unique structures can be identified. The proposed methods estimate the regression coefficient matrix, as well as the conditional inverse covariance matrix of response variables. Asymptotic properties of the proposed methods are explored. Through numerical examples, we demonstrate that both estimation and prediction can be improved by modeling all groups jointly using the proposed methods. An application to a glioblastoma cancer dataset reveals some interesting common and unique gene relationships across different cancer subtypes.

摘要

多响应回归是一种有用的回归技术,用于使用同一组预测变量对多个响应变量进行建模。大多数现有的多响应回归方法是为对同质数据进行建模而设计的。然而,在许多应用中,可能会遇到异质数据,其中样本被分为多个组。我们的激励示例是一个癌症数据集,其中样本属于多个癌症亚型。在本文中,我们考虑对来自具有已知组标签的多个高斯分布混合的数据进行建模。一种简单的方法是根据标签将数据分成几个组,并分别对每个组进行建模。虽然这种方法很简单,但它忽略了不同组之间潜在的共同结构。我们提出了新的惩罚方法来对所有组进行联合建模,从而可以识别共同结构和独特结构。所提出的方法估计回归系数矩阵以及响应变量的条件逆协方差矩阵。我们探索了所提出方法的渐近性质。通过数值示例,我们证明了使用所提出的方法对所有组进行联合建模可以提高估计和预测效果。对胶质母细胞瘤癌症数据集的应用揭示了不同癌症亚型之间一些有趣的共同和独特的基因关系。

相似文献

4
Estimation of multiple networks in Gaussian mixture models.高斯混合模型中多个网络的估计
Electron J Stat. 2016;10:1133-1154. doi: 10.1214/16-EJS1135. Epub 2016 May 2.
5
Joint estimation of multiple graphical models.多个图形模型的联合估计
Biometrika. 2011 Mar;98(1):1-15. doi: 10.1093/biomet/asq060. Epub 2011 Feb 9.
6
The graphical lasso: New insights and alternatives.图形套索:新见解与替代方法。
Electron J Stat. 2012 Nov 9;6:2125-2149. doi: 10.1214/12-EJS740.
7
Joint Estimation of Multiple Conditional Gaussian Graphical Models.多个条件高斯图形模型的联合估计
IEEE Trans Neural Netw Learn Syst. 2018 Jul;29(7):3034-3046. doi: 10.1109/TNNLS.2017.2710090. Epub 2017 Jun 28.

本文引用的文献

2
Sparse Multivariate Regression With Covariance Estimation.带协方差估计的稀疏多元回归
J Comput Graph Stat. 2010 Fall;19(4):947-962. doi: 10.1198/jcgs.2010.09188.
3
Joint estimation of multiple graphical models.多个图形模型的联合估计
Biometrika. 2011 Mar;98(1):1-15. doi: 10.1093/biomet/asq060. Epub 2011 Feb 9.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验