Suppr超能文献

基于网络的惩罚回归及其在基因组数据中的应用。

Network-based penalized regression with application to genomic data.

作者信息

Kim Sunkyung, Pan Wei, Shen Xiaotong

机构信息

Division of Biostatistics, University of Minnesota, Minneapolis, Minnesota 55405, U.S.A.

出版信息

Biometrics. 2013 Sep;69(3):582-93. doi: 10.1111/biom.12035. Epub 2013 Jul 3.

Abstract

Penalized regression approaches are attractive in dealing with high-dimensional data such as arising in high-throughput genomic studies. New methods have been introduced to utilize the network structure of predictors, for example, gene networks, to improve parameter estimation and variable selection. All the existing network-based penalized methods are based on an assumption that parameters, for example, regression coefficients, of neighboring nodes in a network are close in magnitude, which however may not hold. Here we propose a novel penalized regression method based on a weaker prior assumption that the parameters of neighboring nodes in a network are likely to be zero (or non-zero) at the same time, regardless of their specific magnitudes. We propose a novel non-convex penalty function to incorporate this prior, and an algorithm based on difference convex programming. We use simulated data and two breast cancer gene expression datasets to demonstrate the advantages of the proposed methods over some existing methods. Our proposed methods can be applied to more general problems for group variable selection.

摘要

惩罚回归方法在处理高维数据(如高通量基因组研究中出现的数据)方面具有吸引力。已经引入了新的方法来利用预测变量的网络结构,例如基因网络,以改进参数估计和变量选择。所有现有的基于网络的惩罚方法都基于这样一个假设,即网络中相邻节点的参数(例如回归系数)在大小上相近,但这一假设可能并不成立。在此,我们基于一个较弱的先验假设提出了一种新颖的惩罚回归方法,即网络中相邻节点的参数可能同时为零(或非零),而不管其具体大小如何。我们提出了一种新颖的非凸惩罚函数来纳入这一先验,并提出了一种基于差分凸规划的算法。我们使用模拟数据和两个乳腺癌基因表达数据集来证明所提出的方法相对于一些现有方法的优势。我们提出的方法可应用于更一般的组变量选择问题。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec13/4007772/98f7669cc73a/nihms450179f1.jpg

相似文献

7
Regression-Based Network Estimation for High-Dimensional Genetic Data.基于回归的高维遗传数据网络估计
J Comput Biol. 2019 Apr;26(4):336-349. doi: 10.1089/cmb.2018.0225. Epub 2019 Jan 17.

引用本文的文献

4
Prediction models with graph kernel regularization for network data.用于网络数据的带有图核正则化的预测模型。
J Appl Stat. 2022 Jan 31;50(6):1400-1417. doi: 10.1080/02664763.2022.2028745. eCollection 2023.

本文引用的文献

4
Likelihood-based selection and sharp parameter estimation.基于似然性的选择与精确参数估计。
J Am Stat Assoc. 2012 Jan 1;107(497):223-232. doi: 10.1080/01621459.2011.645783. Epub 2012 Jun 11.
6
Grouping pursuit through a regularization solution surface.通过正则化解曲面进行分组追踪。
J Am Stat Assoc. 2010 Jun 1;105(490):727-739. doi: 10.1198/jasa.2010.tm09380.
9
Network-based multiple locus linkage analysis of expression traits.基于网络的表达性状多位点连锁分析。
Bioinformatics. 2009 Jun 1;25(11):1390-6. doi: 10.1093/bioinformatics/btp177. Epub 2009 Mar 31.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验