Incorporating predictor network in penalized regression with application to microarray data.

Authors

Pan Wei, Xie Benhuai, Shen Xiaotong

Affiliation

Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota 55455, USA.

Publication

Biometrics. 2010 Jun;66(2):474-84. doi: 10.1111/j.1541-0420.2009.01296.x. Epub 2009 Jul 23.

DOI: 10.1111/j.1541-0420.2009.01296.x
PMID: 19645699
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3338337/

Abstract

We consider penalized linear regression, especially for "large p, small n" problems, for which the relationships among predictors are described a priori by a network. A class of motivating examples includes modeling a phenotype through gene expression profiles while accounting for coordinated functioning of genes in the form of biological pathways or networks. To incorporate the prior knowledge of the similar effect sizes of neighboring predictors in a network, we propose a grouped penalty based on the L(gamma)-norm that smoothes the regression coefficients of the predictors over the network. The main feature of the proposed method is its ability to automatically realize grouped variable selection and exploit grouping effects. We also discuss effects of the choices of the gamma and some weights inside the L(gamma)-norm. Simulation studies demonstrate the superior finite-sample performance of the proposed method as compared to Lasso, elastic net, and a recently proposed network-based method. The new method performs best in variable selection across all simulation set-ups considered. For illustration, the method is applied to a microarray dataset to predict survival times for some glioblastoma patients using a gene expression dataset and a gene network compiled from some Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways.
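
The abstract does not spell out the penalty itself, so the following is only a minimal sketch of what a grouped L(gamma)-norm penalty over a network could look like. It assumes a form in which every edge (i, j) contributes (|beta_i|^gamma / w_i + |beta_j|^gamma / w_j)^(1/gamma); the function name `network_penalty`, the toy graph, and the degree-based weights are hypothetical choices for illustration, not details taken from the abstract.

```python
import numpy as np

def network_penalty(beta, edges, weights, gamma=2.0):
    """Assumed grouped penalty: sum over edges (i, j) of
    (|beta_i|^gamma / w_i + |beta_j|^gamma / w_j)^(1/gamma)."""
    beta = np.asarray(beta, dtype=float)
    weights = np.asarray(weights, dtype=float)
    total = 0.0
    for i, j in edges:
        total += (abs(beta[i]) ** gamma / weights[i]
                  + abs(beta[j]) ** gamma / weights[j]) ** (1.0 / gamma)
    return total

# Hypothetical toy network: a path 0 - 1 - 2 plus a separate edge 3 - 4.
edges = [(0, 1), (1, 2), (3, 4)]

# Illustrative weights: the degree of each node (an assumption, one of
# several weight choices that could sit inside the norm).
degree = np.bincount(np.ravel(edges), minlength=5)
weights = np.maximum(degree, 1)

# Coefficients that are constant on the first component and zero on the second.
beta = np.array([1.0, 1.0, 1.0, 0.0, 0.0])
penalty = network_penalty(beta, edges, weights, gamma=2.0)
print(penalty)
```

In this sketch an edge term vanishes only when both of its coefficients are zero, which is the mechanism by which a grouped penalty can drop neighboring predictors together. With gamma = 1 and unit weights the term reduces to |beta_i| + |beta_j|, a Lasso-like penalty with no grouping across the edge.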



