• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于基因组数据分析的网络正则化高维Cox回归

NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA.

作者信息

Sun Hokeun, Lin Wei, Feng Rui, Li Hongzhe

机构信息

Pusan National University.

University of Pennsylvania.

出版信息

Stat Sin. 2014 Jul;24(3):1433-1459. doi: 10.5705/ss.2012.317.

DOI:10.5705/ss.2012.317
PMID:26316678
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4549005/
Abstract

We consider estimation and variable selection in high-dimensional Cox regression when a prior knowledge of the relationships among the covariates, described by a network or graph, is available. A limitation of the existing methodology for survival analysis with high-dimensional genomic data is that a wealth of structural information about many biological processes, such as regulatory networks and pathways, has often been ignored. In order to incorporate such prior network information into the analysis of genomic data, we propose a network-based regularization method for high-dimensional Cox regression; it uses an ℓ-penalty to induce sparsity of the regression coefficients and a quadratic Laplacian penalty to encourage smoothness between the coefficients of neighboring variables on a given network. The proposed method is implemented by an efficient coordinate descent algorithm. In the setting where the dimensionality can grow exponentially fast with the sample size , we establish model selection consistency and estimation bounds for the proposed estimators. The theoretical results provide insights into the gain from taking into account the network structural information. Extensive simulation studies indicate that our method outperforms Lasso and elastic net in terms of variable selection accuracy and stability. We apply our method to a breast cancer gene expression study and identify several biologically plausible subnetworks and pathways that are associated with breast cancer distant metastasis.

摘要

当可以获得由网络或图描述的协变量之间关系的先验知识时,我们考虑高维Cox回归中的估计和变量选择。现有用于高维基因组数据生存分析方法的一个局限性在于,关于许多生物过程(如调控网络和信号通路)的大量结构信息常常被忽视。为了将此类先验网络信息纳入基因组数据分析,我们提出了一种用于高维Cox回归的基于网络的正则化方法;它使用ℓ惩罚来诱导回归系数的稀疏性,并使用二次拉普拉斯惩罚来鼓励给定网络上相邻变量系数之间的平滑性。所提出的方法通过一种高效的坐标下降算法实现。在维度可以随样本量呈指数快速增长的情况下,我们为所提出的估计器建立了模型选择一致性和估计界。理论结果为考虑网络结构信息所带来的收益提供了见解。广泛的模拟研究表明,我们的方法在变量选择准确性和稳定性方面优于Lasso和弹性网络。我们将我们的方法应用于一项乳腺癌基因表达研究,并识别出几个与乳腺癌远处转移相关的具有生物学合理性的子网和信号通路。

相似文献

1
NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA.用于基因组数据分析的网络正则化高维Cox回归
Stat Sin. 2014 Jul;24(3):1433-1459. doi: 10.5705/ss.2012.317.
2
The L regularization network Cox model for analysis of genomic data.L 正则化网络 Cox 模型用于分析基因组数据。
Comput Biol Med. 2018 Sep 1;100:203-208. doi: 10.1016/j.compbiomed.2018.07.009. Epub 2018 Jul 17.
3
The Sparse Laplacian Shrinkage Estimator for High-Dimensional Regression.用于高维回归的稀疏拉普拉斯收缩估计器
Ann Stat. 2011;39(4):2021-2046. doi: 10.1214/11-aos897.
4
VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS.具有基因组学应用的图结构协变量的变量选择与回归分析
Ann Appl Stat. 2010 Sep 1;4(3):1498-1516. doi: 10.1214/10-AOAS332.
5
Regularization Methods for High-Dimensional Instrumental Variables Regression With an Application to Genetical Genomics.高维工具变量回归的正则化方法及其在遗传基因组学中的应用
J Am Stat Assoc. 2015;110(509):270-288. doi: 10.1080/01621459.2014.908125.
6
A Bayesian Approach for Graph-constrained Estimation for High-dimensional Regression.一种用于高维回归的图约束估计的贝叶斯方法。
Int J Syst Synth Biol. 2010;1(2):255-272.
7
Network-constrained regularization and variable selection for analysis of genomic data.用于基因组数据分析的网络约束正则化和变量选择
Bioinformatics. 2008 May 1;24(9):1175-82. doi: 10.1093/bioinformatics/btn081. Epub 2008 Mar 1.
8
Sparse logistic regression with a L1/2 penalty for gene selection in cancer classification.基于 L1/2 罚项的稀疏逻辑回归在癌症分类中的基因选择。
BMC Bioinformatics. 2013 Jun 19;14:198. doi: 10.1186/1471-2105-14-198.
9
ADAPTIVE ROBUST VARIABLE SELECTION.自适应鲁棒变量选择
Ann Stat. 2014 Feb 1;42(1):324-351. doi: 10.1214/13-AOS1191.
10
glmgraph: an R package for variable selection and predictive modeling of structured genomic data.glmgraph:一个用于结构化基因组数据变量选择和预测建模的R包。
Bioinformatics. 2015 Dec 15;31(24):3991-3. doi: 10.1093/bioinformatics/btv497. Epub 2015 Aug 26.

引用本文的文献

1
Network-based multi-class classifier to identify optimized gene networks for acute leukemia cell line classification.基于网络的多类分类器,用于识别急性白血病细胞系分类的优化基因网络。
PLoS One. 2025 May 8;20(5):e0321549. doi: 10.1371/journal.pone.0321549. eCollection 2025.
2
Sparse spectral graph analysis and its application to gastric cancer drug resistance-specific molecular interplays identification.稀疏谱图分析及其在胃癌耐药特异性分子相互作用识别中的应用。
PLoS One. 2024 Jul 5;19(7):e0305386. doi: 10.1371/journal.pone.0305386. eCollection 2024.
3
Bayesian functional analysis for untargeted metabolomics data with matching uncertainty and small sample sizes.贝叶斯功能分析用于具有匹配不确定性和小样本量的非靶向代谢组学数据。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae141.
4
New statistical selection method for pleiotropic variants associated with both quantitative and qualitative traits.一种新的统计选择方法,用于与数量性状和质量性状都相关的多效变异体。
BMC Bioinformatics. 2023 Oct 10;24(1):381. doi: 10.1186/s12859-023-05505-8.
5
Topology-based radiomic features for prediction of parotid gland cancer malignancy grade in magnetic resonance images.基于拓扑学的影像组学特征用于预测磁共振图像中腮腺癌的恶性程度
MAGMA. 2023 Oct;36(5):767-777. doi: 10.1007/s10334-023-01084-0. Epub 2023 Apr 20.
6
Regularized regression when covariates are linked on a network: the 3CoSE algorithm.当协变量在网络上相关联时的正则化回归:3CoSE算法
J Appl Stat. 2021 Oct 7;50(3):535-554. doi: 10.1080/02664763.2021.1982878. eCollection 2023.
7
Computational Tactics for Precision Cancer Network Biology.精准癌症网络生物学的计算策略。
Int J Mol Sci. 2022 Nov 19;23(22):14398. doi: 10.3390/ijms232214398.
8
PredictiveNetwork: predictive gene network estimation with application to gastric cancer drug response-predictive network analysis.预测网络:具有应用于胃癌药物反应预测网络分析的预测基因网络估计。
BMC Bioinformatics. 2022 Aug 16;23(1):342. doi: 10.1186/s12859-022-04871-z.
9
Network-based survival analysis to discover target genes for developing cancer immunotherapies and predicting patient survival.基于网络的生存分析,以发现用于开发癌症免疫疗法的靶基因并预测患者生存情况。
J Appl Stat. 2021;48(8):1352-1373. doi: 10.1080/02664763.2020.1812543. Epub 2020 Sep 3.
10
Knowledge-Guided Statistical Learning Methods for Analysis of High-Dimensional -Omics Data in Precision Oncology.用于精准肿瘤学中高维组学数据分析的知识引导统计学习方法
JCO Precis Oncol. 2019 Oct 24;3. doi: 10.1200/PO.19.00018. eCollection 2019 Oct.

本文引用的文献

1
Regularization Paths for Cox's Proportional Hazards Model via Coordinate Descent.通过坐标下降法求解Cox比例风险模型的正则化路径
J Stat Softw. 2011 Mar;39(5):1-13. doi: 10.18637/jss.v039.i05.
2
Non-Asymptotic Oracle Inequalities for the High-Dimensional Cox Regression via Lasso.通过套索法实现高维Cox回归的非渐近最优不等式
Stat Sin. 2014 Jan 1;24(1):25-42. doi: 10.5705/ss.2012.240.
3
ORACLE INEQUALITIES FOR THE LASSO IN THE COX MODEL.Cox模型中套索回归的Oracle不等式
Ann Stat. 2013 Jun 1;41(3):1142-1165. doi: 10.1214/13-AOS1098.
4
ELASTIC NET FOR COX'S PROPORTIONAL HAZARDS MODEL WITH A SOLUTION PATH ALGORITHM.带求解路径算法的Cox比例风险模型的弹性网络法
Stat Sin. 2012;22:27-294. doi: 10.5705/ss.2010.107.
5
REGULARIZATION FOR COX'S PROPORTIONAL HAZARDS MODEL WITH NP-DIMENSIONALITY.具有NP维数的Cox比例风险模型的正则化
Ann Stat. 2011;39(6):3092-3120. doi: 10.1214/11-AOS911.
6
VARIABLE SELECTION AND REGRESSION ANALYSIS FOR GRAPH-STRUCTURED COVARIATES WITH AN APPLICATION TO GENOMICS.具有基因组学应用的图结构协变量的变量选择与回归分析
Ann Appl Stat. 2010 Sep 1;4(3):1498-1516. doi: 10.1214/10-AOAS332.
7
The Sparse Laplacian Shrinkage Estimator for High-Dimensional Regression.用于高维回归的稀疏拉普拉斯收缩估计器
Ann Stat. 2011;39(4):2021-2046. doi: 10.1214/11-aos897.
8
COORDINATE DESCENT ALGORITHMS FOR NONCONVEX PENALIZED REGRESSION, WITH APPLICATIONS TO BIOLOGICAL FEATURE SELECTION.用于非凸惩罚回归的坐标下降算法及其在生物特征选择中的应用
Ann Appl Stat. 2011 Jan 1;5(1):232-253. doi: 10.1214/10-AOAS388.
9
HLA-E and HLA-G expression in classical HLA class I-negative tumors is of prognostic value for clinical outcome of early breast cancer patients.经典 HLA I 类分子阴性肿瘤中 HLA-E 和 HLA-G 的表达对早期乳腺癌患者临床结局具有预后价值。
J Immunol. 2010 Dec 15;185(12):7452-9. doi: 10.4049/jimmunol.1002629. Epub 2010 Nov 5.
10
One-step Sparse Estimates in Nonconcave Penalized Likelihood Models.非凹惩罚似然模型中的一步稀疏估计
Ann Stat. 2008 Aug 1;36(4):1509-1533. doi: 10.1214/009053607000000802.