Robust network-based regularization and variable selection for high-dimensional genomic data in cancer prognosis.

Authors

Ren Jie, Du Yinhao, Li Shaoyu, Ma Shuangge, Jiang Yu, Wu Cen

Affiliations

Department of Statistics, Kansas State University, Manhattan, Kansas.

Department of Mathematics and Statistics, University of North Carolina at Charlotte, Charlotte, North Carolina.

Publication information

Genet Epidemiol. 2019 Apr;43(3):276-291. doi: 10.1002/gepi.22194. Epub 2019 Feb 11.

DOI: 10.1002/gepi.22194
PMID: 30746793
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6446588/

Abstract

In cancer genomic studies, an important objective is to identify prognostic markers associated with patients' survival. Network-based regularization has achieved success in variable selection for high-dimensional cancer genomic data because of its ability to incorporate the correlations among genomic features. However, because survival time data usually follow skewed distributions and are contaminated by outliers, network-constrained regularization that does not take robustness into account leads to false identification of network structure and biased estimation of patients' survival. In this study, we develop a novel robust network-based variable selection method under the accelerated failure time model. Extensive simulation studies show the advantage of the proposed method over alternative methods. Two case studies of lung cancer datasets with high-dimensional gene expression measurements demonstrate that the proposed approach identifies markers with important implications.
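
The abstract does not spell out the objective function, so the following is only a generic sketch of what a robust, network-penalized regression objective can look like, not the authors' formulation. It assumes a least-absolute-deviation loss (less sensitive to outliers than squared error), an L1 sparsity penalty, and a graph-smoothness penalty over edges between correlated features. The function name, the edge-list format, and the tuning parameters `lam1` and `lam2` are all hypothetical, and censoring is ignored for brevity.

```python
from typing import List, Sequence, Tuple


def robust_network_objective(
    X: Sequence[Sequence[float]],
    y: Sequence[float],
    beta: Sequence[float],
    edges: List[Tuple[int, int, float]],
    lam1: float,
    lam2: float,
) -> float:
    """Illustrative objective: LAD loss + L1 penalty + network smoothness.

    X     : n-by-p feature matrix (e.g. gene expression), as nested sequences
    y     : n responses (under an AFT-type model, log survival times)
    beta  : p regression coefficients
    edges : (j, k, weight) triples linking correlated features (assumed format)
    lam1  : sparsity tuning parameter (hypothetical name)
    lam2  : network tuning parameter (hypothetical name)
    """
    n = len(y)
    # Least-absolute-deviation loss: grows linearly, not quadratically,
    # in the residual, so a single outlier has bounded influence.
    loss = 0.0
    for row, yi in zip(X, y):
        fitted = sum(x * b for x, b in zip(row, beta))
        loss += abs(yi - fitted)
    loss /= n

    # L1 penalty encourages most coefficients to be exactly zero.
    sparsity = lam1 * sum(abs(b) for b in beta)

    # Network penalty encourages connected features to have similar effects.
    smoothness = lam2 * sum(w * (beta[j] - beta[k]) ** 2 for j, k, w in edges)

    return loss + sparsity + smoothness


if __name__ == "__main__":
    X = [[1.0, 0.0], [0.0, 1.0]]
    y = [1.0, 2.0]
    edges = [(0, 1, 1.0)]  # one edge linking feature 0 and feature 1
    print(robust_network_objective(X, y, [1.0, 1.0], edges, lam1=0.5, lam2=2.0))
```

In practice such an objective would be minimized over `beta` with a solver suited to non-smooth penalties, and censored observations would need explicit handling (for example, through observation weights), neither of which is shown here.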
