具有肿瘤特征分析的套索惩罚 Cox 模型的预后可提高预测准确性，优于仅使用临床数据的预测，并且受益于二维预筛选。

Prognosis of lasso-like penalized Cox models with tumor profiling improves prediction over clinical data alone and benefits from bi-dimensional pre-screening.

机构信息

IRIG, Biosanté U1292, Univ. Grenoble Alpes, Inserm, CEA, Grenoble, France.

GIPSA-lab, Institute of Engineering University Grenoble Alpes, Univ. Grenoble Alpes, CNRS, Grenoble INP, Grenoble, France.

出版信息

BMC Cancer. 2022 Oct 5;22(1):1045. doi: 10.1186/s12885-022-10117-1.

DOI:10.1186/s12885-022-10117-1

PMID:36199072

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9533541/

Abstract

BACKGROUND

Prediction of patient survival from tumor molecular '-omics' data is a key step toward personalized medicine. Cox models performed on RNA profiling datasets are popular for clinical outcome predictions. But these models are applied in the context of "high dimension", as the number p of covariates (gene expressions) greatly exceeds the number n of patients and e of events. Thus, pre-screening together with penalization methods are widely used for dimensional reduction.

METHODS

In the present paper, (i) we benchmark the performance of the lasso penalization and three variants (i.e., ridge, elastic net, adaptive elastic net) on 16 cancers from TCGA after pre-screening, (ii) we propose a bi-dimensional pre-screening procedure based on both gene variability and p-values from single variable Cox models to predict survival, and (iii) we compare our results with iterative sure independence screening (ISIS).

RESULTS

First, we show that integration of mRNA-seq data with clinical data improves predictions over clinical data alone. Second, our bi-dimensional pre-screening procedure can only improve, in moderation, the C-index and/or the integrated Brier score, while excluding irrelevant genes for prediction. We demonstrate that the different penalization methods reached comparable prediction performances, with slight differences among datasets. Finally, we provide advice in the case of multi-omics data integration.

CONCLUSIONS

Tumor profiles convey more prognostic information than clinical variables such as stage for many cancer subtypes. Lasso and Ridge penalizations perform similarly than Elastic Net penalizations for Cox models in high-dimension. Pre-screening of the top 200 genes in term of single variable Cox model p-values is a practical way to reduce dimension, which may be particularly useful when integrating multi-omics.

摘要

背景

从肿瘤分子“组学”数据预测患者的生存情况是迈向个体化医疗的关键一步。基于 RNA 谱数据集的 Cox 模型常用于临床结局预测。但是，这些模型是在“高维”背景下应用的，因为协变量（基因表达）的数量 p 远远超过患者数量 n 和事件数量 e。因此，预筛选和惩罚方法被广泛用于降维。

方法

在本文中，（i）我们在 TCGA 的 16 种癌症中，对 Lasso 惩罚和三种变体（岭回归、弹性网络、自适应弹性网络）在预筛选后的表现进行了基准测试，（ii）我们提出了一种基于基因变异性和单变量 Cox 模型的 p 值的二维预筛选程序，用于预测生存，（iii）我们将结果与迭代确定性筛选（ISIS）进行了比较。

结果

首先，我们表明，将 mRNA-seq 数据与临床数据集成可以提高临床数据单独预测的准确性。其次，我们的二维预筛选程序只能适度提高 C 指数和/或综合 Brier 得分，同时排除与预测无关的基因。我们证明了不同的惩罚方法达到了类似的预测性能，不同数据集之间存在细微差异。最后，我们在多组学数据集成的情况下提供了建议。

结论

对于许多癌症亚型，肿瘤图谱比临床变量（如分期）传递更多的预后信息。在高维环境中，Lasso 和 Ridge 惩罚与弹性网络惩罚在 Cox 模型中的表现相似。基于单变量 Cox 模型 p 值筛选前 200 个基因是一种实用的降维方法，在整合多组学数据时可能特别有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6568/9533541/c92cf7496e2b/12885_2022_10117_Fig1_HTML.jpg

相似文献

Prognosis of lasso-like penalized Cox models with tumor profiling improves prediction over clinical data alone and benefits from bi-dimensional pre-screening.具有肿瘤特征分析的套索惩罚 Cox 模型的预后可提高预测准确性，优于仅使用临床数据的预测，并且受益于二维预筛选。

BMC Cancer. 2022 Oct 5;22(1):1045. doi: 10.1186/s12885-022-10117-1.

High-dimensional Cox models: the choice of penalty as part of the model building process.高维Cox模型：作为模型构建过程一部分的惩罚项选择

Biom J. 2010 Feb;52(1):50-69. doi: 10.1002/bimj.200900064.

Multi-omics facilitated variable selection in Cox-regression model for cancer prognosis prediction.多组学技术助力Cox回归模型中的变量选择以进行癌症预后预测。

Methods. 2017 Jul 15;124:100-107. doi: 10.1016/j.ymeth.2017.06.010. Epub 2017 Jun 13.

Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction.泛癌种评估基因表达和体细胞改变数据以预测癌症预后。

BMC Cancer. 2021 Sep 25;21(1):1053. doi: 10.1186/s12885-021-08796-3.

Combined Performance of Screening and Variable Selection Methods in Ultra-High Dimensional Data in Predicting Time-To-Event Outcomes.超高维数据中筛选和变量选择方法在预测事件发生时间结局方面的综合性能

Diagn Progn Res. 2018;2. doi: 10.1186/s41512-018-0043-4. Epub 2018 Sep 26.

Large-scale benchmark study of survival prediction methods using multi-omics data.大规模基于多组学数据的生存预测方法基准研究。

Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa167.

Identification of clinically relevant features in hypertensive patients using penalized regression: a case study of cardiovascular events.使用惩罚回归识别高血压患者的临床相关特征：心血管事件的案例研究。

Med Biol Eng Comput. 2019 Sep;57(9):2011-2026. doi: 10.1007/s11517-019-02007-9. Epub 2019 Jul 25.

Robust estimation of the expected survival probabilities from high-dimensional Cox models with biomarker-by-treatment interactions in randomized clinical trials.在随机临床试验中，通过生物标志物与治疗的相互作用，从高维Cox模型中稳健估计预期生存概率。

BMC Med Res Methodol. 2017 May 22;17(1):83. doi: 10.1186/s12874-017-0354-0.

Comparison of Cox Model Methods in A Low-dimensional Setting with Few Events.低维环境下少量事件的Cox模型方法比较

Genomics Proteomics Bioinformatics. 2016 Aug;14(4):235-43. doi: 10.1016/j.gpb.2016.03.006. Epub 2016 May 17.

A novel non-negative Bayesian stacking modeling method for Cancer survival prediction using high-dimensional omics data.一种使用高维组学数据进行癌症生存预测的新型非负贝叶斯堆叠建模方法。

BMC Med Res Methodol. 2024 May 3;24(1):105. doi: 10.1186/s12874-024-02232-3.

引用本文的文献

Predictive Model for In-Hospital Death in Older Patients with Type 2 Diabetes Mellitus: A Multicenter Retrospective Study in Southwest China.中国西南地区老年2型糖尿病患者院内死亡的预测模型：一项多中心回顾性研究

Diabetes Metab Syndr Obes. 2025 Jun 9;18:1873-1889. doi: 10.2147/DMSO.S527018. eCollection 2025.

Developing clinical prognostic models to predict graft survival after renal transplantation: comparison of statistical and machine learning models.开发临床预后模型以预测肾移植后的移植物存活：统计模型与机器学习模型的比较

BMC Med Inform Decis Mak. 2025 Feb 3;25(1):54. doi: 10.1186/s12911-025-02906-y.

A Narrative Review of Prognostic Gene Signatures in Oral Squamous Cell Carcinoma Using LASSO Cox Regression.使用LASSO Cox回归对口腔鳞状细胞癌预后基因特征的叙述性综述

Biomedicines. 2025 Jan 8;13(1):134. doi: 10.3390/biomedicines13010134.

A CLRN3-Based CD8 T-Related Gene Signature Predicts Prognosis and Immunotherapy Response in Colorectal Cancer.CLRN3 为基础的 CD8 T 细胞相关基因标志物预测结直肠癌的预后和免疫治疗反应。

Biomolecules. 2024 Jul 24;14(8):891. doi: 10.3390/biom14080891.

The molecular prognostic score, a classifier for risk stratification of high-grade serous ovarian cancer.分子预后评分，用于高级别浆液性卵巢癌风险分层的分类器。

J Ovarian Res. 2024 Aug 2;17(1):159. doi: 10.1186/s13048-024-01482-5.

Target Genes of c-MYC and MYCN with Prognostic Power in Neuroblastoma Exhibit Different Expressions during Sympathoadrenal Development.在神经母细胞瘤中具有预后价值的c-MYC和MYCN的靶基因在交感肾上腺发育过程中表现出不同的表达。

Cancers (Basel). 2023 Sep 16;15(18):4599. doi: 10.3390/cancers15184599.

COL7A1 Expression Improves Prognosis Prediction for Patients with Clear Cell Renal Cell Carcinoma Atop of Stage.COL7A1表达改善了透明细胞肾细胞癌患者分期之上的预后预测。

Cancers (Basel). 2023 May 10;15(10):2701. doi: 10.3390/cancers15102701.

Assessing Metabolic Markers in Glioblastoma Using Machine Learning: A Systematic Review.使用机器学习评估胶质母细胞瘤中的代谢标志物：一项系统综述。

Metabolites. 2023 Jan 21;13(2):161. doi: 10.3390/metabo13020161.

Optimal microRNA Sequencing Depth to Predict Cancer Patient Survival with Random Forest and Cox Models.随机森林和 Cox 模型预测癌症患者生存的最优 microRNA 测序深度。

Genes (Basel). 2022 Dec 2;13(12):2275. doi: 10.3390/genes13122275.

本文引用的文献

Genome-wide identification and analysis of prognostic features in human cancers.全基因组鉴定和分析人类癌症的预后特征。

Cell Rep. 2022 Mar 29;38(13):110569. doi: 10.1016/j.celrep.2022.110569.

Pan-cancer evaluation of gene expression and somatic alteration data for cancer prognosis prediction.泛癌种评估基因表达和体细胞改变数据以预测癌症预后。

BMC Cancer. 2021 Sep 25;21(1):1053. doi: 10.1186/s12885-021-08796-3.

Benchmark of filter methods for feature selection in high-dimensional gene expression survival data.高维基因表达生存数据中特征选择的过滤方法的基准测试。

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab354.

Current Achievements and Applications of Transcriptomics in Personalized Cancer Medicine.转录组学在个性化癌症医学中的当前成就和应用。

Int J Mol Sci. 2021 Jan 31;22(3):1422. doi: 10.3390/ijms22031422.

Large-scale benchmark study of survival prediction methods using multi-omics data.大规模基于多组学数据的生存预测方法基准研究。

Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa167.

Accounting for grouped predictor variables or pathways in high-dimensional penalized Cox regression models.在高维惩罚 Cox 回归模型中考虑分组预测变量或途径。

BMC Bioinformatics. 2020 Jul 2;21(1):277. doi: 10.1186/s12859-020-03618-y.

On fusion methods for knowledge discovery from multi-omics datasets.关于从多组学数据集中进行知识发现的融合方法

Comput Struct Biotechnol J. 2020 Mar 5;18:509-517. doi: 10.1016/j.csbj.2020.02.011. eCollection 2020.

Why Test for Proportional Hazards?为什么要检验比例风险？

JAMA. 2020 Apr 14;323(14):1401-1402. doi: 10.1001/jama.2020.1267.

Cancer prognosis with shallow tumor RNA sequencing.浅肿瘤 RNA 测序的癌症预后。

Nat Med. 2020 Feb;26(2):188-192. doi: 10.1038/s41591-019-0729-3. Epub 2020 Feb 10.

Improving survival prediction using a novel feature selection and feature reduction framework based on the integration of clinical and molecular data.基于临床与分子数据整合的新型特征选择与降维框架提高生存预测。

Pac Symp Biocomput. 2020;25:415-426.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

具有肿瘤特征分析的套索惩罚 Cox 模型的预后可提高预测准确性，优于仅使用临床数据的预测，并且受益于二维预筛选。

Prognosis of lasso-like penalized Cox models with tumor profiling improves prediction over clinical data alone and benefits from bi-dimensional pre-screening.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献