一项通过机器学习策略确定肺腺癌新预后预测模型的大型队列研究。

A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies.

机构信息

Department of Thoracic Surgery, Zhongshan Hospital, Fudan University, 180 Fenglin Road, Shanghai, 200032, People's Republic of China.

出版信息

BMC Cancer. 2019 Sep 5;19(1):886. doi: 10.1186/s12885-019-6101-7.

DOI:10.1186/s12885-019-6101-7

PMID:31488089

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6729062/

Abstract

BACKGROUND

Predicting lung adenocarcinoma (LUAD) risk is crucial in determining further treatment strategies. Molecular biomarkers may improve risk stratification for LUAD.

METHODS

We analyzed the gene expression profiles of LUAD patients from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO). We initially used three distinct algorithms (sigFeature, random forest, and univariate Cox regression) to evaluate each gene's prognostic relevance. Survival related genes were then fitted into the least absolute shrinkage and selection operator (LASSO) model to build a risk prediction model for LUAD. After 100,000 times of calculation and model construction, a 16-gene-based prediction model capable of classifying LUAD patients into high-risk and low-risk groups was successfully built.

RESULTS

Using a combined strategy, we initially identified 2472 significant survival-related genes. Functional enrichment analysis demonstrated these genes' relevance to tumor initiation and progression. Using the LASSO method, we successfully built a reliable risk prediction model. The risk model was validated in two external sets and an independent set. The expression of these 16 genes was highly correlated with patients' risk. High-risk group patients witnessed poorer recurrence-free survival (RFS) and overall survival (OS) compared to low-risk group patients. Moreover, stratification analysis and decision curve analysis (DCA) confirmed the independence and potential translational value of this predictive tool. We also built a nomogram comprising risk model and stage to predict OS for LUAD patients.

CONCLUSIONS

Our risk model may serve as a practical and reliable prognosis predictive tool for LUAD and could provide novel insights into the understanding of the molecular mechanism of this disease.

摘要

背景

预测肺腺癌（LUAD）风险对于确定进一步的治疗策略至关重要。分子生物标志物可能会改善 LUAD 的风险分层。

方法

我们分析了来自癌症基因组图谱（TCGA）和基因表达综合数据库（GEO）的 LUAD 患者的基因表达谱。我们最初使用三种不同的算法（sigFeature、随机森林和单变量 Cox 回归）来评估每个基因的预后相关性。然后，将与生存相关的基因拟合到最小绝对收缩和选择算子（LASSO）模型中，以构建 LUAD 的风险预测模型。经过 100,000 次计算和模型构建后，成功构建了一个基于 16 个基因的预测模型，能够将 LUAD 患者分为高风险和低风险组。

结果

使用联合策略，我们最初确定了 2472 个与生存显著相关的基因。功能富集分析表明这些基因与肿瘤发生和进展有关。使用 LASSO 方法，我们成功构建了一个可靠的风险预测模型。该风险模型在两个外部数据集和一个独立数据集得到了验证。这些 16 个基因的表达与患者的风险高度相关。与低风险组患者相比，高风险组患者的无复发生存（RFS）和总生存（OS）较差。此外，分层分析和决策曲线分析（DCA）证实了该预测工具的独立性和潜在转化价值。我们还构建了一个包含风险模型和分期的列线图，以预测 LUAD 患者的 OS。

结论

我们的风险模型可能成为 LUAD 的一种实用且可靠的预后预测工具，并为深入了解该疾病的分子机制提供新的见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c257/6729062/6b7c2d1a8d6c/12885_2019_6101_Fig6_HTML.jpg

相似文献

A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies.一项通过机器学习策略确定肺腺癌新预后预测模型的大型队列研究。

BMC Cancer. 2019 Sep 5;19(1):886. doi: 10.1186/s12885-019-6101-7.

A Seven-Gene Signature with Close Immune Correlation Was Identified for Survival Prediction of Lung Adenocarcinoma.一个与免疫密切相关的七基因标志物被鉴定出来，可用于预测肺腺癌的生存。

Med Sci Monit. 2020 Jul 2;26:e924269. doi: 10.12659/MSM.924269.

Identification Six Metabolic Genes as Potential Biomarkers for Lung Adenocarcinoma.鉴定六个代谢基因作为肺腺癌的潜在生物标志物。

J Comput Biol. 2020 Oct;27(10):1532-1543. doi: 10.1089/cmb.2019.0454. Epub 2020 Apr 16.

A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm.基于机器学习算法的肺腺癌复发特异性基因预后预测模型。

Biomed Res Int. 2020 Nov 7;2020:9124792. doi: 10.1155/2020/9124792. eCollection 2020.

Identification of a methylomics-associated nomogram for predicting overall survival of stage I-II lung adenocarcinoma.鉴定一个甲基组学相关的列线图，用于预测 I-II 期肺腺癌的总生存期。

Sci Rep. 2021 May 11;11(1):9938. doi: 10.1038/s41598-021-89429-4.

A novel ferroptosis-related genes model for prognosis prediction of lung adenocarcinoma.一种新的与铁死亡相关的基因模型用于肺腺癌的预后预测。

BMC Pulm Med. 2021 Jul 13;21(1):229. doi: 10.1186/s12890-021-01588-2.

[Construction and Validation of Prognostic Risk Score Model of Autophagy Related Genes in Lung Adenocarcinoma].[肺腺癌自噬相关基因预后风险评分模型的构建与验证]

Zhongguo Fei Ai Za Zhi. 2021 Aug 20;24(8):557-566. doi: 10.3779/j.issn.1009-3419.2021.103.09. Epub 2021 Jul 14.

Development and validation of a robust immune-related prognostic signature in early-stage lung adenocarcinoma.早期肺腺癌中一种稳健的免疫相关预后标志物的开发与验证

J Transl Med. 2020 Oct 7;18(1):380. doi: 10.1186/s12967-020-02545-z.

A ten-gene signature-based risk assessment model predicts the prognosis of lung adenocarcinoma.基于十个基因的签名风险评估模型预测肺腺癌的预后。

BMC Cancer. 2020 Aug 20;20(1):782. doi: 10.1186/s12885-020-07235-z.

DNA methylation profiling to predict recurrence risk in stage Ι lung adenocarcinoma: Development and validation of a nomogram to clinical management.DNA 甲基化分析预测Ⅰ期肺腺癌复发风险：用于临床管理的列线图的建立和验证。

J Cell Mol Med. 2020 Jul;24(13):7576-7589. doi: 10.1111/jcmm.15393. Epub 2020 Jun 12.

引用本文的文献

Bioinformatics-Based Discovery of Therapeutic Targets in Cadmium-Induced Lung Adenocarcinoma: The Role of Oxyresveratrol.基于生物信息学的镉诱导肺腺癌治疗靶点发现：氧化白藜芦醇的作用

Biol Trace Elem Res. 2025 Jul 4. doi: 10.1007/s12011-025-04730-x.

Assessing the prognosis mortality in patients with cutaneous verrucous carcinoma using Lasso-cox regression model: a retrospective study.使用套索-考克斯回归模型评估皮肤疣状癌患者的预后死亡率：一项回顾性研究。

Discov Oncol. 2025 Jun 13;16(1):1091. doi: 10.1007/s12672-025-02893-6.

A machine learning approach for multimodal data fusion for survival prediction in cancer patients.一种用于癌症患者生存预测的多模态数据融合的机器学习方法。

NPJ Precis Oncol. 2025 May 6;9(1):128. doi: 10.1038/s41698-025-00917-6.

Machine learning-based model for CD4 conventional T cell genes to predict survival and immune responses in colorectal cancer.基于机器学习的 CD4 常规 T 细胞基因模型预测结直肠癌的生存和免疫反应。

Sci Rep. 2024 Oct 18;14(1):24426. doi: 10.1038/s41598-024-75270-y.

Diagnostic value of immune-related biomarker FAM83A in differentiating malignant from benign pleural effusion in lung adenocarcinoma.免疫相关生物标志物FAM83A在鉴别肺腺癌恶性与良性胸腔积液中的诊断价值

Discov Oncol. 2024 Jun 24;15(1):242. doi: 10.1007/s12672-024-01109-7.

UPP1 promotes lung adenocarcinoma progression through the induction of an immunosuppressive microenvironment.UPP1 通过诱导免疫抑制微环境促进肺腺癌进展。

Nat Commun. 2024 Feb 8;15(1):1200. doi: 10.1038/s41467-024-45340-w.

AI/ML advances in non-small cell lung cancer biomarker discovery.人工智能/机器学习在非小细胞肺癌生物标志物发现方面的进展。

Front Oncol. 2023 Dec 11;13:1260374. doi: 10.3389/fonc.2023.1260374. eCollection 2023.

Improving Pancreatic Cyst Management: Artificial Intelligence-Powered Prediction of Advanced Neoplasms through Endoscopic Ultrasound-Guided Confocal Endomicroscopy.改善胰腺囊肿管理：通过内镜超声引导共聚焦内镜检查利用人工智能预测高级别肿瘤

Biomimetics (Basel). 2023 Oct 19;8(6):496. doi: 10.3390/biomimetics8060496.

Autoencoder-based multimodal prediction of non-small cell lung cancer survival.基于自动编码器的非小细胞肺癌生存的多模态预测。

Sci Rep. 2023 Sep 22;13(1):15761. doi: 10.1038/s41598-023-42365-x.

The activity of cuproptosis pathway calculated by AUCell algorithm was employed to construct cuproptosis landscape in lung adenocarcinoma.采用AUCell算法计算的铜死亡通路活性来构建肺腺癌中的铜死亡景观。

Discov Oncol. 2023 Jul 23;14(1):135. doi: 10.1007/s12672-023-00755-7.

本文引用的文献

Cancer statistics, 2019.癌症统计数据，2019 年。

CA Cancer J Clin. 2019 Jan;69(1):7-34. doi: 10.3322/caac.21551. Epub 2019 Jan 8.

SOX30 specially prevents Wnt-signaling to suppress metastasis and improve prognosis of lung adenocarcinoma patients.SOX30 专门防止 Wnt 信号抑制肺腺癌患者的转移并改善预后。

Respir Res. 2018 Dec 4;19(1):241. doi: 10.1186/s12931-018-0952-3.

IGF2BP1 promotes SRF-dependent transcription in cancer in a m6A- and miRNA-dependent manner.IGF2BP1 通过 m6A 和 miRNA 依赖性方式促进癌症中 SRF 依赖性转录。

Nucleic Acids Res. 2019 Jan 10;47(1):375-390. doi: 10.1093/nar/gky1012.

Transcriptomic and functional network features of lung squamous cell carcinoma through integrative analysis of GEO and TCGA data.通过 GEO 和 TCGA 数据的综合分析，揭示肺鳞癌的转录组和功能网络特征。

Sci Rep. 2018 Oct 26;8(1):15834. doi: 10.1038/s41598-018-34160-w.

An expression signature model to predict lung adenocarcinoma-specific survival.一种预测肺腺癌特异性生存的表达特征模型。

Cancer Manag Res. 2018 Sep 24;10:3717-3732. doi: 10.2147/CMAR.S159563. eCollection 2018.

The Cancer Genome Atlas Comprehensive Molecular Characterization of Renal Cell Carcinoma.癌症基因组图谱：肾细胞癌的全面分子特征

Cell Rep. 2018 Jun 19;23(12):3698. doi: 10.1016/j.celrep.2018.06.032.

An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics.TCGA 泛癌临床数据资源整合，推动高质量生存预后分析。

Cell. 2018 Apr 5;173(2):400-416.e11. doi: 10.1016/j.cell.2018.02.052.

Progress in the Management of Advanced Thoracic Malignancies in 2017.2017 年晚期胸部恶性肿瘤的治疗进展。

J Thorac Oncol. 2018 Mar;13(3):301-322. doi: 10.1016/j.jtho.2018.01.002. Epub 2018 Jan 11.

LncRNA Expression Signature in Prediction of the Prognosis of Lung Adenocarcinoma.预测肺腺癌预后的长链非编码RNA表达特征

Genet Test Mol Biomarkers. 2018 Jan;22(1):20-28. doi: 10.1089/gtmb.2017.0194. Epub 2018 Jan 3.

PEBP1 Wardens Ferroptosis by Enabling Lipoxygenase Generation of Lipid Death Signals.PEBP1通过促进脂氧合酶产生脂质死亡信号来调控铁死亡。

Cell. 2017 Oct 19;171(3):628-641.e26. doi: 10.1016/j.cell.2017.09.044.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一项通过机器学习策略确定肺腺癌新预后预测模型的大型队列研究。

A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献