Suppr超能文献

机器学习模型在肺腺癌的诊断和预后中的开发和验证,以及免疫浸润分析。

Development and validation of machine learning models for diagnosis and prognosis of lung adenocarcinoma, and immune infiltration analysis.

机构信息

Department of Respiratory and Critical Care Medicine, The Second Affiliated Hospital of Harbin Medical University, No. 148, Health Care Road, Nangang District, Harbin, 150086, Heilongjiang, People's Republic of China.

出版信息

Sci Rep. 2024 Sep 27;14(1):22081. doi: 10.1038/s41598-024-73498-2.

Abstract

The aim of our study was to develop robust diagnostic and prognostic models for lung adenocarcinoma (LUAD) using machine learning (ML) techniques, focusing on early immune infiltration. Feature selection was performed on The Cancer Genome Atlas (TCGA) data using least absolute shrinkage and selection Operator (LASSO), random forest (RF), and support vector machine (SVM) algorithms. Six ML algorithms were employed to construct the diagnostic models, which were evaluated through receiver operating characteristic (ROC) curves, precision-recall curves (PRC), and classification error (CE), and validated on the GSE7670 dataset. Additionally, a lasso cox prognostic model was built on the TCGA-LUAD dataset and externally validated using independent Gene Expression Omnibus datasets (GSE30219, GSE31210, GSE50081, and GSE37745). Single-sample gene set enrichment analysis (ssGSEA) was performed to assess immune cell infiltration in stage I LUAD samples, revealing significant differences in immune cell types. These findings demonstrate a positive correlation between immune infiltration in stage I LUAD and Th2 cells, Tcm cells, and T helper cells, while a negative correlation was observed with Macrophages, Eosinophils, and Tem cells. These insights provide novel perspectives for clinical diagnosis and treatment of LUAD.

摘要

本研究旨在利用机器学习 (ML) 技术开发针对肺腺癌 (LUAD) 的稳健诊断和预后模型,重点关注早期免疫浸润。使用最小绝对收缩和选择算子 (LASSO)、随机森林 (RF) 和支持向量机 (SVM) 算法对癌症基因组图谱 (TCGA) 数据进行特征选择。使用六种 ML 算法构建诊断模型,并通过接收者操作特征 (ROC) 曲线、精度-召回曲线 (PRC) 和分类误差 (CE) 进行评估,并在 GSE7670 数据集上进行验证。此外,在 TCGA-LUAD 数据集上构建了 LASSO COX 预后模型,并使用独立的基因表达综合数据集 (GSE30219、GSE31210、GSE50081 和 GSE37745) 进行外部验证。进行了单样本基因集富集分析 (ssGSEA) 以评估 I 期 LUAD 样本中的免疫细胞浸润,揭示了免疫细胞类型的显著差异。这些发现表明,I 期 LUAD 中的免疫浸润与 Th2 细胞、Tcm 细胞和 T 辅助细胞呈正相关,而与巨噬细胞、嗜酸性粒细胞和 Tem 细胞呈负相关。这些研究结果为 LUAD 的临床诊断和治疗提供了新的视角。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/095a/11437281/c1c2fcdb3f1e/41598_2024_73498_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验