Suppr超能文献

基于LASSO和支持向量机机器学习方法的结直肠癌诊断基因及免疫浸润分析:一项生物信息学分析

Diagnostic genes and immune infiltration analysis of colorectal cancer determined by LASSO and SVM machine learning methods: a bioinformatics analysis.

作者信息

Li Yan-Rong, Meng Ke, Yang Guang, Liu Bao-Hai, Li Chu-Qiao, Zhang Jia-Yuan, Zhang Xiao-Mei

机构信息

Department of Gastroenterology, The First Affiliated Hospital of Jinzhou Medical University, Jinzhou, China.

Department of Gastroenterology and Hepatology, The First Medical Center, Chinese PLA General Hospital, Beijing, China.

出版信息

J Gastrointest Oncol. 2022 Jun;13(3):1188-1203. doi: 10.21037/jgo-22-536.

Abstract

BACKGROUND

Genetic factors account for approximately 35% of colorectal cancer risk. The specificity and sensitivity of previous diagnostic biomarkers for colorectal cancer could not meet the need of clinical application. The expanding scale and inherent complexity of biological data have encouraged a growing use of machine learning to build informative and predictive models of the underlying biological processes. The aim of this study is to identify diagnostic genes of colorectal cancer by using machine learning methods.

METHODS

The GSE41328 and GSE106582 data sets were downloaded from the Gene Expression Omnibus (GEO) database. The gene expression differences between colon cancer and normal tissues were analyzed. The key colorectal cancer genes were screened and validated by Least Absolute Shrinkage and Selection Operator (LASSO) and Support Vector Machine (SVM) regression. Immune cell infiltration and the correlation with the key genes in patients with colon cancer were further analyzed by CIBERSORT.

RESULTS

Eleven key genes were identified as biomarkers for colon cancer, namely and . The mean area under the receiver operating characteristic (ROC) curve (AUC) of all 11 genes for colon cancer diagnosis were 0.94 with a range of 0.91-0.97. In the validation set, the expression of the 11 key genes was significantly different between colon cancer and normal subjects (P<0.05) and the mean AUCs were 0.82 with a range of 0.70-0.88. Immune cell infiltration analyses demonstrated that the relative quantity of plasma cells, T cells, B cells, NK cells, MO, M1, Dendritic cells resting, Mast cells resting, Mast cells activated, and Neutrophils in the tumor group were significantly different to the normal group.

CONCLUSIONS

, and were identified as the key genes for colon cancer diagnosis. These genes are expected to become novel diagnostic markers and targets of new pharmacotherapies for colorectal cancer.

摘要

背景

遗传因素约占结直肠癌风险的35%。先前用于结直肠癌诊断的生物标志物的特异性和敏感性无法满足临床应用需求。生物数据规模的不断扩大及其内在复杂性促使机器学习在构建潜在生物过程的信息性和预测性模型方面的应用日益增加。本研究的目的是使用机器学习方法鉴定结直肠癌的诊断基因。

方法

从基因表达综合数据库(GEO)下载GSE41328和GSE106582数据集。分析结肠癌组织与正常组织之间的基因表达差异。通过最小绝对收缩和选择算子(LASSO)和支持向量机(SVM)回归筛选并验证关键的结直肠癌基因。通过CIBERSORT进一步分析结肠癌患者的免疫细胞浸润情况以及与关键基因的相关性。

结果

鉴定出11个关键基因作为结肠癌的生物标志物,即 和 。用于结肠癌诊断的所有11个基因的受试者操作特征曲线(ROC)下的平均面积(AUC)为0.94,范围为0.91 - 0.97。在验证集中,结肠癌患者与正常受试者之间11个关键基因的表达存在显著差异(P<0.05),平均AUC为0.82,范围为0.70 - 0.88。免疫细胞浸润分析表明,肿瘤组中浆细胞、T细胞、B细胞、NK细胞、单核细胞、M1、静息树突状细胞、静息肥大细胞、活化肥大细胞和中性粒细胞的相对数量与正常组有显著差异。

结论

和 被鉴定为结肠癌诊断的关键基因。这些基因有望成为结直肠癌新的诊断标志物和新药物治疗的靶点。

相似文献

7
Five genes as diagnostic biomarkers of dermatomyositis and their correlation with immune cell infiltration.
Front Immunol. 2023 Jan 18;14:1053099. doi: 10.3389/fimmu.2023.1053099. eCollection 2023.
9
Predicting potential biomarkers and immune infiltration characteristics in heart failure.
Math Biosci Eng. 2022 Jun 16;19(9):8671-8688. doi: 10.3934/mbe.2022402.

引用本文的文献

1
Machine learning-driven multi-targeted drug discovery in colon cancer using biomarker signatures.
NPJ Precis Oncol. 2025 Aug 22;9(1):297. doi: 10.1038/s41698-025-01058-6.
4
NFKB1 as a key player in Tumor biology: from mechanisms to therapeutic implications.
Cell Biol Toxicol. 2025 Jan 11;41(1):29. doi: 10.1007/s10565-024-09974-2.
8
Incorporating machine learning and PPI networks to identify mitochondrial fission-related immune markers in abdominal aortic aneurysms.
Heliyon. 2024 Mar 26;10(7):e27989. doi: 10.1016/j.heliyon.2024.e27989. eCollection 2024 Apr 15.
9
TRIB3, as a robust prognostic biomarker for HNSC, is associated with poor immune infiltration and cancer cell immune evasion.
Front Immunol. 2024 Jan 3;14:1290839. doi: 10.3389/fimmu.2023.1290839. eCollection 2023.

本文引用的文献

1
Colon cancer diagnosis and staging classification based on machine learning and bioinformatics analysis.
Comput Biol Med. 2022 Jun;145:105409. doi: 10.1016/j.compbiomed.2022.105409. Epub 2022 Mar 19.
2
Improved Wound Healing by Naringin Associated with MMP and the VEGF Pathway.
Molecules. 2022 Mar 4;27(5):1695. doi: 10.3390/molecules27051695.
4
The trouble with TRIBbles: TRIB3 blocks CD8 T cell homing to colorectal cancers.
Sci Immunol. 2022 Feb 4;7(68):eabo2990. doi: 10.1126/sciimmunol.abo2990.
5
Oncogenic potential of BEST4 in colorectal cancer via activation of PI3K/Akt signaling.
Oncogene. 2022 Feb;41(8):1166-1177. doi: 10.1038/s41388-021-02160-2. Epub 2022 Jan 21.
6
The Involvement of TRIB3 and FABP1 and Their Potential Functions in the Dynamic Process of Gastric Cancer.
Front Mol Biosci. 2021 Dec 9;8:790433. doi: 10.3389/fmolb.2021.790433. eCollection 2021.
7
A guide to machine learning for biologists.
Nat Rev Mol Cell Biol. 2022 Jan;23(1):40-55. doi: 10.1038/s41580-021-00407-0. Epub 2021 Sep 13.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验