Suppr超能文献

基于机器学习的结直肠癌肝转移特异性基因的筛选与验证

Machine learning-based screening and validation of liver metastasis-specific genes in colorectal cancer.

作者信息

Zheng Shiyao, He Hongxin, Zheng Jianfeng, Zhu Xingshu, Lin Nan, Wu Qing, Wei Enhao, Weng Caiming, Chen Shuqian, Huang Xinxiang, Jian Chenxing, Guan Shen, Yang Chunkang

机构信息

Department of Gastrointestinal Surgical Oncology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, 350014, People's Republic of China.

Department of Gynecology, Clinical Oncology School of Fujian Medical University, Fujian Cancer Hospital, Fuzhou, 350014, People's Republic of China.

出版信息

Sci Rep. 2024 Jul 30;14(1):17679. doi: 10.1038/s41598-024-68706-y.

Abstract

Colorectal liver metastasis (CRLM) is challenging in the clinical treatment of colorectal cancer. Limited research has been conducted on how CRLM develops. RNA sequencing data were obtained from the Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA). Four machine learning algorithms were used to screen the hub CRLM-specific genes, including Least Absolute Shrinkage and Selection Operator (Lasso), Random forest, SVM-RFE, and XGboost. The model for identifying CRLM was developed using stepwise logistic regression and was validated using internal and independent datasets. The prognostic value of hub CRLM-specific genes was assessed using the Lasso-Cox method. The in vitro experiments were performed using SW620 cells. The CRLM identification model was developed based on four CRLM-specific genes (SPP1, ZG16, P2RY14, and PRKAR2B), and the model efficacy was validated using GSE41258 and three external cohorts. Five CRLM-specific prognostic hub genes, SPP1, ZG16, P2RY14, CYP2E1, and C5, were identified using the Lasso-Cox algorithm, and a risk score was constructed. The risk score was validated using the GSE39582 cohort. Three genes have both efficacy in identifying CRLM and prognostic value: ZG16, P2RY14, and SPP1. Immune infiltration and enrichment analyses demonstrated that SPP1 was associated with M2 macrophage polarization and extracellular matrix remodeling. In vitro experiments indicated that SPP1 may act as a cancer-promoting factor. The hub CRLM-specific gene SPP1 can help determine the diagnosis, prognosis, and immune infiltration of patients with CRLM.

摘要

结直肠癌肝转移(CRLM)在结直肠癌的临床治疗中具有挑战性。关于CRLM如何发展的研究有限。从基因表达综合数据库(GEO)和癌症基因组图谱(TCGA)获取了RNA测序数据。使用四种机器学习算法筛选CRLM特异性核心基因,包括最小绝对收缩和选择算子(Lasso)、随机森林、支持向量机递归特征消除(SVM-RFE)和极端梯度提升(XGboost)。使用逐步逻辑回归建立识别CRLM的模型,并使用内部和独立数据集进行验证。使用Lasso-Cox方法评估CRLM特异性核心基因的预后价值。使用SW620细胞进行体外实验。基于四个CRLM特异性基因(SPP1、ZG16、P2RY14和PRKAR2B)建立CRLM识别模型,并使用GSE41258和三个外部队列验证模型效能。使用Lasso-Cox算法识别出五个CRLM特异性预后核心基因SPP1、ZG16、P2RY14、CYP2E1和C5,并构建风险评分。使用GSE39582队列验证风险评分。三个基因在识别CRLM和预后价值方面均有效:ZG16、P2RY14和SPP1。免疫浸润和富集分析表明,SPP1与M2巨噬细胞极化和细胞外基质重塑相关。体外实验表明,SPP1可能作为一种促癌因子。CRLM特异性核心基因SPP1有助于确定CRLM患者的诊断、预后和免疫浸润情况。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/830b/11291988/d7386b1a3da1/41598_2024_68706_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验