Department of Biotechnology, Motilal Nehru National Institute of Technology Allahabad, Prayagraj, 211004, India.
National Institute of Animal Biotechnology, Hyderabad, 500032, India.
Sci Rep. 2023 Apr 19;13(1):6413. doi: 10.1038/s41598-023-33327-4.
Colorectal cancer (CRC) is the third most prevalent cancer type and accounts for nearly one million deaths worldwide. The CRC mRNA gene expression datasets from TCGA and GEO (GSE144259, GSE50760, and GSE87096) were analyzed to find the significant differentially expressed genes (DEGs). These significant genes were further processed for feature selection through boruta and the confirmed features of importance (genes) were subsequently used for ML-based prognostic classification model development. These genes were analyzed for survival and correlation analysis between final genes and infiltrated immunocytes. A total of 770 CRC samples were included having 78 normal and 692 tumor tissue samples. 170 significant DEGs were identified after DESeq2 analysis along with the topconfects R package. The 33 confirmed features of importance-based RF prognostic classification model have given accuracy, precision, recall, and f1-score of 100% with 0% standard deviation. The overall survival analysis had finalized GLP2R and VSTM2A genes that were significantly downregulated in tumor samples and had a strong correlation with immunocyte infiltration. The involvement of these genes in CRC prognosis was further confirmed on the basis of their biological function and literature analysis. The current findings indicate that GLP2R and VSTM2A may play a significant role in CRC progression and immune response suppression.
结直肠癌(CRC)是第三大常见癌症类型,占全球近 100 万人死亡。分析了来自 TCGA 和 GEO(GSE144259、GSE50760 和 GSE87096)的 CRC mRNA 基因表达数据集,以找到显著差异表达的基因(DEGs)。通过 boruta 对这些显著基因进行进一步的特征选择处理,随后使用基于 ML 的预后分类模型开发来确认重要的特征(基因)。对这些基因进行生存分析,并对最终基因与浸润免疫细胞之间的相关性进行分析。共纳入 770 例 CRC 样本,其中 78 例为正常组织,692 例为肿瘤组织。经过 DESeq2 分析和 topconfects R 包,共鉴定出 170 个显著的 DEGs。基于 RF 预后分类模型的 33 个确认重要特征的准确性、精度、召回率和 F1 得分为 100%,标准偏差为 0%。总体生存分析确定了 GLP2R 和 VSTM2A 基因,这些基因在肿瘤样本中显著下调,与免疫细胞浸润有很强的相关性。根据这些基因的生物学功能和文献分析,进一步证实了它们在 CRC 预后中的作用。目前的研究结果表明,GLP2R 和 VSTM2A 可能在 CRC 的进展和免疫反应抑制中发挥重要作用。