• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于生物信息学分析和机器学习的结直肠癌潜在生物标志物的鉴定。

Identification of potential biomarkers with colorectal cancer based on bioinformatics analysis and machine learning.

机构信息

Department of Biochemistry and Department of Thoracic Surgery of the First Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou 310003, China.

Radiation Biology Department, National Center for Radiation Research and Technology, Egyptian Atomic Energy Authority, Cairo 13759, Egypt.

出版信息

Math Biosci Eng. 2021 Oct 19;18(6):8997-9015. doi: 10.3934/mbe.2021443.

DOI:10.3934/mbe.2021443
PMID:34814332
Abstract

Colorectal cancer (CRC) is one of the most common malignancies worldwide. Biomarker discovery is critical to improve CRC diagnosis, however, machine learning offers a new platform to study the etiology of CRC for this purpose. Therefore, the current study aimed to perform an integrated bioinformatics and machine learning analyses to explore novel biomarkers for CRC prognosis. In this study, we acquired gene expression microarray data from Gene Expression Omnibus (GEO) database. The microarray expressions GSE103512 dataset was downloaded and integrated. Subsequently, differentially expressed genes (DEGs) were identified and functionally analyzed via Gene Ontology (GO) and Kyoto Enrichment of Genes and Genomes (KEGG). Furthermore, protein protein interaction (PPI) network analysis was conducted using the STRING database and Cytoscape software to identify hub genes; however, the hub genes were subjected to Support Vector Machine (SVM), Receiver operating characteristic curve (ROC) and survival analyses to explore their diagnostic values. Meanwhile, TCGA transcriptomics data in Gene Expression Profiling Interactive Analysis (GEPIA) database and the pathology data presented by in the human protein atlas (HPA) database were used to verify our transcriptomic analyses. A total of 105 DEGs were identified in this study. Functional enrichment analysis showed that these genes were significantly enriched in biological processes related to cancer progression. Thereafter, PPI network explored a total of 10 significant hub genes. The ROC curve was used to predict the potential application of biomarkers in CRC diagnosis, with an area under ROC curve (AUC) of these genes exceeding 0.92 suggesting that this risk classifier can discriminate between CRC patients and normal controls. Moreover, the prognostic values of these hub genes were confirmed by survival analyses using different CRC patient cohorts. Our results demonstrated that these 10 differentially expressed hub genes could be used as potential biomarkers for CRC diagnosis.

摘要

结直肠癌(CRC)是全球最常见的恶性肿瘤之一。生物标志物的发现对于改善 CRC 的诊断至关重要,然而,机器学习为此提供了一个研究 CRC 病因的新平台。因此,本研究旨在进行综合的生物信息学和机器学习分析,以探索 CRC 预后的新生物标志物。在这项研究中,我们从基因表达综合数据库(GEO)数据库中获取了基因表达微阵列数据。下载并整合了 GSE103512 数据集的微阵列表达数据。随后,通过基因本体论(GO)和京都基因与基因组百科全书(KEGG)进行了差异表达基因(DEGs)的鉴定和功能分析。此外,使用 STRING 数据库和 Cytoscape 软件进行蛋白质-蛋白质相互作用(PPI)网络分析,以识别枢纽基因;然而,这些枢纽基因还经过支持向量机(SVM)、接收器工作特征曲线(ROC)和生存分析,以探讨其诊断价值。同时,我们使用基因表达谱交互式分析(GEPIA)数据库中的 TCGA 转录组学数据和人类蛋白质图谱(HPA)数据库中的病理学数据来验证我们的转录组学分析。本研究共鉴定出 105 个 DEGs。功能富集分析表明,这些基因在与癌症进展相关的生物学过程中显著富集。此后,PPI 网络共探索了 10 个重要的枢纽基因。ROC 曲线用于预测生物标志物在 CRC 诊断中的潜在应用,这些基因的 ROC 曲线下面积(AUC)超过 0.92,表明该风险分类器可以区分 CRC 患者和正常对照。此外,通过使用不同的 CRC 患者队列进行生存分析,验证了这些枢纽基因的预后价值。我们的结果表明,这 10 个差异表达的枢纽基因可作为 CRC 诊断的潜在生物标志物。

相似文献

1
Identification of potential biomarkers with colorectal cancer based on bioinformatics analysis and machine learning.基于生物信息学分析和机器学习的结直肠癌潜在生物标志物的鉴定。
Math Biosci Eng. 2021 Oct 19;18(6):8997-9015. doi: 10.3934/mbe.2021443.
2
Identification of biomarkers associated with diagnosis and prognosis of colorectal cancer patients based on integrated bioinformatics analysis.基于整合生物信息学分析鉴定与结直肠癌患者诊断和预后相关的生物标志物。
Gene. 2019 Apr 15;692:119-125. doi: 10.1016/j.gene.2019.01.001. Epub 2019 Jan 14.
3
Employing bioinformatics analysis to identify hub genes and microRNAs involved in colorectal cancer.运用生物信息学分析鉴定结直肠癌相关的枢纽基因和 microRNAs。
Med Oncol. 2021 Aug 14;38(9):114. doi: 10.1007/s12032-021-01543-5.
4
Identification of candidate biomarkers and therapeutic drugs of colorectal cancer by integrated bioinformatics analysis.通过整合生物信息学分析鉴定结直肠癌的候选生物标志物和治疗药物。
Med Oncol. 2020 Oct 19;37(11):104. doi: 10.1007/s12032-020-01425-2.
5
The identification of a common different gene expression signature in patients with colorectal cancer.在结直肠癌患者中鉴定共同的不同基因表达特征。
Math Biosci Eng. 2019 Apr 10;16(4):2942-2958. doi: 10.3934/mbe.2019145.
6
Identification of novel biomarkers and small molecule drugs in human colorectal cancer by microarray and bioinformatics analysis.通过微阵列和生物信息学分析鉴定人类结直肠癌的新型生物标志物和小分子药物。
Mol Genet Genomic Med. 2019 Jul;7(7):e00713. doi: 10.1002/mgg3.713. Epub 2019 May 13.
7
Identification and clinicopathological analysis of potential p73-regulated biomarkers in colorectal cancer via integrative bioinformatics.基于整合生物信息学的方法鉴定和临床病理分析结直肠癌中潜在的 p73 调控生物标志物
Sci Rep. 2024 Apr 30;14(1):9894. doi: 10.1038/s41598-024-60715-1.
8
Identifying the key genes and microRNAs in colorectal cancer liver metastasis by bioinformatics analysis and in vitro experiments.通过生物信息学分析和体外实验鉴定结直肠癌肝转移的关键基因和 microRNAs。
Oncol Rep. 2019 Jan;41(1):279-291. doi: 10.3892/or.2018.6840. Epub 2018 Nov 1.
9
An integrative bioinformatics analysis of microarray data for identifying hub genes as diagnostic biomarkers of preeclampsia.基于基因芯片数据的综合生物信息学分析,以识别先兆子痫的诊断生物标志物的枢纽基因。
Biosci Rep. 2019 Sep 3;39(9). doi: 10.1042/BSR20190187. Print 2019 Sep 30.
10
Bioinformatic Identification of Hub Genes and Analysis of Prognostic Values in Colorectal Cancer.生物信息学鉴定结直肠癌的枢纽基因并分析其预后价值。
Nutr Cancer. 2021;73(11-12):2568-2578. doi: 10.1080/01635581.2020.1841249. Epub 2020 Nov 5.

引用本文的文献

1
Integrating machine learning and genetic evidence to uncover novel gene biomarkers for colorectal cancer diagnosis.整合机器学习和遗传证据以发现用于结直肠癌诊断的新型基因生物标志物。
Discov Oncol. 2025 May 6;16(1):675. doi: 10.1007/s12672-025-02435-0.
2
Identification of MAD2L1 as a novel biomarker for hepatoblastoma through bioinformatics and machine learning approaches.通过生物信息学和机器学习方法鉴定MAD2L1作为肝母细胞瘤的新型生物标志物。
Front Oncol. 2025 Mar 31;15:1524714. doi: 10.3389/fonc.2025.1524714. eCollection 2025.
3
Identification of potential biomarkers for lung cancer using integrated bioinformatics and machine learning approaches.
使用综合生物信息学和机器学习方法鉴定肺癌的潜在生物标志物。
PLoS One. 2025 Feb 27;20(2):e0317296. doi: 10.1371/journal.pone.0317296. eCollection 2025.
4
EGFR-Targeted Therapies: A Literature Review.表皮生长因子受体靶向治疗:文献综述
J Clin Med. 2024 Oct 25;13(21):6391. doi: 10.3390/jcm13216391.
5
Molecular biomarkers involved in the progression of gallbladder inflammatory lesions to invasive cancer: A proteomic approach.参与胆囊炎性病变进展为浸润性癌过程的分子生物标志物:蛋白质组学方法。
Biomol Biomed. 2024 Dec 11;25(1):115-143. doi: 10.17305/bb.2024.10704.
6
Classification of Long Non-Coding RNAs s Between Early and Late Stage of Liver Cancers From Non-coding RNA Profiles Using Machine-Learning Approach.使用机器学习方法从非编码RNA谱分类肝癌早晚期的长链非编码RNA
Bioinform Biol Insights. 2024 Jun 5;18:11779322241258586. doi: 10.1177/11779322241258586. eCollection 2024.
7
Identification of SV2C and DENR as Key Biomarkers for Parkinson's Disease Based on Bioinformatics, Machine Learning, and Experimental Verification.基于生物信息学、机器学习和实验验证鉴定 SV2C 和 DENR 为帕金森病的关键生物标志物。
J Mol Neurosci. 2024 Jan 8;74(1):6. doi: 10.1007/s12031-023-02182-3.
8
Identification of Differentially Expressed Genes in Human Colorectal Cancer Using RNASeq Data Validated on the Molecular Level with Real-Time PCR.利用实时 PCR 在分子水平上验证的 RNAseq 数据鉴定人类结直肠癌中的差异表达基因。
Biochem Genet. 2024 Aug;62(4):3260-3284. doi: 10.1007/s10528-023-10593-5. Epub 2023 Dec 14.
9
The Prognostic Value of and in Colorectal Cancer: A Machine Learning-Based Integrated Bioinformatics Approach.[具体指标]与[具体指标]在结直肠癌中的预后价值:基于机器学习的综合生物信息学方法
Cancers (Basel). 2023 Aug 28;15(17):4300. doi: 10.3390/cancers15174300.
10
Advances in Genomic Data and Biomarkers: Revolutionizing NSCLC Diagnosis and Treatment.基因组数据与生物标志物的进展:革新非小细胞肺癌的诊断与治疗
Cancers (Basel). 2023 Jul 3;15(13):3474. doi: 10.3390/cancers15133474.