基于生物信息学分析和有监督学习方法鉴定胰腺癌诊断价值的关键基因。

Identification of hub genes with diagnostic values in pancreatic cancer by bioinformatics analyses and supervised learning methods.

机构信息

West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, China.

Medical Big Data Center, Sichuan University, Chengdu, China.

出版信息

World J Surg Oncol. 2018 Nov 14;16(1):223. doi: 10.1186/s12957-018-1519-y.

DOI:10.1186/s12957-018-1519-y

PMID:30428899

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6237021/

Abstract

BACKGROUND

Pancreatic cancer is one of the most lethal tumors with poor prognosis, and lacks of effective biomarkers in diagnosis and treatment. The aim of this investigation was to identify hub genes in pancreatic cancer, which would serve as potential biomarkers for cancer diagnosis and therapy in the future.

METHODS

Combination of two expression profiles of GSE16515 and GSE22780 from Gene Expression Omnibus (GEO) database was served as training set. Differentially expressed genes (DEGs) with top 25% variance followed by protein-protein interaction (PPI) network were performed to find candidate genes. Then, hub genes were further screened by survival and cox analyses in The Cancer Genome Atlas (TCGA) database. Finally, hub genes were validated in GSE15471 dataset from GEO by supervised learning methods k-nearest neighbor (kNN) and random forest algorithms.

RESULTS

After quality control and batch effect elimination of training set, 181 DEGs bearing top 25% variance were identified as candidate genes. Then, two hub genes, MMP7 and ITGA2, correlating with diagnosis and prognosis of pancreatic cancer were screened as hub genes according to above-mentioned bioinformatics methods. Finally, hub genes were demonstrated to successfully differ tumor samples from normal tissues with predictive accuracies reached to 93.59 and 81.31% by using kNN and random forest algorithms, respectively.

CONCLUSIONS

All the hub genes were associated with the regulation of tumor microenvironment, which implicated in tumor proliferation, progression, migration, and metastasis. Our results provide a novel prospect for diagnosis and treatment of pancreatic cancer, which may have a further application in clinical.

摘要

背景

胰腺癌是预后最差的致命肿瘤之一，在诊断和治疗方面缺乏有效的生物标志物。本研究旨在鉴定胰腺癌中的枢纽基因，这些基因将成为未来癌症诊断和治疗的潜在生物标志物。

方法

将基因表达综合数据库（GEO）中的 GSE16515 和 GSE22780 两个表达谱组合作为训练集。通过差异表达基因（DEGs）和蛋白质-蛋白质相互作用（PPI）网络，筛选前 25%变异的候选基因。然后，通过癌症基因组图谱（TCGA）数据库的生存和 COX 分析进一步筛选枢纽基因。最后，通过监督学习方法 k-最近邻（kNN）和随机森林算法在 GEO 的 GSE15471 数据集验证枢纽基因。

结果

经过训练集的质量控制和批次效应消除后，确定了 181 个具有前 25%变异的 DEGs 作为候选基因。然后，根据上述生物信息学方法，筛选出与胰腺癌诊断和预后相关的两个枢纽基因 MMP7 和 ITGA2。最后，使用 kNN 和随机森林算法，枢纽基因成功区分肿瘤样本和正常组织，预测准确率分别达到 93.59%和 81.31%。

结论

所有的枢纽基因都与肿瘤微环境的调节有关，参与肿瘤的增殖、进展、迁移和转移。我们的研究结果为胰腺癌的诊断和治疗提供了新的前景，可能在临床应用中有进一步的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8cbf/6237021/b43f2b8ea7ad/12957_2018_1519_Fig1_HTML.jpg

相似文献

Identification of hub genes with diagnostic values in pancreatic cancer by bioinformatics analyses and supervised learning methods.基于生物信息学分析和有监督学习方法鉴定胰腺癌诊断价值的关键基因。

World J Surg Oncol. 2018 Nov 14;16(1):223. doi: 10.1186/s12957-018-1519-y.

Identification of hub genes and regulators associated with pancreatic ductal adenocarcinoma based on integrated gene expression profile analysis.基于综合基因表达谱分析鉴定与胰腺导管腺癌相关的枢纽基因和调控因子

Discov Med. 2019 Sep;28(153):159-172.

Identification of hub genes and analysis of prognostic values in pancreatic ductal adenocarcinoma by integrated bioinformatics methods.通过综合生物信息学方法鉴定胰腺导管腺癌中的枢纽基因并分析其预后价值

Mol Biol Rep. 2018 Dec;45(6):1799-1807. doi: 10.1007/s11033-018-4325-2. Epub 2018 Sep 1.

Identification of biomarkers associated with diagnosis and prognosis of colorectal cancer patients based on integrated bioinformatics analysis.基于整合生物信息学分析鉴定与结直肠癌患者诊断和预后相关的生物标志物。

Gene. 2019 Apr 15;692:119-125. doi: 10.1016/j.gene.2019.01.001. Epub 2019 Jan 14.

Identification of novel genes associated with a poor prognosis in pancreatic ductal adenocarcinoma via a bioinformatics analysis.通过生物信息学分析鉴定与胰腺导管腺癌预后不良相关的新基因。

Biosci Rep. 2019 Aug 2;39(8). doi: 10.1042/BSR20190625. Print 2019 Aug 30.

An Integrated Microarray Analysis Reveals Significant Diagnostic and Prognostic Biomarkers in Pancreatic Cancer.集成微阵列分析揭示胰腺癌有显著的诊断和预后生物标志物。

Med Sci Monit. 2020 Apr 1;26:e921769. doi: 10.12659/MSM.921769.

, and as Novel Panel for Pancreatic Cancer: A Bioinformatics Analysis and Experiments Validation.载脂蛋白 A1 作为胰腺癌的新型标志物：生物信息学分析和实验验证。

Front Immunol. 2021 Mar 18;12:649551. doi: 10.3389/fimmu.2021.649551. eCollection 2021.

Identification of differentially expressed genes in pancreatic ductal adenocarcinoma and normal pancreatic tissues based on microarray datasets.基于基因芯片数据集鉴定胰腺导管腺癌与正常胰腺组织中的差异表达基因。

Mol Med Rep. 2019 Aug;20(2):1901-1914. doi: 10.3892/mmr.2019.10414. Epub 2019 Jun 24.

Screening and Validation of Independent Predictors of Poor Survival in Pancreatic Cancer.胰腺癌患者生存预后不良的独立预测因素的筛选与验证

Pathol Oncol Res. 2021 Jul 12;27:1609868. doi: 10.3389/pore.2021.1609868. eCollection 2021.

Screening and identification of hub genes in pancreatic cancer by integrated bioinformatics analysis.通过综合生物信息学分析筛选和鉴定胰腺癌的枢纽基因。

J Cell Biochem. 2019 Dec;120(12):19496-19508. doi: 10.1002/jcb.29253. Epub 2019 Jul 11.

引用本文的文献

Identification of key hub genes in pancreatic ductal adenocarcinoma: an integrative bioinformatics study.胰腺导管腺癌关键枢纽基因的鉴定：一项整合生物信息学研究

Front Bioinform. 2025 Mar 28;5:1536783. doi: 10.3389/fbinf.2025.1536783. eCollection 2025.

Effect of left colonic artery preservation on perfusion at the anastomosis in rectal cancer surgery evaluated with intraoperative ultrasound.术中超声评估左结肠动脉保留对直肠癌手术吻合口灌注的影响。

Tech Coloproctol. 2024 Nov 12;28(1):157. doi: 10.1007/s10151-024-03037-8.

Low ligation of the inferior mesenteric artery in robotic mid-low rectal cancer surgery: a comparative study from a single-center.机器人辅助中低位直肠癌手术中肠系膜下动脉低位结扎：单中心的对比研究。

J Robot Surg. 2024 Aug 21;18(1):325. doi: 10.1007/s11701-024-02080-9.

Systems biology approach: identification of hub genes, signaling pathways, and molecular docking of COL1A1 gene in cervical insufficiency.系统生物学方法：宫颈机能不全中枢纽基因的鉴定、信号通路及COL1A1基因的分子对接

In Silico Pharmacol. 2024 May 14;12(1):45. doi: 10.1007/s40203-024-00218-z. eCollection 2024.

Evaluation of penalized and machine learning methods for asthma disease prediction in the Korean Genome and Epidemiology Study (KoGES).评估惩罚和机器学习方法在韩国基因组与流行病学研究（KoGES）中对哮喘病的预测作用。

BMC Bioinformatics. 2024 Feb 2;25(1):56. doi: 10.1186/s12859-024-05677-x.

High tie sigmoidectomy syndrome.高位乙状结肠切除术综合征

Tech Coloproctol. 2023 Dec;27(12):1409-1410. doi: 10.1007/s10151-023-02864-5. Epub 2023 Oct 6.

Multiple-model machine learning identifies potential functional genes in dilated cardiomyopathy.多模型机器学习识别扩张型心肌病中的潜在功能基因。

Front Cardiovasc Med. 2023 Jan 11;9:1044443. doi: 10.3389/fcvm.2022.1044443. eCollection 2022.

Molecular Markers of Pancreatic Cancer: A 10-Year Retrospective Review of Molecular Advances.胰腺癌的分子标志物：分子进展的10年回顾性综述

Cureus. 2022 Sep 23;14(9):e29485. doi: 10.7759/cureus.29485. eCollection 2022 Sep.

Early Detection of Pancreatic Cancers Using Liquid Biopsies and Hierarchical Decision Structure.使用液体活检和层次决策结构早期检测胰腺癌。

IEEE J Transl Eng Health Med. 2022 Jun 27;10:4300208. doi: 10.1109/JTEHM.2022.3186836. eCollection 2022.

Identification and prognostic analysis of biomarkers to predict the progression of pancreatic cancer patients.预测胰腺癌患者病情进展的生物标志物的鉴定与预后分析

Mol Med. 2022 Apr 15;28(1):43. doi: 10.1186/s10020-022-00467-8.

本文引用的文献

Association of hOGG1 Ser326Cys, ITGA2 C807T, TNF-A -308G>A and XPD Lys751Gln polymorphisms with the survival of Malaysian NPC patients.HOGG1 Ser326Cys、ITGA2 C807T、TNF-A -308G>A 和 XPD Lys751Gln 多态性与马来西亚 NPC 患者生存的关联。

PLoS One. 2018 Jun 18;13(6):e0198332. doi: 10.1371/journal.pone.0198332. eCollection 2018.

Blockade of ITGA2 Induces Apoptosis and Inhibits Cell Migration in Gastric Cancer.整合素α2（ITGA2）阻断可诱导胃癌细胞凋亡并抑制其迁移。

Biol Proced Online. 2018 May 1;20:10. doi: 10.1186/s12575-018-0073-x. eCollection 2018.

Yes-associated protein (YAP) in pancreatic cancer: at the epicenter of a targetable signaling network associated with patient survival.Yes 相关蛋白（YAP）在胰腺癌中的作用：位于与患者生存相关的可靶向信号网络的中心。

Signal Transduct Target Ther. 2018 Apr 20;3:11. doi: 10.1038/s41392-017-0005-2. eCollection 2018.

Survival of pancreatic cancer cells lacking KRAS function.缺乏KRAS功能的胰腺癌细胞的存活情况。

Nat Commun. 2017 Oct 23;8(1):1090. doi: 10.1038/s41467-017-00942-5.

Multiplex detection of pancreatic cancer biomarkers using a SERS-based immunoassay.基于 SERS 的免疫分析用于胰腺癌生物标志物的多重检测。

Nanotechnology. 2017 Nov 10;28(45):455101. doi: 10.1088/1361-6528/aa8e8c.

CCR10 activation stimulates the invasion and migration of breast cancer cells through the ERK1/2/MMP-7 signaling pathway.CCR10 激活通过 ERK1/2/MMP-7 信号通路刺激乳腺癌细胞的侵袭和迁移。

Int Immunopharmacol. 2017 Oct;51:124-130. doi: 10.1016/j.intimp.2017.07.018. Epub 2017 Aug 19.

Syndecan-2 cytoplasmic domain up-regulates matrix metalloproteinase-7 expression via the protein kinase Cγ-mediated FAK/ERK signaling pathway in colon cancer.Syndecan-2胞质结构域通过蛋白激酶Cγ介导的FAK/ERK信号通路上调结肠癌中基质金属蛋白酶-7的表达。

J Biol Chem. 2017 Sep 29;292(39):16321-16332. doi: 10.1074/jbc.M117.793752. Epub 2017 Aug 16.

Immuno-proteomic discovery of tumor tissue autoantigens identifies olfactomedin 4, CD11b, and integrin alpha-2 as markers of colorectal cancer with liver metastases.免疫蛋白质组学发现肿瘤组织自身抗原，鉴定出嗅觉素 4、CD11b 和整合素 α2 作为结直肠癌肝转移的标志物。

J Proteomics. 2017 Sep 25;168:53-65. doi: 10.1016/j.jprot.2017.06.021. Epub 2017 Jun 29.

MMP-10, MMP-7, TIMP-1 and TIMP-2 mRNA expression in esophageal cancer.食管癌中基质金属蛋白酶-10、基质金属蛋白酶-7、金属蛋白酶组织抑制因子-1和金属蛋白酶组织抑制因子-2的信使核糖核酸表达

Acta Biochim Pol. 2017;64(2):295-299. doi: 10.18388/abp.2016_1408. Epub 2017 May 15.

Overexpression of G protein-coupled receptor GPR87 promotes pancreatic cancer aggressiveness and activates NF-κB signaling pathway.G蛋白偶联受体GPR87的过表达促进胰腺癌的侵袭性并激活核因子κB信号通路。

Mol Cancer. 2017 Mar 14;16(1):61. doi: 10.1186/s12943-017-0627-6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于生物信息学分析和有监督学习方法鉴定胰腺癌诊断价值的关键基因。

Identification of hub genes with diagnostic values in pancreatic cancer by bioinformatics analyses and supervised learning methods.

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献