Suppr超能文献

基于加权基因共表达网络分析(WGCNA)和多种机器学习算法筛选乳腺癌新型生物标志物

Screening of novel biomarkers for breast cancer based on WGCNA and multiple machine learning algorithms.

作者信息

Jin Xiaohu, Huang Zhiqi, Guo Peng, Yuan Ronghua

机构信息

Department of Thyroid and Breast Surgery, Nantong City No. 1 People's Hospital and Second Affiliated Hospital of Nantong University, Nantong, China.

Department of Gastrointestinal Surgery, Nantong City No. 1 People's Hospital and Second Affiliated Hospital of Nantong University, Nantong, China.

出版信息

Transl Cancer Res. 2023 Jun 30;12(6):1466-1489. doi: 10.21037/tcr-23-3. Epub 2023 Jun 20.

Abstract

BACKGROUND

Breast cancer (BC) ranks first in incidence among women, with approximately 2 million new cases per year. Therefore, it is essential to investigate emerging targets for BC patients' diagnosis and prognosis.

METHODS

We analyzed gene expression data from 99 normal and 1,081 BC tissues in The Cancer Genome Atlas (TCGA) database. Differentially expressed genes (DEGs) were identified using "limma" R package, and relevant modules were chosen through Weighted Gene Coexpression Network Analysis (WGCNA). Intersection genes were obtained by matching DEGs to WGCNA module genes. Functional enrichment studies were performed on these genes using Gene Ontology (GO), Disease Ontology (DO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Biomarkers were screened via Protein-Protein Interaction (PPI) networks and multiple machine-learning algorithms. The Gene Expression Profiling Interactive Analysis (GEPIA), The University of ALabama at Birmingham CANcer (UALCAN), and Human Protein Atlas (HPA) databases were employed to examine mRNA and protein expression of eight biomarkers. Kaplan-Meier mapper tool assessed their prognostic capabilities. Key biomarkers were analyzed via single-cell sequencing, and their relationship with immune infiltration was examined using Tumor Immune Estimation Resource (TIMER) database and "xCell" R package. Lastly, drug prediction was conducted based on the identified biomarkers.

RESULTS

We identified 1,673 DEGs and 542 important genes through differential analysis and WGCNA, respectively. Intersection analysis revealed 76 genes, which play significant roles in immune-related viral infection and IL-17 signaling pathways. DIX domain containing 1 (DIXDC1), Dual specificity phosphatase 6 (DUSP6), Pyruvate dehydrogenase kinase 4 (PDK4), C-X-C motif chemokine ligand 12 (CXCL12), Interferon regulatory factor 7 (IRF7), Integrin subunit alpha 7 (ITGA7), NIMA related kinase 2 (NEK2), and Nuclear receptor subfamily 3 group C member 1 (NR3C1) were selected as BC biomarkers using machine-learning algorithms. NEK2 was the most critical gene for diagnosis. Prospective drugs targeting NEK2 include etoposide and lukasunone.

CONCLUSIONS

Our study identified DIXDC1, DUSP6, PDK4, CXCL12, IRF7, ITGA7, NEK2, and NR3C1 as potential diagnostic biomarkers for BC, with NEK2 having the highest potential to aid in diagnosis and prognosis in clinical settings.

摘要

背景

乳腺癌(BC)的发病率在女性中位居首位,每年新增病例约200万例。因此,研究乳腺癌患者诊断和预后的新靶点至关重要。

方法

我们分析了癌症基因组图谱(TCGA)数据库中99个正常组织和1081个乳腺癌组织的基因表达数据。使用“limma”R包鉴定差异表达基因(DEG),并通过加权基因共表达网络分析(WGCNA)选择相关模块。通过将DEG与WGCNA模块基因匹配获得交集基因。使用基因本体论(GO)、疾病本体论(DO)和京都基因与基因组百科全书(KEGG)数据库对这些基因进行功能富集研究。通过蛋白质-蛋白质相互作用(PPI)网络和多种机器学习算法筛选生物标志物。利用基因表达谱交互式分析(GEPIA)、阿拉巴马大学伯明翰分校癌症(UALCAN)和人类蛋白质图谱(HPA)数据库检测8种生物标志物的mRNA和蛋白质表达。Kaplan-Meier映射工具评估了它们的预后能力。通过单细胞测序分析关键生物标志物,并使用肿瘤免疫估计资源(TIMER)数据库和“xCell”R包检查它们与免疫浸润的关系。最后,基于鉴定出的生物标志物进行药物预测。

结果

我们分别通过差异分析和WGCNA鉴定出1673个DEG和542个重要基因。交集分析揭示了76个基因,它们在免疫相关病毒感染和IL-17信号通路中发挥重要作用。使用机器学习算法,含DIX结构域蛋白1(DIXDC1)、双特异性磷酸酶6(DUSP6)、丙酮酸脱氢酶激酶4(PDK4)、C-X-C基序趋化因子配体12(CXCL12)、干扰素调节因子7(IRF7)、整合素α7亚基(ITGA7)、NIMA相关激酶2(NEK2)和核受体亚家族3组成员1(NR3C1)被选为乳腺癌生物标志物。NEK2是诊断的最关键基因。靶向NEK2的潜在药物包括依托泊苷和卢卡苏诺。

结论

我们的研究确定DIXDC1、DUSP6、PDK4、CXCL12、IRF7、ITGA7、NEK2和NR3C1为乳腺癌潜在的诊断生物标志物,其中NEK2在临床环境中辅助诊断和预后的潜力最高。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6974/10331707/fc3e8b431b31/tcr-12-06-1466-f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验