Jin Xiaohu, Huang Zhiqi, Guo Peng, Yuan Ronghua
Department of Thyroid and Breast Surgery, Nantong City No. 1 People's Hospital and Second Affiliated Hospital of Nantong University, Nantong, China.
Department of Gastrointestinal Surgery, Nantong City No. 1 People's Hospital and Second Affiliated Hospital of Nantong University, Nantong, China.
Transl Cancer Res. 2023 Jun 30;12(6):1466-1489. doi: 10.21037/tcr-23-3. Epub 2023 Jun 20.
Breast cancer (BC) ranks first in incidence among women, with approximately 2 million new cases per year. Therefore, it is essential to investigate emerging targets for BC patients' diagnosis and prognosis.
We analyzed gene expression data from 99 normal and 1,081 BC tissues in The Cancer Genome Atlas (TCGA) database. Differentially expressed genes (DEGs) were identified using "limma" R package, and relevant modules were chosen through Weighted Gene Coexpression Network Analysis (WGCNA). Intersection genes were obtained by matching DEGs to WGCNA module genes. Functional enrichment studies were performed on these genes using Gene Ontology (GO), Disease Ontology (DO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases. Biomarkers were screened via Protein-Protein Interaction (PPI) networks and multiple machine-learning algorithms. The Gene Expression Profiling Interactive Analysis (GEPIA), The University of ALabama at Birmingham CANcer (UALCAN), and Human Protein Atlas (HPA) databases were employed to examine mRNA and protein expression of eight biomarkers. Kaplan-Meier mapper tool assessed their prognostic capabilities. Key biomarkers were analyzed via single-cell sequencing, and their relationship with immune infiltration was examined using Tumor Immune Estimation Resource (TIMER) database and "xCell" R package. Lastly, drug prediction was conducted based on the identified biomarkers.
We identified 1,673 DEGs and 542 important genes through differential analysis and WGCNA, respectively. Intersection analysis revealed 76 genes, which play significant roles in immune-related viral infection and IL-17 signaling pathways. DIX domain containing 1 (DIXDC1), Dual specificity phosphatase 6 (DUSP6), Pyruvate dehydrogenase kinase 4 (PDK4), C-X-C motif chemokine ligand 12 (CXCL12), Interferon regulatory factor 7 (IRF7), Integrin subunit alpha 7 (ITGA7), NIMA related kinase 2 (NEK2), and Nuclear receptor subfamily 3 group C member 1 (NR3C1) were selected as BC biomarkers using machine-learning algorithms. NEK2 was the most critical gene for diagnosis. Prospective drugs targeting NEK2 include etoposide and lukasunone.
Our study identified DIXDC1, DUSP6, PDK4, CXCL12, IRF7, ITGA7, NEK2, and NR3C1 as potential diagnostic biomarkers for BC, with NEK2 having the highest potential to aid in diagnosis and prognosis in clinical settings.
乳腺癌(BC)的发病率在女性中位居首位,每年新增病例约200万例。因此,研究乳腺癌患者诊断和预后的新靶点至关重要。
我们分析了癌症基因组图谱(TCGA)数据库中99个正常组织和1081个乳腺癌组织的基因表达数据。使用“limma”R包鉴定差异表达基因(DEG),并通过加权基因共表达网络分析(WGCNA)选择相关模块。通过将DEG与WGCNA模块基因匹配获得交集基因。使用基因本体论(GO)、疾病本体论(DO)和京都基因与基因组百科全书(KEGG)数据库对这些基因进行功能富集研究。通过蛋白质-蛋白质相互作用(PPI)网络和多种机器学习算法筛选生物标志物。利用基因表达谱交互式分析(GEPIA)、阿拉巴马大学伯明翰分校癌症(UALCAN)和人类蛋白质图谱(HPA)数据库检测8种生物标志物的mRNA和蛋白质表达。Kaplan-Meier映射工具评估了它们的预后能力。通过单细胞测序分析关键生物标志物,并使用肿瘤免疫估计资源(TIMER)数据库和“xCell”R包检查它们与免疫浸润的关系。最后,基于鉴定出的生物标志物进行药物预测。
我们分别通过差异分析和WGCNA鉴定出1673个DEG和542个重要基因。交集分析揭示了76个基因,它们在免疫相关病毒感染和IL-17信号通路中发挥重要作用。使用机器学习算法,含DIX结构域蛋白1(DIXDC1)、双特异性磷酸酶6(DUSP6)、丙酮酸脱氢酶激酶4(PDK4)、C-X-C基序趋化因子配体12(CXCL12)、干扰素调节因子7(IRF7)、整合素α7亚基(ITGA7)、NIMA相关激酶2(NEK2)和核受体亚家族3组成员1(NR3C1)被选为乳腺癌生物标志物。NEK2是诊断的最关键基因。靶向NEK2的潜在药物包括依托泊苷和卢卡苏诺。
我们的研究确定DIXDC1、DUSP6、PDK4、CXCL12、IRF7、ITGA7、NEK2和NR3C1为乳腺癌潜在的诊断生物标志物,其中NEK2在临床环境中辅助诊断和预后的潜力最高。