• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用多种机器学习模型预测乳腺癌患者的基因特征。

Predicting gene signature in breast cancer patients with multiple machine learning models.

作者信息

Zhu Fangfang, Xu Dafang

机构信息

First Affiliated Hospital of Huzhou University, No.158, Guangchang Hou Road, Huzhou, 313000, Zhejiang, People's Republic of China.

出版信息

Discov Oncol. 2024 Oct 1;15(1):516. doi: 10.1007/s12672-024-01386-2.

DOI:10.1007/s12672-024-01386-2
PMID:39352418
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11445210/
Abstract

AIMS

The aim of this study was to predict gene signatures in breast cancer patients using multiple machine learning models.

METHODS

In this study, we first collated and merged the datasets GSE54002 and GSE22820, obtaining a gene expression matrix comprising 16,820 genes (including 593 breast cancer (BC) samples and 26 normal control (NC) samples). Subsequently, we performed enrichment analyses using Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and Disease Ontology (DO).

RESULTS

We identified 177 differentially expressed genes (DEGs), including 40 up-regulated and 137 down-regulated genes, through differential expression analysis. The GO enrichment results indicated that these genes are primarily involved in extracellular matrix organization, positive regulation of nervous system development, collagen-containing extracellular matrix, heparin binding, glycosaminoglycan binding, and Wnt protein binding, among others. KEGG enrichment analysis revealed that the DEGs were primarily associated with pathways such as focal adhesion, the PI3K-Akt signaling pathway, and human papillomavirus infection. DO enrichment analysis showed that the DEGs play a significant role in regulating diseases such as intestinal disorders, nephritis, and dermatitis. Further, through LASSO regression analysis and SVM-RFE algorithm analysis, we identified 9 key feature DEGs (CF-DEGs): ANGPTL7, TSHZ2, SDPR, CLCA4, PAMR1, MME, CXCL2, ADAMTS5, and KIT. Additionally, ROC curve analysis demonstrated that these CF-DEGs serve as a reliable diagnostic index. Finally, using the CIBERSORT algorithm, we analyzed the infiltration of immune cells and the associations between CF-DEGs and immune cell infiltration across all samples.

CONCLUSIONS

Our findings provide new insights into the molecular functions and metabolic pathways involved in breast cancer, potentially aiding in the discovery of new diagnostic and immunotherapeutic biomarkers.

摘要

目的

本研究旨在使用多种机器学习模型预测乳腺癌患者的基因特征。

方法

在本研究中,我们首先整理并合并了数据集GSE54002和GSE22820,获得了一个包含16820个基因的基因表达矩阵(包括593个乳腺癌(BC)样本和26个正常对照(NC)样本)。随后,我们使用基因本体论(GO)、京都基因与基因组百科全书(KEGG)和疾病本体论(DO)进行了富集分析。

结果

通过差异表达分析,我们鉴定出177个差异表达基因(DEG),包括40个上调基因和137个下调基因。GO富集结果表明,这些基因主要参与细胞外基质组织、神经系统发育的正调控、含胶原细胞外基质、肝素结合、糖胺聚糖结合和Wnt蛋白结合等。KEGG富集分析显示,DEG主要与粘着斑、PI3K-Akt信号通路和人乳头瘤病毒感染等途径相关。DO富集分析表明,DEG在调节肠道疾病、肾炎和皮炎等疾病中起重要作用。此外,通过LASSO回归分析和SVM-RFE算法分析,我们确定了9个关键特征DEG(CF-DEG):ANGPTL7、TSHZ2、SDPR、CLCA4、PAMR1、MME、CXCL2、ADAMTS5和KIT。此外,ROC曲线分析表明,这些CF-DEG可作为可靠的诊断指标。最后,使用CIBERSORT算法,我们分析了所有样本中免疫细胞的浸润情况以及CF-DEG与免疫细胞浸润之间的关联。

结论

我们的研究结果为乳腺癌涉及的分子功能和代谢途径提供了新的见解,可能有助于发现新的诊断和免疫治疗生物标志物。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/eafaba6dedb4/12672_2024_1386_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/6b0bfec8bd43/12672_2024_1386_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/893b99b17457/12672_2024_1386_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/a74c5a3c003d/12672_2024_1386_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/9e638f4e4d1f/12672_2024_1386_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/9333d0f0778d/12672_2024_1386_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/eafaba6dedb4/12672_2024_1386_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/6b0bfec8bd43/12672_2024_1386_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/893b99b17457/12672_2024_1386_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/a74c5a3c003d/12672_2024_1386_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/9e638f4e4d1f/12672_2024_1386_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/9333d0f0778d/12672_2024_1386_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a69/11445210/eafaba6dedb4/12672_2024_1386_Fig6_HTML.jpg

相似文献

1
Predicting gene signature in breast cancer patients with multiple machine learning models.使用多种机器学习模型预测乳腺癌患者的基因特征。
Discov Oncol. 2024 Oct 1;15(1):516. doi: 10.1007/s12672-024-01386-2.
2
Identification and Validation of the Diagnostic Characteristic Genes of Ovarian Cancer by Bioinformatics and Machine Learning.通过生物信息学和机器学习鉴定及验证卵巢癌的诊断特征基因
Front Genet. 2022 Jun 1;13:858466. doi: 10.3389/fgene.2022.858466. eCollection 2022.
3
Identification of the diagnostic genes and immune cell infiltration characteristics of gastric cancer using bioinformatics analysis and machine learning.利用生物信息学分析和机器学习鉴定胃癌的诊断基因及免疫细胞浸润特征
Front Genet. 2023 Jan 4;13:1067524. doi: 10.3389/fgene.2022.1067524. eCollection 2022.
4
Six potential biomarkers in septic shock: a deep bioinformatics and prospective observational study.脓毒性休克的 6 个潜在生物标志物:一项深入的生物信息学和前瞻性观察研究。
Front Immunol. 2023 Jun 8;14:1184700. doi: 10.3389/fimmu.2023.1184700. eCollection 2023.
5
Identification of differentially expressed genes regulated by molecular signature in breast cancer-associated fibroblasts by bioinformatics analysis.通过生物信息学分析鉴定乳腺癌相关成纤维细胞中受分子特征调控的差异表达基因。
Arch Gynecol Obstet. 2018 Jan;297(1):161-183. doi: 10.1007/s00404-017-4562-y. Epub 2017 Oct 23.
6
Identification of immune-associated biomarkers of diabetes nephropathy tubulointerstitial injury based on machine learning: a bioinformatics multi-chip integrated analysis.基于机器学习的糖尿病肾病肾小管间质损伤免疫相关生物标志物的鉴定:一项生物信息学多芯片综合分析
BioData Min. 2024 Jul 1;17(1):20. doi: 10.1186/s13040-024-00369-x.
7
Identification of a novel immune infiltration-related gene signature, , for coronary artery disease.鉴定出一个新的与免疫浸润相关的基因特征 , 用于冠状动脉疾病。
PeerJ. 2024 Sep 25;12:e18135. doi: 10.7717/peerj.18135. eCollection 2024.
8
[Screen of key characteristic genes of nasopharyngeal carcinoma (NPC) base on machine learning and analysis of their correlation with immune cells].基于机器学习的鼻咽癌关键特征基因筛选及其与免疫细胞的相关性分析
Xi Bao Yu Fen Zi Mian Yi Xue Za Zhi. 2023;39(11):988-995.
9
Bioinformatic analysis of related immune cell infiltration and key genes in the progression of osteonecrosis of the femoral head.基于生物信息学分析股骨头坏死进展过程中的相关免疫细胞浸润和关键基因。
Front Immunol. 2024 Jan 11;14:1340446. doi: 10.3389/fimmu.2023.1340446. eCollection 2023.
10
Identification of hub biomarkers of myocardial infarction by single-cell sequencing, bioinformatics, and machine learning.通过单细胞测序、生物信息学和机器学习鉴定心肌梗死的核心生物标志物
Front Cardiovasc Med. 2022 Jul 25;9:939972. doi: 10.3389/fcvm.2022.939972. eCollection 2022.

引用本文的文献

1
Breast cancer prediction based on gene expression data using interpretable machine learning techniques.基于基因表达数据,运用可解释机器学习技术进行乳腺癌预测。
Sci Rep. 2025 Mar 4;15(1):7594. doi: 10.1038/s41598-025-85323-5.

本文引用的文献

1
SHCBP1 Promotes the Proliferation of Breast Cancer Cells by Inhibiting CXCL2.SHCBP1通过抑制CXCL2促进乳腺癌细胞增殖。
J Cancer. 2023 Oct 16;14(18):3444-3456. doi: 10.7150/jca.88072. eCollection 2023.
2
Membrane metalloendopeptidase (MME) is positively correlated with systemic lupus erythematosus and may inhibit the occurrence of breast cancer.膜金属肽酶(MME)与系统性红斑狼疮呈正相关,可能抑制乳腺癌的发生。
PLoS One. 2023 Aug 16;18(8):e0289960. doi: 10.1371/journal.pone.0289960. eCollection 2023.
3
Exploring Key Genes with Diagnostic Value for Nonalcoholic Steatohepatitis Based on Bioinformatics Analysis.
基于生物信息学分析探索非酒精性脂肪性肝炎具有诊断价值的关键基因
ACS Omega. 2023 May 30;8(23):20959-20967. doi: 10.1021/acsomega.3c01709. eCollection 2023 Jun 13.
4
Hyperparameter Optimizer with Deep Learning-Based Decision-Support Systems for Histopathological Breast Cancer Diagnosis.基于深度学习的决策支持系统的超参数优化器用于乳腺癌组织病理学诊断
Cancers (Basel). 2023 Jan 31;15(3):885. doi: 10.3390/cancers15030885.
5
Identification of Potential Biomarkers for Coronary Artery Disease Based on Cuproptosis.基于铜死亡的冠心病潜在生物标志物的鉴定。
Cardiovasc Ther. 2023 Jan 25;2023:5996144. doi: 10.1155/2023/5996144. eCollection 2023.
6
Five genes as diagnostic biomarkers of dermatomyositis and their correlation with immune cell infiltration.五种用于诊断皮肌炎的基因标志物及其与免疫细胞浸润的相关性。
Front Immunol. 2023 Jan 18;14:1053099. doi: 10.3389/fimmu.2023.1053099. eCollection 2023.
7
Identification of pathological-related and diagnostic potential circular RNAs in Stanford type A aortic dissection.斯坦福A型主动脉夹层中与病理相关及具有诊断潜力的环状RNA的鉴定
Front Cardiovasc Med. 2023 Jan 13;9:1074835. doi: 10.3389/fcvm.2022.1074835. eCollection 2022.
8
genetic alterations in breast cancer.乳腺癌中的基因改变。
J Clin Pathol. 2023 Dec 14;77(1):40-45. doi: 10.1136/jcp-2022-208611.
9
Identification of hub genes and immune cell infiltration characteristics in chronic rhinosinusitis with nasal polyps: Bioinformatics analysis and experimental validation.鼻息肉型慢性鼻-鼻窦炎中枢纽基因的鉴定及免疫细胞浸润特征:生物信息学分析与实验验证
Front Mol Biosci. 2022 Aug 17;9:843580. doi: 10.3389/fmolb.2022.843580. eCollection 2022.
10
ASPN Is a Potential Biomarker and Associated with Immune Infiltration in Endometriosis.ASPn 是子宫内膜异位症的一个潜在的生物标志物,并与免疫浸润相关。
Genes (Basel). 2022 Jul 28;13(8):1352. doi: 10.3390/genes13081352.