• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

鉴定与乳腺癌患者发病机制相关的候选生物标志物。

Identification of candidate biomarkers correlated with the pathogenesis of breast cancer patients.

作者信息

Yu Shiqun, Wang Chengman, Ouyang Jin, Luo Ting, Zeng Fanfan, Zhang Yu, Gao Liyun, Huang Shaoxin, Wang Xin

机构信息

Yunfu Center for Disease Control and Prevention, Yunfu, China.

Jiangxi Provincial Key Laboratory of Cell Precision Therapy, School of Basic Medical Sciences , Jiujiang University, Jiujiang, 332005, Jiangxi, China.

出版信息

Sci Rep. 2025 Mar 13;15(1):8770. doi: 10.1038/s41598-025-93208-w.

DOI:10.1038/s41598-025-93208-w
PMID:40082607
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11906855/
Abstract

Breast cancer (BC) is the second leading cause of cancer-related death in females, followed by lung cancer. Disadvantages exist in conventional diagnostic techniques of BC, such as radiation risk. The present study integrated bioinformatics analysis with machine learning to elucidate potential key candidate genes associated with the tumorigenesis of BC. Eleven datasets were downloaded from the Gene Expression Omnibus (GEO) database and were consolidated into two independent cohorts (training cohort and validation cohort) after batch-effect removal. We employed "limma" package to screen differentially expressed genes (DEGs) between BC and adjacent normal breast samples. Subsequently, the most reliable diagnostic indicators were identified utilizing LASSO-Logistic regression, SVM-RFE and multivariate stepwise Logistic regression analysis. Logistic model and nomogram were created based on these hub genes and applied in external validation cohort to verify the robustness of the model. As a result, a total of six hub genes connected with BC pathogenesis were identified, including CD300LG, IGSF10, FAM83D, MAMDC2, COMP and SEMA3G. Then, a diagnostic model of BC on the basis of these genes was established. ROC analysis of the diagnostic model illustrated that AUC of the training cohort was 0.978 (0.962, 0.995). In the validation cohort, AUC of training set and validation set were 0.936 (0.910, 0.961) and 0.921 (0.870, 0.972), respectively. This indicated that the model was reliable in separating BC patients from healthy individuals. The model may assist in early diagnosis of BC with implications for improving the prognosis of BC patients.

摘要

乳腺癌(BC)是女性癌症相关死亡的第二大主要原因,仅次于肺癌。BC的传统诊断技术存在缺点,如辐射风险。本研究将生物信息学分析与机器学习相结合,以阐明与BC肿瘤发生相关的潜在关键候选基因。从基因表达综合数据库(GEO)下载了11个数据集,并在去除批次效应后整合为两个独立队列(训练队列和验证队列)。我们使用“limma”软件包筛选BC与相邻正常乳腺样本之间的差异表达基因(DEG)。随后,利用LASSO逻辑回归、支持向量机递归特征消除(SVM-RFE)和多变量逐步逻辑回归分析确定最可靠的诊断指标。基于这些核心基因创建了逻辑模型和列线图,并应用于外部验证队列以验证模型的稳健性。结果,共鉴定出6个与BC发病机制相关的核心基因,包括CD300LG、IGSF10、FAM83D、MAMDC2、COMP和SEMA3G。然后,基于这些基因建立了BC的诊断模型。诊断模型的受试者工作特征(ROC)分析表明,训练队列的曲线下面积(AUC)为0.978(0.962,0.995)。在验证队列中,训练集和验证集的AUC分别为0.936(0.910,0.961)和0.921(0.870,0.972)。这表明该模型在区分BC患者和健康个体方面是可靠的。该模型可能有助于BC的早期诊断,对改善BC患者的预后具有重要意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/5509bf338f21/41598_2025_93208_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/696d7a7b2cf7/41598_2025_93208_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/a00839826ce3/41598_2025_93208_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/a00b4b63676a/41598_2025_93208_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/dfe355591915/41598_2025_93208_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/b9601288e2af/41598_2025_93208_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/5509bf338f21/41598_2025_93208_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/696d7a7b2cf7/41598_2025_93208_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/a00839826ce3/41598_2025_93208_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/a00b4b63676a/41598_2025_93208_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/dfe355591915/41598_2025_93208_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/b9601288e2af/41598_2025_93208_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b026/11906855/5509bf338f21/41598_2025_93208_Fig6_HTML.jpg

相似文献

1
Identification of candidate biomarkers correlated with the pathogenesis of breast cancer patients.鉴定与乳腺癌患者发病机制相关的候选生物标志物。
Sci Rep. 2025 Mar 13;15(1):8770. doi: 10.1038/s41598-025-93208-w.
2
Construction of a novel mRNA-signature prediction model for prognosis of bladder cancer based on a statistical analysis.基于统计分析构建新型膀胱癌 mRNA 特征预测预后模型。
BMC Cancer. 2021 Jul 27;21(1):858. doi: 10.1186/s12885-021-08611-z.
3
Identification of candidate biomarkers correlated with poor prognosis of breast cancer based on bioinformatics analysis.基于生物信息学分析鉴定与乳腺癌不良预后相关的候选生物标志物。
Bioengineered. 2021 Dec;12(1):5149-5161. doi: 10.1080/21655979.2021.1960775.
4
Identification and Verification of Key Tumor Genes Associated with Diagnosis and Prognosis of Breast Cancer Based on Bioinformatics Analysis.基于生物信息学分析的乳腺癌诊断和预后相关关键肿瘤基因的鉴定和验证。
Dis Markers. 2022 May 31;2022:9041466. doi: 10.1155/2022/9041466. eCollection 2022.
5
Estrogen receptor 1 and progesterone receptor are distinct biomarkers and prognostic factors in estrogen receptor-positive breast cancer: Evidence from a bioinformatic analysis.雌激素受体 1 和孕激素受体是雌激素受体阳性乳腺癌的独特生物标志物和预后因素:来自生物信息学分析的证据。
Biomed Pharmacother. 2020 Jan;121:109647. doi: 10.1016/j.biopha.2019.109647. Epub 2019 Nov 13.
6
Insights into a Machine Learning-Based Palmitoylation-Related Gene Model for Predicting the Prognosis and Treatment Response of Breast Cancer Patients.基于机器学习的棕榈酰化相关基因模型预测乳腺癌患者预后和治疗反应的研究进展。
Technol Cancer Res Treat. 2024 Jan-Dec;23:15330338241263434. doi: 10.1177/15330338241263434.
7
Identification of key biomarkers in breast cancer based on bioinformatics analysis and experimental verification.基于生物信息学分析和实验验证的乳腺癌关键生物标志物鉴定
J Egypt Natl Canc Inst. 2025 Feb 24;37(1):5. doi: 10.1186/s43046-025-00260-8.
8
Integrated bioinformatics analyses identifying potential biomarkers for type 2 diabetes mellitus and breast cancer: In SIK1-ness and health.综合生物信息学分析鉴定 2 型糖尿病和乳腺癌的潜在生物标志物:SIK1 与健康。
PLoS One. 2023 Aug 9;18(8):e0289839. doi: 10.1371/journal.pone.0289839. eCollection 2023.
9
Integrated bioinformatics analysis reveals key candidate genes and pathways in breast cancer.整合生物信息学分析揭示乳腺癌的关键候选基因和途径。
Mol Med Rep. 2018 Jun;17(6):8091-8100. doi: 10.3892/mmr.2018.8895. Epub 2018 Apr 19.
10
Mitochondrial-related genes as prognostic and metastatic markers in breast cancer: insights from comprehensive analysis and clinical models.线粒体相关基因作为乳腺癌的预后和转移标志物:综合分析和临床模型的见解。
Front Immunol. 2024 Sep 24;15:1461489. doi: 10.3389/fimmu.2024.1461489. eCollection 2024.

本文引用的文献

1
KEGG for taxonomy-based analysis of pathways and genomes.KEGG 用于基于分类的途径和基因组分析。
Nucleic Acids Res. 2023 Jan 6;51(D1):D587-D592. doi: 10.1093/nar/gkac963.
2
The Arg82Cys Polymorphism of the Protein Nepmucin Implies a Role in HDL Metabolism.蛋白质Nepmucin的Arg82Cys多态性在高密度脂蛋白代谢中发挥作用。
J Endocr Soc. 2022 Mar 4;6(5):bvac034. doi: 10.1210/jendso/bvac034. eCollection 2022 May 1.
3
Transcriptomic Biomarker Signatures for Discrimination of Oral Cancer Surgical Margins.转录组生物标志物特征可用于区分口腔癌手术切缘。
Biomolecules. 2022 Mar 17;12(3):464. doi: 10.3390/biom12030464.
4
Cancer statistics, 2022.癌症统计数据,2022 年。
CA Cancer J Clin. 2022 Jan;72(1):7-33. doi: 10.3322/caac.21708. Epub 2022 Jan 12.
5
Notch1 promotes ordered revascularization through Semaphorin 3g modulation of downstream vascular patterning signalling factors.Notch1 通过调节下游血管形态发生信号因子来促进有序的血管生成。
J Physiol. 2022 Feb;600(3):509-530. doi: 10.1113/JP282286. Epub 2022 Jan 17.
6
High Levels of Expression of Cartilage Oligomeric Matrix Protein in Lymph Node Metastases in Breast Cancer Are Associated with Reduced Survival.乳腺癌淋巴结转移中软骨寡聚基质蛋白的高表达与生存率降低相关。
Cancers (Basel). 2021 Nov 23;13(23):5876. doi: 10.3390/cancers13235876.
7
Molecular subtyping of breast cancer intrinsic taxonomy with oligonucleotide microarray and NanoString nCounter.寡核苷酸微阵列和 NanoString nCounter 对乳腺癌内在分类的分子亚型分类
Biosci Rep. 2021 Aug 27;41(8). doi: 10.1042/BSR20211428.
8
G2M checkpoint pathway alone is associated with drug response and survival among cell proliferation-related pathways in pancreatic cancer.在胰腺癌中,仅G2M检查点通路就与细胞增殖相关通路中的药物反应和生存率相关。
Am J Cancer Res. 2021 Jun 15;11(6):3070-3084. eCollection 2021.
9
Breast Cancer Case Identification Based on Deep Learning and Bioinformatics Analysis.基于深度学习和生物信息学分析的乳腺癌病例识别
Front Genet. 2021 May 17;12:628136. doi: 10.3389/fgene.2021.628136. eCollection 2021.
10
Comprehensive analysis based on DNA methylation and RNA-seq reveals hypermethylation of the up-regulated WT1 gene with potential mechanisms in PAM50 subtypes of breast cancer.基于DNA甲基化和RNA测序的综合分析揭示了WT1基因在乳腺癌PAM50亚型中的上调及其潜在机制。
PeerJ. 2021 May 4;9:e11377. doi: 10.7717/peerj.11377. eCollection 2021.