使用高维嵌入和残差神经网络对乳腺癌诺丁汉预后指数进行分类

Classification of Breast Cancer Nottingham Prognostic Index Using High-Dimensional Embedding and Residual Neural Network.

作者信息

Zhou Li, Rueda Maria, Alkhateeb Abedalrhman

机构信息

School of Computer Science, University of Windsor, Windsor, ON N9B 3P4, Canada.

Department of Chemistry and Biochemistry, University of Windsor, Windsor, ON N9B 3P4, Canada.

出版信息

Cancers (Basel). 2022 Feb 13;14(4):934. doi: 10.3390/cancers14040934.

DOI:10.3390/cancers14040934

PMID:35205681

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8870306/

Abstract

The Nottingham Prognostics Index (NPI) is a prognostics measure that predicts operable primary breast cancer survival. The NPI value is calculated based on the size of the tumor, the number of lymph nodes, and the tumor grade. Next-generation sequencing advancements have led to measuring different biological indicators called multi-omics data. The availability of multi-omics data triggered the challenge of integrating and analyzing these various biological measures to understand the progression of the diseases. High-dimensional embedding techniques are incorporated to present the features in the lower dimension, i.e., in a 2-dimensional map. The dataset consists of three -omics: gene expression, copy number alteration (CNA), and mRNA from 1885 female patients. The model creates a gene similarity network (GSN) map for each omic using t-distributed stochastic neighbor embedding (-SNE) before being merged into the residual neural network (ResNet) classification model. The aim of this work was to (i) extract multi-omics biomarkers that are associated with the prognosis and prediction of breast cancer survival; and (ii) build a prediction model for multi-class breast cancer NPI classes. We evaluated this model and compared it to different high-dimensional embedding techniques and neural network combinations. The proposed model outperformed the other methods with an accuracy of 98.48%, and the area under the curve (AUC) equals 0.9999. The findings in the literature confirm associations between some of the extracted omics and breast cancer prognosis and survival including , , , and from the gene expression dataset; , , and from the CNA dataset; and , , and from the mRNA dataset.

摘要

诺丁汉预后指数（NPI）是一种预测可手术原发性乳腺癌生存率的预后指标。NPI值是根据肿瘤大小、淋巴结数量和肿瘤分级计算得出的。下一代测序技术的进步使得能够测量不同的生物指标，即多组学数据。多组学数据的可用性引发了整合和分析这些不同生物指标以了解疾病进展的挑战。采用高维嵌入技术在低维度（即二维图）中呈现特征。该数据集由来自1885名女性患者的三种组学数据组成：基因表达、拷贝数变异（CNA）和mRNA。该模型在合并到残差神经网络（ResNet）分类模型之前，使用t分布随机邻域嵌入（t-SNE）为每个组学创建基因相似性网络（GSN）图。这项工作的目的是：（i）提取与乳腺癌生存预后和预测相关的多组学生物标志物；（ii）构建多类乳腺癌NPI类别的预测模型。我们评估了该模型，并将其与不同的高维嵌入技术和神经网络组合进行了比较。所提出的模型以98.48%的准确率优于其他方法，曲线下面积（AUC）等于0.9999。文献中的研究结果证实了一些提取的组学数据与乳腺癌预后和生存之间的关联，包括基因表达数据集中的、、、和；CNA数据集中的、、和；以及mRNA数据集中的、、和。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/77ce/8870306/85aa0337b219/cancers-14-00934-g001.jpg

相似文献

Classification of Breast Cancer Nottingham Prognostic Index Using High-Dimensional Embedding and Residual Neural Network.使用高维嵌入和残差神经网络对乳腺癌诺丁汉预后指数进行分类

Cancers (Basel). 2022 Feb 13;14(4):934. doi: 10.3390/cancers14040934.

Multi-omics Data Integration Model Based on UMAP Embedding and Convolutional Neural Network.基于UMAP嵌入和卷积神经网络的多组学数据整合模型

Cancer Inform. 2022 Sep 28;21:11769351221124205. doi: 10.1177/11769351221124205. eCollection 2022.

Predicting censored survival data based on the interactions between meta-dimensional omics data in breast cancer.基于乳腺癌元维度组学数据间的相互作用预测删失生存数据。

J Biomed Inform. 2015 Aug;56:220-8. doi: 10.1016/j.jbi.2015.05.019. Epub 2015 Jun 3.

MSFN: a multi-omics stacked fusion network for breast cancer survival prediction.MSFN：一种用于乳腺癌生存预测的多组学堆叠融合网络。

Front Genet. 2024 Aug 2;15:1378809. doi: 10.3389/fgene.2024.1378809. eCollection 2024.

PaCMAP-embedded convolutional neural network for multi-omics data integration.用于多组学数据整合的嵌入PaCMAP的卷积神经网络。

Heliyon. 2023 Dec 5;10(1):e23195. doi: 10.1016/j.heliyon.2023.e23195. eCollection 2024 Jan 15.

NESM: a network embedding method for tumor stratification by integrating multi-omics data.NESM：一种通过整合多组学数据进行肿瘤分层的网络嵌入方法。

G3 (Bethesda). 2022 Nov 4;12(11). doi: 10.1093/g3journal/jkac243.

A novel prognostic model based on multi-omics features predicts the prognosis of colon cancer patients.一种基于多组学特征的新型预后模型预测结肠癌患者的预后。

Mol Genet Genomic Med. 2020 Jul;8(7):e1255. doi: 10.1002/mgg3.1255. Epub 2020 May 12.

A Translational Pipeline for Overall Survival Prediction of Breast Cancer Patients by Decision-Level Integration of Multi-Omics Data.一种通过多组学数据的决策级整合对乳腺癌患者总生存期进行预测的转化流程。

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2019 Nov;2019:1573-1580. doi: 10.1109/bibm47256.2019.8983243. Epub 2020 Feb 6.

Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis.基于深度学习的多组学生物标志物数据特征层融合在乳腺癌患者生存分析中的应用。

BMC Med Inform Decis Mak. 2020 Sep 15;20(1):225. doi: 10.1186/s12911-020-01225-8.

Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益

Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

引用本文的文献

Gelsolin (GSN) as a key regulator in estrogen receptor-positive breast cancer: implications for prognosis, chemotherapy sensitivity, and immune infiltration.凝溶胶蛋白（GSN）作为雌激素受体阳性乳腺癌的关键调节因子：对预后、化疗敏感性和免疫浸润的影响

Discov Oncol. 2025 Jul 9;16(1):1294. doi: 10.1007/s12672-025-03128-4.

A Case of Male Breast Cancer in Rural America: Treatment Delays and Healthcare Access Challenges.美国农村地区一例男性乳腺癌病例：治疗延误与医疗保健获取挑战

Case Rep Oncol. 2025 May 30;18(1):885-891. doi: 10.1159/000546042. eCollection 2025 Jan-Dec.

Multimodal data fusion AI model uncovers tumor microenvironment immunotyping heterogeneity and enhanced risk stratification of breast cancer.多模态数据融合人工智能模型揭示了肿瘤微环境免疫分型的异质性并增强了乳腺癌的风险分层。

MedComm (2020). 2024 Dec 11;5(12):e70023. doi: 10.1002/mco2.70023. eCollection 2024 Dec.

Serum Direct Bilirubin as a Biomarker for Breast Cancer.血清直接胆红素作为乳腺癌的生物标志物

Breast Cancer (Dove Med Press). 2024 Nov 7;16:735-743. doi: 10.2147/BCTT.S491523. eCollection 2024.

Exploring the Correlation Between Hypoxia, Variants, and Breast Cancer in Different Ethnicities, and Bangladeshi Women: Through ELISA and Integrative Multi-Omics Analysis.通过酶联免疫吸附测定（ELISA）和综合多组学分析探索不同种族及孟加拉国女性中缺氧、基因变异与乳腺癌之间的相关性。

Biomark Insights. 2024 Sep 18;19:11772719241278176. doi: 10.1177/11772719241278176. eCollection 2024.

Pyroptosis-associated genes and tumor immune response in endometrial cancer.子宫内膜癌中焦亡相关基因与肿瘤免疫反应

Discov Oncol. 2024 Sep 12;15(1):433. doi: 10.1007/s12672-024-01315-3.

A framework for block-wise missing data in multi-omics.多组学中基于块的缺失数据框架。

PLoS One. 2024 Jul 23;19(7):e0307482. doi: 10.1371/journal.pone.0307482. eCollection 2024.

Quantitative prediction of postpartum hemorrhage in cesarean section on machine learning.机器学习在剖宫产术中产后出血的定量预测。

BMC Med Inform Decis Mak. 2024 Jun 13;24(1):166. doi: 10.1186/s12911-024-02571-7.

Pharmgenomics Pers Med. 2024 May 23;17:251-270. doi: 10.2147/PGPM.S450960. eCollection 2024.

A hybrid deep learning scheme for MRI-based preliminary multiclassification diagnosis of primary brain tumors.一种基于磁共振成像的原发性脑肿瘤初步多分类诊断的混合深度学习方案。

Front Oncol. 2024 Apr 30;14:1363756. doi: 10.3389/fonc.2024.1363756. eCollection 2024.

本文引用的文献

TRK Fusion Cancer: Patient Characteristics and Survival Analysis in the Real-World Setting.TRK 融合癌：真实世界环境中的患者特征和生存分析。

Target Oncol. 2021 May;16(3):389-399. doi: 10.1007/s11523-021-00815-4. Epub 2021 Apr 24.

Decreased expression of the translation factor eIF3e induces senescence in breast cancer cells via suppression of PARP1 and activation of mTORC1.翻译因子eIF3e表达降低通过抑制PARP1和激活mTORC1诱导乳腺癌细胞衰老。

Oncotarget. 2021 Mar 30;12(7):649-664. doi: 10.18632/oncotarget.27923.

MAL2 drives immune evasion in breast cancer by suppressing tumor antigen presentation.MAL2 通过抑制肿瘤抗原呈递来驱动乳腺癌中的免疫逃逸。

J Clin Invest. 2021 Jan 4;131(1). doi: 10.1172/JCI140837.

HIF1α and p53 Regulated MED30, a Mediator Complex Subunit, is Involved in Regulation of Glioblastoma Pathogenesis and Temozolomide Resistance.低氧诱导因子1α（HIF1α）和p53调控的中介体复合物亚基MED30参与胶质母细胞瘤发病机制及替莫唑胺耐药性的调控

Cell Mol Neurobiol. 2021 Oct;41(7):1521-1535. doi: 10.1007/s10571-020-00920-4. Epub 2020 Jul 23.

Comparison of Nottingham Prognostic Index, PREDICT and PrognosTILs in Triple Negative Breast Cancer -a Retrospective Cohort Study.三阴性乳腺癌中诺丁汉预后指数、PREDICT 和 PrognosTILs 的比较-一项回顾性队列研究。

Pathol Oncol Res. 2020 Oct;26(4):2443-2450. doi: 10.1007/s12253-020-00846-8. Epub 2020 Jun 20.

Machine Learning techniques in breast cancer prognosis prediction: A primary evaluation.乳腺癌预后预测中的机器学习技术：初步评估。

Cancer Med. 2020 May;9(9):3234-3243. doi: 10.1002/cam4.2811. Epub 2020 Mar 10.

Prognostic models for breast cancer: a systematic review.乳腺癌预后模型：系统评价。

BMC Cancer. 2019 Mar 14;19(1):230. doi: 10.1186/s12885-019-5442-6.

Breast Cancer Prognosis Using a Machine Learning Approach.基于机器学习方法的乳腺癌预后分析

Cancers (Basel). 2019 Mar 7;11(3):328. doi: 10.3390/cancers11030328.

Proteome profiling of triple negative breast cancer cells overexpressing NOD1 and NOD2 receptors unveils molecular signatures of malignant cell proliferation.NOD1 和 NOD2 受体过表达的三阴性乳腺癌细胞的蛋白质组谱分析揭示了恶性细胞增殖的分子特征。

BMC Genomics. 2019 Feb 21;20(1):152. doi: 10.1186/s12864-019-5523-6.

Correlation of UGT2B7 Polymorphism with Cardiotoxicity in Breast Cancer Patients Undergoing Epirubicin/Cyclophosphamide-Docetaxel Adjuvant Chemotherapy.接受表柔比星/环磷酰胺-多西他赛辅助化疗的乳腺癌患者中UGT2B7基因多态性与心脏毒性的相关性

Yonsei Med J. 2019 Jan;60(1):30-37. doi: 10.3349/ymj.2019.60.1.30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用高维嵌入和残差神经网络对乳腺癌诺丁汉预后指数进行分类

Classification of Breast Cancer Nottingham Prognostic Index Using High-Dimensional Embedding and Residual Neural Network.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献