An integrative deep learning framework for classifying molecular subtypes of breast cancer.

Author information

Mohaiminul Islam Md, Huang Shujun, Ajwad Rasif, Chi Chen, Wang Yang, Hu Pingzhao

Affiliations

Department of Biochemistry and Medical Genetics, University of Manitoba, Winnipeg, Manitoba R3E 0W3, Canada.

Department of Computer Science, University of Manitoba, Winnipeg, Manitoba R3E 0W3, Canada.

Publication information

Comput Struct Biotechnol J. 2020 Aug 11;18:2185-2199. doi: 10.1016/j.csbj.2020.08.005. eCollection 2020.

DOI:10.1016/j.csbj.2020.08.005

PMID:32952934

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7473884/

Abstract

Classification of breast cancer subtypes using multi-omics profiles is a difficult problem because the data sets are high-dimensional and highly correlated. Deep neural network (DNN) learning has demonstrated advantages over traditional methods because it does not require hand-crafted features; instead, it automatically extracts features from raw data and efficiently analyzes high-dimensional and correlated data. We aim to develop an integrative deep learning framework for classifying molecular subtypes of breast cancer. We collect copy number alteration and gene expression data measured on the same breast cancer patients from the Molecular Taxonomy of Breast Cancer International Consortium. We propose a deep learning model that integrates the omics datasets to predict the patients' molecular subtypes. The performance of our proposed DNN model is compared with several baseline models. Furthermore, we evaluate the misclassification of the subtypes using the learned deep features and explore their usefulness for clustering the breast cancer patients. We demonstrate that our proposed integrative deep learning model is superior to other deep learning and non-deep learning based models. In particular, we obtain the best prediction result among the deep learning-based integration models when we integrate the two data sources using a concatenation layer without sharing weights. Using the learned deep features, we identify 6 breast cancer subgroups and show that Her2-enriched samples can be classified into more than one tumor subtype. Overall, the integrated models show better performance than those trained on individual data sources.
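The integration strategy described in the abstract (one branch per data source, no shared weights, outputs joined by a concatenation layer) can be illustrated with a minimal, untrained forward pass. This is only a sketch under stated assumptions, not the authors' architecture: the class name `ConcatIntegrationNet`, the layer sizes, the single hidden layer per branch, and the choice of 5 output classes are all hypothetical and do not come from the paper.

```python
import math
import random


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) with small random weights, one row per output unit."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def linear(x, weights, biases):
    """Affine map: one output per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def relu_layer(x, weights, biases):
    """Fully connected layer followed by ReLU."""
    return [max(0.0, z) for z in linear(x, weights, biases)]


def softmax(logits):
    """Numerically stable softmax."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class ConcatIntegrationNet:
    """Two independent branches (no weight sharing) joined by concatenation.

    All sizes are illustrative assumptions, not values from the paper.
    """

    def __init__(self, n_cna, n_expr, n_hidden, n_classes, seed=0):
        rng = random.Random(seed)
        # Each omics source gets its own parameters.
        self.cna_branch = init_layer(n_cna, n_hidden, rng)
        self.expr_branch = init_layer(n_expr, n_hidden, rng)
        # The classifier sees both branch outputs side by side.
        self.classifier = init_layer(2 * n_hidden, n_classes, rng)

    def deep_features(self, cna, expr):
        """Concatenated hidden representation of one patient."""
        h_cna = relu_layer(cna, *self.cna_branch)
        h_expr = relu_layer(expr, *self.expr_branch)
        return h_cna + h_expr  # list concatenation = concatenation layer

    def predict_proba(self, cna, expr):
        """Subtype probabilities from the concatenated features."""
        return softmax(linear(self.deep_features(cna, expr), *self.classifier))


# Toy input: 8 copy-number features and 10 expression features for one patient.
net = ConcatIntegrationNet(n_cna=8, n_expr=10, n_hidden=4, n_classes=5)
features = net.deep_features([0.5] * 8, [1.0] * 10)
probs = net.predict_proba([0.5] * 8, [1.0] * 10)
```

In this sketch, `deep_features` plays the role of the learned deep features that the abstract reuses for clustering, although here the weights are random because no training loop is included.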



Similar articles

An integrative deep learning framework for classifying molecular subtypes of breast cancer.

Comput Struct Biotechnol J. 2020 Aug 11;18:2185-2199. doi: 10.1016/j.csbj.2020.08.005. eCollection 2020.

A multimodal graph neural network framework for cancer molecular subtype classification.

BMC Bioinformatics. 2024 Jan 15;25(1):27. doi: 10.1186/s12859-023-05622-4.

Deep learning based feature-level integration of multi-omics data for breast cancer patients survival analysis.

BMC Med Inform Decis Mak. 2020 Sep 15;20(1):225. doi: 10.1186/s12911-020-01225-8.

Enhancing the prediction of IDC breast cancer staging from gene expression profiles using hybrid feature selection methods and deep learning architecture.

Med Biol Eng Comput. 2023 Nov;61(11):2895-2919. doi: 10.1007/s11517-023-02892-1. Epub 2023 Aug 2.

Classifying breast cancer subtypes on multi-omics data via sparse canonical correlation analysis and deep learning.

BMC Bioinformatics. 2024 Mar 27;25(1):132. doi: 10.1186/s12859-024-05749-y.

SADLN: Self-attention based deep learning network of integrating multi-omics data for cancer subtype recognition.

Front Genet. 2023 Jan 4;13:1032768. doi: 10.3389/fgene.2022.1032768. eCollection 2022.

A hierarchical integration deep flexible neural forest framework for cancer subtype classification by integrating multi-omics data.

BMC Bioinformatics. 2019 Oct 28;20(1):527. doi: 10.1186/s12859-019-3116-7.

Predicting drug-target interaction network using deep learning model.

Comput Biol Chem. 2019 Jun;80:90-101. doi: 10.1016/j.compbiolchem.2019.03.016. Epub 2019 Mar 25.

Classifying breast cancer using multi-view graph neural network based on multi-omics data.

Front Genet. 2024 Feb 20;15:1363896. doi: 10.3389/fgene.2024.1363896. eCollection 2024.

MOGAT: A Multi-Omics Integration Framework Using Graph Attention Networks for Cancer Subtype Prediction.

Int J Mol Sci. 2024 Feb 28;25(5):2788. doi: 10.3390/ijms25052788.

Cited by

An autoencoder learning method for predicting breast cancer subtypes.

PLoS One. 2025 Jul 23;20(7):e0327773. doi: 10.1371/journal.pone.0327773. eCollection 2025.

Identification of a 10-species microbial signature of inflammatory bowel disease by machine learning and external validation.

Cell Regen. 2025 Jul 14;14(1):32. doi: 10.1186/s13619-025-00246-w.

Open challenges and opportunities in federated foundation models towards biomedical healthcare.

BioData Min. 2025 Jan 4;18(1):2. doi: 10.1186/s13040-024-00414-9.

Comparison of data fusion strategies for automated prostate lesion detection using mpMRI correlated with whole mount histology.

Radiat Oncol. 2024 Jul 29;19(1):96. doi: 10.1186/s13014-024-02471-0.

Pathformer: a biological pathway informed transformer for disease diagnosis and prognosis using multi-omics data.

Bioinformatics. 2024 May 2;40(5). doi: 10.1093/bioinformatics/btae316.

A Comprehensive Review on Synergy of Multi-Modal Data and AI Technologies in Medical Diagnosis.

Bioengineering (Basel). 2024 Feb 25;11(3):219. doi: 10.3390/bioengineering11030219.

HyperTMO: a trusted multi-omics integration framework based on hypergraph convolutional network for patient classification.

Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae159.

[Identification of breast cancer subtypes based on graph convolutional network].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2024 Feb 25;41(1):121-128. doi: 10.7507/1001-5515.202306071.

A review on trends in development and translation of omics signatures in cancer.

Comput Struct Biotechnol J. 2024 Feb 3;23:954-971. doi: 10.1016/j.csbj.2024.01.024. eCollection 2024 Dec.

Historical perspective and future directions: computational science in immuno-oncology.

J Immunother Cancer. 2024 Jan 8;12(1):e008306. doi: 10.1136/jitc-2023-008306.

References

Heterogeneous Domain Adaptation for IHC Classification of Breast Cancer Subtypes.

IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):347-353. doi: 10.1109/TCBB.2018.2877755. Epub 2018 Oct 24.

Breast Cancer Molecular Stratification: From Intrinsic Subtypes to Integrative Clusters.

Am J Pathol. 2017 Oct;187(10):2152-2162. doi: 10.1016/j.ajpath.2017.04.022. Epub 2017 Jul 19.

DeepGene: an advanced cancer type classifier based on deep learning and somatic point mutations.

BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):476. doi: 10.1186/s12859-016-1334-9.

A DEEP LEARNING APPROACH FOR CANCER DETECTION AND RELEVANT GENE IDENTIFICATION.

Pac Symp Biocomput. 2017;22:219-229. doi: 10.1142/9789813207813_0022.

Iteratively refining breast cancer intrinsic subtypes in the METABRIC dataset.

BioData Min. 2016 Jan 13;9:2. doi: 10.1186/s13040-015-0078-9. eCollection 2016.

Integrative Data Analysis of Multi-Platform Cancer Data with a Multimodal Deep Learning Approach.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Jul-Aug;12(4):928-37. doi: 10.1109/TCBB.2014.2377729.

Predicting effects of noncoding variants with deep learning-based sequence model.

Nat Methods. 2015 Oct;12(10):931-4. doi: 10.1038/nmeth.3547. Epub 2015 Aug 24.

Quality control of transcription start site selection by nonsense-mediated-mRNA decay.

Elife. 2015 Apr 23;4:e06722. doi: 10.7554/eLife.06722.

Deep learning of the tissue-regulated splicing code.

Bioinformatics. 2014 Jun 15;30(12):i121-9. doi: 10.1093/bioinformatics/btu277.

Drug repositioning by kernel-based integration of molecular structure, molecular activity, and phenotype data.

PLoS One. 2013 Nov 11;8(11):e78518. doi: 10.1371/journal.pone.0078518. eCollection 2013.
