Classifying Breast Cancer Subtypes Using Multiple Kernel Learning Based on Omics Data.

Affiliations

Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, China.

Computational System Biology Laboratory, Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA.

Publication information

Genes (Basel). 2019 Mar 7;10(3):200. doi: 10.3390/genes10030200.

DOI:10.3390/genes10030200

PMID:30866472

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6471546/

Abstract

Exploring the intrinsic differences among breast cancer subtypes is important because these differences are closely related to clinical diagnosis and the design of treatment plans. With the accumulation of biological and medical datasets, many types of omics data are now available, each describing the disease from a different aspect, and many public databases offer them for download. Combining multiple omics data types can improve prediction accuracy. In this article, we use estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2) to define breast cancer subtypes, and we use the SMO-MKL algorithm to distinguish between any two subtypes. We collected mRNA data, methylation data, and copy number variation (CNV) data from TCGA for this classification. Multiple kernel learning (MKL) is employed so that each omics data type is used separately. Using three omics data types with multiple kernels gives better results than using a single omics data type with multiple kernels. Furthermore, the significant genes and pathways discovered during feature selection are analyzed. In experiments, the proposed method outperforms other state-of-the-art methods and offers rich biological interpretation.
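The idea of combining several omics views through kernels can be illustrated with a minimal sketch. This is not the SMO-MKL algorithm used in the paper: SMO-MKL learns the kernel weights, whereas the sketch below fixes them by hand. The synthetic data, the choice of one RBF kernel per view, the `gamma` heuristic, and the helper names `combined_kernel` and `make_synthetic_views` are all assumptions made for illustration; the three views merely stand in for mRNA, methylation, and CNV matrices.

```python
import numpy as np
from sklearn.metrics.pairwise import rbf_kernel
from sklearn.svm import SVC


def combined_kernel(views_a, views_b, gammas, weights):
    """Convex combination of one RBF kernel per omics view (weights fixed, not learned)."""
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalize so the weights sum to 1
    return sum(
        w * rbf_kernel(a, b, gamma=g)
        for a, b, g, w in zip(views_a, views_b, gammas, weights)
    )


def make_synthetic_views(n_samples=120, dims=(50, 40, 30), seed=0):
    """Hypothetical stand-in for three omics matrices sharing the same samples."""
    rng = np.random.default_rng(seed)
    y = np.repeat([0, 1], n_samples // 2)  # two subtypes, as in a pairwise task
    views = []
    for d in dims:
        x = rng.normal(size=(n_samples, d))
        x[y == 1, :5] += 1.5  # put class signal in the first five features
        views.append(x)
    return views, y


views, y = make_synthetic_views()

# Random train/test split over samples (the same split is applied to every view).
order = np.random.default_rng(1).permutation(len(y))
train, test = order[:80], order[80:]
train_views = [v[train] for v in views]
test_views = [v[test] for v in views]

gammas = [1.0 / v.shape[1] for v in views]  # assumed heuristic: 1 / n_features
weights = [1.0, 1.0, 1.0]                   # assumed uniform kernel weights

# SVC with a precomputed kernel expects (n_train, n_train) for fit
# and (n_test, n_train) for predict/score.
k_train = combined_kernel(train_views, train_views, gammas, weights)
k_test = combined_kernel(test_views, train_views, gammas, weights)

clf = SVC(kernel="precomputed", C=1.0).fit(k_train, y[train])
accuracy = clf.score(k_test, y[test])
print(f"held-out accuracy: {accuracy:.2f}")
```

A full MKL method would replace the fixed `weights` with values optimized jointly with the SVM, and could use several kernels per view rather than one.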

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0a5b/6471546/115560b8c096/genes-10-00200-g001.jpg
