Molecular Subtyping of Cancer Based on Robust Graph Neural Network and Multi-Omics Data Integration.

Author information

Yin Chaoyi, Cao Yangkun, Sun Peishuo, Zhang Hengyuan, Li Zhi, Xu Ying, Sun Huiyan

Affiliations

School of Artificial Intelligence, Jilin University, Changchun, China.

Department of Medical Oncology, the First Hospital of China Medical University, Shenyang, China.

Publication information

Front Genet. 2022 May 13;13:884028. doi: 10.3389/fgene.2022.884028. eCollection 2022.

DOI:10.3389/fgene.2022.884028

PMID:35646077

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC9137453/

Abstract

Accurate prediction of molecular subtypes in cancer patients is important for personalized diagnosis and treatment. The growing volume of multi-omics data and advances in data-driven methods are expected to facilitate molecular subtyping of cancer. Most existing machine learning-based methods classify samples using a single omics data type. They fail to integrate multi-omics data to learn comprehensive sample representations, and they ignore that transferring and aggregating information among samples can improve those representations and ultimately aid classification. We propose a novel framework, the multi-omics graph convolutional network (M-GCN), which performs molecular subtyping with a robust graph convolutional network that integrates multi-omics data. We first apply the Hilbert-Schmidt independence criterion least absolute shrinkage and selection operator (HSIC Lasso) to select transcriptomic features related to molecular subtype, and then use these features to construct a low-noise sample-sample similarity graph. Next, we take the selected gene expression features together with single nucleotide variant (SNV) and copy number variation (CNV) data as input and learn multi-view representations of the samples. On this basis, a robust variant of the graph convolutional network (GCN) model is developed to obtain new sample representations by aggregating their subgraphs. Experimental results on breast and stomach cancer show that M-GCN achieves better classification performance than other existing methods. Moreover, the identified subtype-specific biomarkers are highly consistent with current clinical understanding and show promise for assisting accurate diagnosis and targeted drug development.
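
The graph-construction and aggregation steps named in the abstract can be illustrated with a minimal sketch. It is not the authors' M-GCN: the abstract does not say how sample similarity is measured or how the robust GCN variant works, so the code below assumes cosine similarity, a k-nearest-neighbour graph, and the standard GCN propagation rule ReLU(D^{-1/2}(A+I)D^{-1/2} X W). HSIC Lasso feature selection is not reproduced, and all function names, the toy data, and the weight matrix are hypothetical.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def build_knn_graph(features, k):
    """Undirected 0/1 adjacency linking each sample to its k most similar samples.

    Assumption: similarity is cosine similarity on the selected features.
    """
    n = len(features)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        sims = [(cosine_similarity(features[i], features[j]), j)
                for j in range(n) if j != i]
        sims.sort(reverse=True)
        for _, j in sims[:k]:
            adj[i][j] = 1.0
            adj[j][i] = 1.0  # mirror the edge so the graph stays symmetric
    return adj


def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}, the standard GCN normalization."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
             for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]


def matmul(a, b):
    """Plain dense matrix product of two lists of lists."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def gcn_layer(a_norm, x, w):
    """One standard GCN propagation step: ReLU(A_norm @ X @ W)."""
    h = matmul(matmul(a_norm, x), w)
    return [[max(0.0, value) for value in row] for row in h]


# Hypothetical toy data: 4 samples, 2 selected expression features,
# 1 SNV indicator, and 1 CNV value per sample.
expr = [[1.0, 0.1], [0.9, 0.2], [0.1, 1.0], [0.2, 0.9]]
snv = [[1.0], [0.0], [0.0], [1.0]]
cnv = [[0.5], [0.4], [-0.5], [-0.3]]

# Graph built from the expression features only, as the abstract describes.
adj = build_knn_graph(expr, k=1)
a_norm = normalize_adjacency(adj)

# Assumption: the three omics views are simply concatenated per sample.
x = [e + s + c for e, s, c in zip(expr, snv, cnv)]

# Hypothetical fixed weights (4 input features -> 2 hidden units).
w = [[0.5, -0.5], [-0.5, 0.5], [0.1, 0.1], [0.3, -0.3]]
hidden = gcn_layer(a_norm, x, w)
```

In this sketch each sample's new representation is a degree-weighted average of its own features and those of its graph neighbours, followed by a linear map and ReLU. A trained model would learn `w` from labelled subtypes rather than fixing it by hand.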

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd13/9137453/3bf5ce802599/fgene-13-884028-g001.jpg
