Molecular Subtyping of Cancer Based on Distinguishing Co-Expression Modules and Machine Learning.

Authors

Sun Peishuo, Wu Ying, Yin Chaoyi, Jiang Hongyang, Xu Ying, Sun Huiyan

Affiliations

School of Artificial Intelligence, Jilin University, Changchun, China.

Phase I Clinical Trials Center, The First Affiliated Hospital, China Medical University, Shenyang, China.

Publication information

Front Genet. 2022 May 2;13:866005. doi: 10.3389/fgene.2022.866005. eCollection 2022.

DOI:10.3389/fgene.2022.866005

PMID:35586568

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9108363/

Abstract

Molecular subtyping of cancer is recognized as a critical and challenging step towards individualized therapy. Most existing computational methods treat this problem as multi-class classification of the gene expression profiles of cancer samples. Although these methods, especially deep learning, perform well in data classification, they usually require large amounts of data for model training and have limited interpretability. Moreover, because cancer is a complex systemic disease, the phenotypic differences between cancer samples can hardly be fully understood by analyzing single molecules alone, and molecular subtyping methods based on differential expression are reportedly not conserved. To address these issues, we present a new framework for molecular subtyping of cancer that identifies a robust, specific co-expression module for each subtype, generates network features for each sample by perturbing the correlation levels of specific edges, and then trains a deep neural network for multi-class classification. When applied to molecular subtyping of breast cancer (BRCA) and stomach adenocarcinoma (STAD), the framework achieves better classification performance than existing methods. Beyond the gain in classification performance, we consider the specific co-expression modules selected for subtyping to be biologically meaningful, which potentially offers new insight for diagnostic biomarker design, mechanistic studies of cancer, and individualized treatment plan selection.
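The abstract only summarizes how per-sample network features are generated. As a rough illustration of the general idea, the sketch below measures how much the Pearson correlation of each module edge changes when a single sample is appended to a reference group. This is an assumption about what "perturbing correlation levels of specific edges" could look like, not the authors' confirmed procedure; the function names `pearson` and `edge_perturbation_features`, the toy data, and the edge list are all hypothetical.

```python
import numpy as np

def pearson(x, y):
    """Pearson correlation of two 1-D arrays (0.0 if either is constant)."""
    xc = x - x.mean()
    yc = y - y.mean()
    denom = np.sqrt((xc ** 2).sum() * (yc ** 2).sum())
    return float((xc * yc).sum() / denom) if denom > 0 else 0.0

def edge_perturbation_features(reference, sample, edges):
    """Change in each edge's correlation when `sample` joins `reference`.

    reference: (n_samples, n_genes) expression matrix of a reference group
    sample:    (n_genes,) expression vector of the sample to featurize
    edges:     list of (gene_i, gene_j) column-index pairs from a module
    """
    augmented = np.vstack([reference, sample])
    features = []
    for i, j in edges:
        before = pearson(reference[:, i], reference[:, j])
        after = pearson(augmented[:, i], augmented[:, j])
        features.append(after - before)
    return np.array(features)

# Toy data (hypothetical): genes 0 and 1 are strongly co-expressed.
rng = np.random.default_rng(0)
reference = rng.normal(size=(50, 4))
reference[:, 1] = reference[:, 0] + 0.1 * rng.normal(size=50)
edges = [(0, 1), (2, 3)]

consistent = np.array([2.0, 2.0, 0.0, 0.0])   # follows the 0-1 co-expression
disruptive = np.array([3.0, -3.0, 0.0, 0.0])  # breaks the 0-1 co-expression

f_consistent = edge_perturbation_features(reference, consistent, edges)
f_disruptive = edge_perturbation_features(reference, disruptive, edges)
```

Under these assumptions, a sample that breaks a module's co-expression pattern yields a clearly negative value for that edge, while a sample that follows the pattern leaves the correlation nearly unchanged. One feature vector per sample, with one entry per selected edge, could then serve as input to a multi-class classifier.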

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80d2/9108363/08f6a73c2546/fgene-13-866005-g001.jpg
