基于图的深度学习在 COPD 多组学分类中的应用。

Deep learning on graphs for multi-omics classification of COPD.

机构信息

Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America.

Biostatistics Shared Resource, University of Colorado Cancer Center, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America.

出版信息

PLoS One. 2023 Apr 21;18(4):e0284563. doi: 10.1371/journal.pone.0284563. eCollection 2023.

DOI:10.1371/journal.pone.0284563

PMID:37083575

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10121008/

Abstract

Network approaches have successfully been used to help reveal complex mechanisms of diseases including Chronic Obstructive Pulmonary Disease (COPD). However despite recent advances, we remain limited in our ability to incorporate protein-protein interaction (PPI) network information with omics data for disease prediction. New deep learning methods including convolution Graph Neural Network (ConvGNN) has shown great potential for disease classification using transcriptomics data and known PPI networks from existing databases. In this study, we first reconstructed the COPD-associated PPI network through the AhGlasso (Augmented High-Dimensional Graphical Lasso Method) algorithm based on one independent transcriptomics dataset including COPD cases and controls. Then we extended the existing ConvGNN methods to successfully integrate COPD-associated PPI, proteomics, and transcriptomics data and developed a prediction model for COPD classification. This approach improves accuracy over several conventional classification methods and neural networks that do not incorporate network information. We also demonstrated that the updated COPD-associated network developed using AhGlasso further improves prediction accuracy. Although deep neural networks often achieve superior statistical power in classification compared to other methods, it can be very difficult to explain how the model, especially graph neural network(s), makes decisions on the given features and identifies the features that contribute the most to prediction generally and individually. To better explain how the spectral-based Graph Neural Network model(s) works, we applied one unified explainable machine learning method, SHapley Additive exPlanations (SHAP), and identified CXCL11, IL-2, CD48, KIR3DL2, TLR2, BMP10 and several other relevant COPD genes in subnetworks of the ConvGNN model for COPD prediction. Finally, Gene Ontology (GO) enrichment analysis identified glycosaminoglycan, heparin signaling, and carbohydrate derivative signaling pathways significantly enriched in the top important gene/proteins for COPD classifications.

摘要

网络方法已成功用于帮助揭示包括慢性阻塞性肺疾病（COPD）在内的复杂疾病机制。然而，尽管最近取得了进展，但我们在将蛋白质-蛋白质相互作用（PPI）网络信息与用于疾病预测的组学数据相结合的能力方面仍然受到限制。新的深度学习方法，包括卷积图神经网络（ConvGNN），已显示出使用转录组学数据和来自现有数据库的已知 PPI 网络对疾病进行分类的巨大潜力。在这项研究中，我们首先通过基于一个独立转录组学数据集（包括 COPD 病例和对照）的 AhGlasso（增强高维图形套索方法）算法重建 COPD 相关的 PPI 网络。然后，我们扩展了现有的 ConvGNN 方法，成功地整合了 COPD 相关的 PPI、蛋白质组学和转录组学数据，并开发了用于 COPD 分类的预测模型。与不整合网络信息的几种传统分类方法和神经网络相比，该方法提高了准确性。我们还证明，使用 AhGlasso 开发的更新的 COPD 相关网络进一步提高了预测准确性。尽管深度神经网络在分类方面通常比其他方法具有更高的统计能力，但要解释模型（特别是图神经网络）如何根据给定特征做出决策以及识别对一般和个别预测贡献最大的特征非常困难。为了更好地解释基于谱的图神经网络模型的工作原理，我们应用了一种统一的可解释机器学习方法 SHapley Additive exPlanations (SHAP)，并在 COPD 预测的 ConvGNN 模型的子网络中确定了 CXCL11、IL-2、CD48、KIR3DL2、TLR2、BMP10 和其他几个相关的 COPD 基因。最后，基因本体论（GO）富集分析确定了在 COPD 分类的重要基因/蛋白中显著富集的糖胺聚糖、肝素信号和碳水化合物衍生物信号通路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9f21/10121008/c87f3d2eb025/pone.0284563.g001.jpg

相似文献

Deep learning on graphs for multi-omics classification of COPD.

PLoS One. 2023 Apr 21;18(4):e0284563. doi: 10.1371/journal.pone.0284563. eCollection 2023.

An Augmented High-Dimensional Graphical Lasso Method to Incorporate Prior Biological Knowledge for Global Network Learning.

Front Genet. 2022 Jan 27;12:760299. doi: 10.3389/fgene.2021.760299. eCollection 2021.

Explaining decisions of graph convolutional neural networks: patient-specific molecular subnetworks responsible for metastasis prediction in breast cancer.

Genome Med. 2021 Mar 11;13(1):42. doi: 10.1186/s13073-021-00845-7.

Prior knowledge-guided multilevel graph neural network for tumor risk prediction and interpretation via multi-omics data integration.

Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae184.

Local augmented graph neural network for multi-omics cancer prognosis prediction and analysis.

Methods. 2023 May;213:1-9. doi: 10.1016/j.ymeth.2023.02.011. Epub 2023 Mar 16.

Graph Neural Networks With Multiple Prior Knowledge for Multi-Omics Data Analysis.

IEEE J Biomed Health Inform. 2023 Sep;27(9):4591-4600. doi: 10.1109/JBHI.2023.3284794. Epub 2023 Sep 6.

Stable feature selection utilizing Graph Convolutional Neural Network and Layer-wise Relevance Propagation for biomarker discovery in breast cancer.

Artif Intell Med. 2024 May;151:102840. doi: 10.1016/j.artmed.2024.102840. Epub 2024 Mar 11.

A novel interactive deep cascade spectral graph convolutional network with multi-relational graphs for disease prediction.

Neural Netw. 2024 Jul;175:106285. doi: 10.1016/j.neunet.2024.106285. Epub 2024 Apr 1.

NNBGWO-BRCA marker: Neural Network and binary grey wolf optimization based Breast cancer biomarker discovery framework using multi-omics dataset.

Comput Methods Programs Biomed. 2024 Sep;254:108291. doi: 10.1016/j.cmpb.2024.108291. Epub 2024 Jun 18.

A multimodal graph neural network framework for cancer molecular subtype classification.

BMC Bioinformatics. 2024 Jan 15;25(1):27. doi: 10.1186/s12859-023-05622-4.

引用本文的文献

A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.

Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf355.

Oxidative Stress and Inflammation in Hypoxemic Respiratory Diseases and Their Comorbidities: Molecular Insights and Diagnostic Advances in Chronic Obstructive Pulmonary Disease and Sleep Apnea.

Antioxidants (Basel). 2025 Jul 8;14(7):839. doi: 10.3390/antiox14070839.

Common inflammatory proteins linking frailty and area-level deprivation as key drivers of cardiovascular risk in women.

Commun Med (Lond). 2025 Jul 20;5(1):301. doi: 10.1038/s43856-025-01012-4.

A robust chronic obstructive pulmonary disease classification model using dragonfly optimized kernel extreme learning machine.

Sci Rep. 2025 May 28;15(1):18702. doi: 10.1038/s41598-025-02952-6.

Strategies to include prior knowledge in omics analysis with deep neural networks.

Patterns (N Y). 2025 Mar 14;6(3):101203. doi: 10.1016/j.patter.2025.101203.

Deep learning for detecting and early predicting chronic obstructive pulmonary disease from spirogram time series.

NPJ Syst Biol Appl. 2025 Feb 15;11(1):18. doi: 10.1038/s41540-025-00489-y.

Applications of digital health technologies and artificial intelligence algorithms in COPD: systematic review.

BMC Med Inform Decis Mak. 2025 Feb 13;25(1):77. doi: 10.1186/s12911-025-02870-7.

Advances on the Role of Lung Macrophages in the Pathogenesis of Chronic Obstructive Pulmonary Disease in the Era of Single-Cell Genomics.

Int J Med Sci. 2025 Jan 1;22(2):298-308. doi: 10.7150/ijms.100160. eCollection 2025.

Comprehensive time-course gene expression evaluation of high-risk beef cattle to establish immunological characteristics associated with undifferentiated bovine respiratory disease.

Front Immunol. 2024 Sep 13;15:1412766. doi: 10.3389/fimmu.2024.1412766. eCollection 2024.

Exploring Molecular Mechanisms and Biomarkers in COPD: An Overview of Current Advancements and Perspectives.

Int J Mol Sci. 2024 Jul 4;25(13):7347. doi: 10.3390/ijms25137347.

本文引用的文献

Early detection of COPD based on graph convolutional network and small and weakly labeled data.

Med Biol Eng Comput. 2022 Aug;60(8):2321-2333. doi: 10.1007/s11517-022-02589-x. Epub 2022 Jun 24.

An Augmented High-Dimensional Graphical Lasso Method to Incorporate Prior Biological Knowledge for Global Network Learning.

Front Genet. 2022 Jan 27;12:760299. doi: 10.3389/fgene.2021.760299. eCollection 2021.

Identifying miRNA-mRNA Networks Associated With COPD Phenotypes.

Front Genet. 2021 Oct 28;12:748356. doi: 10.3389/fgene.2021.748356. eCollection 2021.

Multi-omics subtyping pipeline for chronic obstructive pulmonary disease.

PLoS One. 2021 Aug 25;16(8):e0255337. doi: 10.1371/journal.pone.0255337. eCollection 2021.

Proteomics of extracellular vesicles in plasma reveals the characteristics and residual traces of COVID-19 patients without underlying diseases after 3 months of recovery.

Cell Death Dis. 2021 May 25;12(6):541. doi: 10.1038/s41419-021-03816-3.

An Integrative Transcriptomic and Metabolomic Study Revealed That Melatonin Plays a Protective Role in Chronic Lung Inflammation by Reducing Necroptosis.

Front Immunol. 2021 May 4;12:668002. doi: 10.3389/fimmu.2021.668002. eCollection 2021.

PFP-WGAN: Protein function prediction by discovering Gene Ontology term correlations with generative adversarial networks.

PLoS One. 2021 Feb 25;16(2):e0244430. doi: 10.1371/journal.pone.0244430. eCollection 2021.

Prediction of Obstructive Lung Disease from Chest Radiographs via Deep Learning Trained on Pulmonary Function Data.

Int J Chron Obstruct Pulmon Dis. 2021 Jan 5;15:3455-3466. doi: 10.2147/COPD.S279850. eCollection 2020.

The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.

Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612. doi: 10.1093/nar/gkaa1074.

Array programming with NumPy.

Nature. 2020 Sep;585(7825):357-362. doi: 10.1038/s41586-020-2649-2. Epub 2020 Sep 16.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于图的深度学习在 COPD 多组学分类中的应用。

Deep learning on graphs for multi-omics classification of COPD.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献