Suppr超能文献

AWCluster:一种新颖的基于多组学整合网络的聚类方法,用于癌症数据的亚型分析。

aWCluster: A Novel Integrative Network-Based Clustering of Multiomics for Subtype Analysis of Cancer Data.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1472-1483. doi: 10.1109/TCBB.2020.3039511. Epub 2022 Jun 3.

Abstract

The remarkable growth of multi-platform genomic profiles has led to the challenge of multiomics data integration. In this study, we present a novel network-based multiomics clustering founded on the Wasserstein distance from optimal mass transport. This distance has many important geometric properties making it a suitable choice for application in machine learning and clustering. Our proposed method of aggregating multiomics and Wasserstein distance clustering (aWCluster) is applied to breast carcinoma as well as bladder carcinoma, colorectal adenocarcinoma, renal carcinoma, lung non-small cell adenocarcinoma, and endometrial carcinoma from The Cancer Genome Atlas project. Subtypes were characterized by the concordant effect of mRNA expression, DNA copy number alteration, and DNA methylation of genes and their neighbors in the interaction network. aWCluster successfully clusters all cancer types into classes with significantly different survival rates. Also, a gene ontology enrichment analysis of significant genes in the low survival subgroup of breast cancer leads to the well-known phenomenon of tumor hypoxia and the transcription factor ETS1 whose expression is induced by hypoxia. We believe aWCluster has the potential to discover novel subtypes and biomarkers by accentuating the genes that have concordant multiomics measurements in their interaction network, which are challenging to find without the network inference or with single omics analysis.

摘要

多平台基因组图谱的显著增长导致了多组学数据整合的挑战。在这项研究中,我们提出了一种新颖的基于最优物质传输 Wasserstein 距离的网络多组学聚类方法。该距离具有许多重要的几何性质,使其成为机器学习和聚类应用的合适选择。我们提出的多组学和 Wasserstein 距离聚类(aWCluster)方法应用于癌症基因组图谱项目中的乳腺癌、膀胱癌、结直肠腺癌、肾细胞癌、肺非小细胞腺癌和子宫内膜癌。亚型的特征是基因及其邻居在相互作用网络中的 mRNA 表达、DNA 拷贝数改变和 DNA 甲基化的一致效应。aWCluster 成功地将所有癌症类型聚类为具有显著不同生存率的类别。此外,对乳腺癌低生存率亚组中显著基因的基因本体富集分析导致了众所周知的肿瘤缺氧现象和转录因子 ETS1,其表达受缺氧诱导。我们相信,aWCluster 通过强调其相互作用网络中具有一致多组学测量的基因,有可能发现新的亚型和生物标志物,而没有网络推断或单一组学分析,这些基因很难找到。

相似文献

1
aWCluster: A Novel Integrative Network-Based Clustering of Multiomics for Subtype Analysis of Cancer Data.
IEEE/ACM Trans Comput Biol Bioinform. 2022 May-Jun;19(3):1472-1483. doi: 10.1109/TCBB.2020.3039511. Epub 2022 Jun 3.
3
COPS: A novel platform for multi-omic disease subtype discovery via robust multi-objective evaluation of clustering algorithms.
PLoS Comput Biol. 2024 Aug 5;20(8):e1012275. doi: 10.1371/journal.pcbi.1012275. eCollection 2024 Aug.
6
Multi-omics data fusion using adaptive GTO guided Non-negative matrix factorization for cancer subtype discovery.
Comput Methods Programs Biomed. 2023 Jan;228:107246. doi: 10.1016/j.cmpb.2022.107246. Epub 2022 Nov 16.
8
vWCluster: Vector-valued optimal transport for network based clustering using multi-omics data in breast cancer.
PLoS One. 2022 Mar 14;17(3):e0265150. doi: 10.1371/journal.pone.0265150. eCollection 2022.
10
Interactive gene identification for cancer subtyping based on multi-omics clustering.
Methods. 2023 Mar;211:61-67. doi: 10.1016/j.ymeth.2023.02.005. Epub 2023 Feb 15.

引用本文的文献

3
Data Science Opportunities To Improve Radiotherapy Planning and Clinical Decision Making.
Semin Radiat Oncol. 2024 Oct;34(4):379-394. doi: 10.1016/j.semradonc.2024.07.012.
6
Wasserstein HOG: Local Directionality Extraction via Optimal Transport.
IEEE Trans Med Imaging. 2024 Mar;43(3):916-927. doi: 10.1109/TMI.2023.3325295. Epub 2024 Mar 5.
8
vWCluster: Vector-valued optimal transport for network based clustering using multi-omics data in breast cancer.
PLoS One. 2022 Mar 14;17(3):e0265150. doi: 10.1371/journal.pone.0265150. eCollection 2022.
9
Pan-Cancer Prediction of Cell-Line Drug Sensitivity Using Network-Based Methods.
Int J Mol Sci. 2022 Jan 19;23(3):1074. doi: 10.3390/ijms23031074.
10
Reproducibility of radiomic features using network analysis and its application in Wasserstein -means clustering.
J Med Imaging (Bellingham). 2021 May;8(3):031904. doi: 10.1117/1.JMI.8.3.031904. Epub 2021 Apr 30.

本文引用的文献

2
Integrating Multi-Omic Data With Deep Subspace Fusion Clustering for Cancer Subtype Prediction.
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):216-226. doi: 10.1109/TCBB.2019.2951413. Epub 2021 Feb 3.
3
SinNLRR: a robust subspace clustering method for cell type detection by non-negative and low-rank representation.
Bioinformatics. 2019 Oct 1;35(19):3642-3650. doi: 10.1093/bioinformatics/btz139.
5
Low Rank Subspace Clustering via Discrete Constraint and Hypergraph Regularization for Tumor Molecular Pattern Discovery.
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1500-1512. doi: 10.1109/TCBB.2018.2834371. Epub 2018 May 11.
6
Characterizing Cancer Drug Response and Biological Correlates: A Geometric Network Approach.
Sci Rep. 2018 Apr 23;8(1):6402. doi: 10.1038/s41598-018-24679-3.
7
The Immune Landscape of Cancer.
Immunity. 2018 Apr 17;48(4):812-830.e14. doi: 10.1016/j.immuni.2018.03.023. Epub 2018 Apr 5.
8
A novel approach for data integration and disease subtyping.
Genome Res. 2017 Dec;27(12):2025-2039. doi: 10.1101/gr.215129.116. Epub 2017 Oct 24.
10
Pediatric Sarcoma Data Forms a Unique Cluster Measured via the Earth Mover's Distance.
Sci Rep. 2017 Aug 1;7(1):7035. doi: 10.1038/s41598-017-07551-8.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验