College of Electronic and Information Engineering, Shenzhen University, Shenzhen 518060, Guangdong, China.
Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen University, Shenzhen 518060, Guangdong, China.
Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae169.
Single-cell clustering plays a crucial role in distinguishing between cell types, facilitating the analysis of cell heterogeneity mechanisms. While many existing clustering methods rely solely on gene expression data obtained from single-cell RNA sequencing techniques to identify cell clusters, the information contained in mono-omic data is often limited, leading to suboptimal clustering performance. The emergence of single-cell multi-omics sequencing technologies enables the integration of multiple omics data for identifying cell clusters, but how to integrate different omics data effectively remains challenging. In addition, designing a clustering method that performs well across various types of multi-omics data poses a persistent challenge due to the data's inherent characteristics.
In this paper, we propose a graph-regularized multi-view ensemble clustering (GRMEC-SC) model for single-cell clustering. Our proposed approach can adaptively integrate multiple omics data and leverage insights from multiple base clustering results. We extensively evaluate our method on five multi-omics datasets through a series of rigorous experiments. The results of these experiments demonstrate that our GRMEC-SC model achieves competitive performance across diverse multi-omics datasets with varying characteristics.
Implementation of GRMEC-SC, along with examples, can be found on the GitHub repository: https://github.com/polarisChen/GRMEC-SC.
单细胞聚类在区分细胞类型、促进细胞异质性机制分析方面起着至关重要的作用。虽然许多现有的聚类方法仅依赖于单细胞 RNA 测序技术获得的基因表达数据来识别细胞簇,但单核子数据中包含的信息通常是有限的,导致聚类性能不理想。单细胞多组学测序技术的出现使得可以整合多种组学数据来识别细胞簇,但如何有效地整合不同的组学数据仍然具有挑战性。此外,由于数据的固有特性,设计一种在各种类型的多组学数据上都表现良好的聚类方法仍然是一个持续的挑战。
在本文中,我们提出了一种用于单细胞聚类的图正则化多视图集成聚类(GRMEC-SC)模型。我们提出的方法可以自适应地整合多种组学数据,并利用来自多个基础聚类结果的见解。我们通过一系列严格的实验,在五个多组学数据集上对我们的方法进行了广泛评估。这些实验的结果表明,我们的 GRMEC-SC 模型在具有不同特征的各种多组学数据集中表现出有竞争力的性能。
GRMEC-SC 的实现以及示例可以在 GitHub 存储库上找到:https://github.com/polarisChen/GRMEC-SC。