Meng Xiaokun, Zhang Yuanyuan, Xu Xiaoyu, Zhang Kaihao, Feng Baoming
School of Information and Control Engineering, Qingdao University of Technology, Qingdao, Shandong 266520, China.
School of Information and Control Engineering, Qingdao University of Technology, Qingdao, Shandong 266520, China.
Comput Biol Chem. 2025 Feb;114:108292. doi: 10.1016/j.compbiolchem.2024.108292. Epub 2024 Nov 22.
The rapid development of single-cell RNA sequencing(scRNA-seq) technology has spawned a variety of single-cell clustering methods. These methods combine statistics and bioinformatics to reveal differences in gene expression between cells and the diversity of cell types. Deep exploration of single-cell data is more challenging due to the high dimensionality, sparsity and noise of scRNA-seq data. Discriminative attribute information is often difficult to be fully utilised, while traditional clustering methods may not accurately capture the diversity of cell types. Therefore, a deep clustering method is proposed for scRNA-seq data based on subspace feature confidence learning called scSFCL. By dividing the subspace based on kernel density, discriminative feature subsets are filtered. The feature confidence of the subset is learned by combining the graph convolutional network (GCN) with weighting. Also, scSFCL facilitates the complementary fusion of generic structural and idiosyncratic information through a mutually supervised clustering that integrates GCN and a denoising variational autoencoder based on zero-inflated negative binomials (DVAE-ZINB). By validation on multiple scRNA-seq datasets, it is shown that the clustering performance of scSFCL is significantly improved compared with traditional methods, providing an effective solution for deep clustering of scRNA-seq data.
单细胞RNA测序(scRNA-seq)技术的快速发展催生了多种单细胞聚类方法。这些方法结合统计学和生物信息学来揭示细胞间基因表达的差异以及细胞类型的多样性。由于scRNA-seq数据的高维度、稀疏性和噪声,对单细胞数据进行深度探索更具挑战性。判别属性信息往往难以得到充分利用,而传统聚类方法可能无法准确捕捉细胞类型的多样性。因此,提出了一种基于子空间特征置信度学习的scRNA-seq数据深度聚类方法scSFCL。通过基于核密度划分子空间,过滤出判别特征子集。通过将图卷积网络(GCN)与加权相结合来学习子集的特征置信度。此外,scSFCL通过整合GCN和基于零膨胀负二项式的去噪变分自编码器(DVAE-ZINB)的相互监督聚类,促进通用结构信息和特异信息的互补融合。通过在多个scRNA-seq数据集上的验证,结果表明scSFCL的聚类性能与传统方法相比有显著提高,为scRNA-seq数据的深度聚类提供了一种有效解决方案。