School of Computer, National University of Defense Technology, No. 109 Deya Road, 410073 Changsha, Hunan, China.
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad216.
Clustering methods have been widely used in single-cell RNA-seq data for investigating tumor heterogeneity. Since traditional clustering methods fail to capture the high-dimension methods, deep clustering methods have drawn increasing attention these years due to their promising strengths on the task. However, existing methods consider either the attribute information of each cell or the structure information between different cells. In other words, they cannot sufficiently make use of all of this information simultaneously. To this end, we propose a novel single-cell deep fusion clustering model, which contains two modules, i.e. an attributed feature clustering module and a structure-attention feature clustering module. More concretely, two elegantly designed autoencoders are built to handle both features regardless of their data types. Experiments have demonstrated the validity of the proposed approach, showing that it is efficient to fuse attributes, structure, and attention information on single-cell RNA-seq data. This work will be further beneficial for investigating cell subpopulations and tumor microenvironment. The Python implementation of our work is now freely available at https://github.com/DayuHuu/scDFC.
聚类方法已广泛应用于单细胞 RNA-seq 数据中,用于研究肿瘤异质性。由于传统的聚类方法无法捕捉到高维信息,近年来,深度学习聚类方法由于在该任务上具有很大的潜力,因此受到了越来越多的关注。然而,现有的方法要么考虑每个细胞的属性信息,要么考虑不同细胞之间的结构信息。换句话说,它们不能同时充分利用所有这些信息。为此,我们提出了一种新的单细胞深度融合聚类模型,该模型包含两个模块,即属性特征聚类模块和结构注意特征聚类模块。更具体地说,构建了两个精心设计的自动编码器来处理无论数据类型如何的特征。实验证明了所提出方法的有效性,表明它能够有效地融合单细胞 RNA-seq 数据中的属性、结构和注意信息。这项工作将进一步有助于研究细胞亚群和肿瘤微环境。我们工作的 Python 实现现在可在 https://github.com/DayuHuu/scDFC 上免费获得。