IEEE J Biomed Health Inform. 2024 Nov;28(11):7015-7027. doi: 10.1109/JBHI.2024.3460761. Epub 2024 Nov 6.
Advancing in single-cell RNA sequencing techniques enhances the resolution of cell heterogeneity study. Density-based unsupervised clustering has the potential to detect the representative anchor points and the number of clusters automatically. Meanwhile, discovering the true cell type of scRNA-seq data in the unsupervised scenario is still challenging. To this end, we proposed a tensor shared nearest neighbor anchor clustering for scRNA-seq data, named scTSNN, which first makes use of the tensor affinity learning module to mine the local-global balanced topological structures among cells, next designs density-based shared nearest neighbor measurement method to automatically detect anchor cells, finally partitions the non-anchor cells to obtain the clustering results. Validated on synthetic datasets and scRNA-seq datasets, scTSNN not only exactly detects the complicated structures but also has better performance in accuracy and robustness compared with the state-of-the-art methods. Moreover, case studies on mammalian cells and cervical cancer tumor cells demonstrate the selected anchor cells of scTSNN benefit the cell pseudotime inference and rare cell identification, which show good application and research value of scTSNN.
单细胞 RNA 测序技术的进步提高了细胞异质性研究的分辨率。基于密度的无监督聚类有可能自动检测代表性锚点和聚类数量。同时,在无监督场景下发现 scRNA-seq 数据的真实细胞类型仍然具有挑战性。为此,我们提出了一种用于 scRNA-seq 数据的张量共享最近邻锚聚类方法,称为 scTSNN,它首先利用张量关联学习模块挖掘细胞之间的局部-全局平衡拓扑结构,其次设计基于密度的共享最近邻度量方法自动检测锚细胞,最后将非锚细胞划分成聚类结果。在合成数据集和 scRNA-seq 数据集上的验证表明,scTSNN 不仅能准确检测复杂结构,而且在准确性和鲁棒性方面的性能也优于最新方法。此外,哺乳动物细胞和宫颈癌肿瘤细胞的案例研究表明,scTSNN 选择的锚细胞有利于细胞拟时推断和稀有细胞识别,这表明 scTSNN 具有良好的应用和研究价值。