School of Mathematical Sciences, University of Science and Technology of China, 230026 Hefei, Anhui, China.
Department of Oncology, The First Affiliated Hospital of USTC, Division of Molecular Medicine, Hefei National Laboratory for Physical Sciences at Microscale, School of Basic Medical Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, 230021 Hefei, Anhui, China.
G3 (Bethesda). 2021 Jun 17;11(6). doi: 10.1093/g3journal/jkab098.
Unsupervised clustering is a fundamental step of single-cell RNA-sequencing (scRNA-seq) data analysis. This issue has inspired several clustering methods to classify cells in scRNA-seq data. However, accurate prediction of the cell clusters remains a substantial challenge. In this study, we propose a new algorithm for scRNA-seq data clustering based on Sparse Optimization and low-rank matrix factorization (scSO). We applied our scSO algorithm to analyze multiple benchmark datasets and showed that the cluster number predicted by scSO was close to the number of reference cell types and that most cells were correctly classified. Our scSO algorithm is available at https://github.com/QuKunLab/scSO. Overall, this study demonstrates a potent cell clustering approach that can help researchers distinguish cell types in single- scRNA-seq data.
无监督聚类是单细胞 RNA 测序 (scRNA-seq) 数据分析的基本步骤。这一问题激发了几种聚类方法的出现,以对 scRNA-seq 数据中的细胞进行分类。然而,准确预测细胞簇仍然是一个重大挑战。在本研究中,我们提出了一种基于稀疏优化和低秩矩阵分解的 scRNA-seq 数据聚类新算法 (scSO)。我们将 scSO 算法应用于多个基准数据集的分析,结果表明 scSO 预测的聚类数量与参考细胞类型的数量接近,并且大多数细胞被正确分类。我们的 scSO 算法可在 https://github.com/QuKunLab/scSO 上获取。总的来说,本研究展示了一种强大的细胞聚类方法,可以帮助研究人员在单细胞 scRNA-seq 数据中区分细胞类型。