School of Computer Science and Engineering, Central South University, Changsha 410083, China.
School of Computer Science and Engineering, Central South University, Changsha 410083, China.
Methods. 2022 Sep;205:114-122. doi: 10.1016/j.ymeth.2022.06.010. Epub 2022 Jun 28.
The rapid development of single-cell sequencing technologies makes it possible to analyze cellular heterogeneity at the single-cell level. Cell clustering is one of the most fundamental and common steps in the heterogeneity analysis. However, due to the high noise level, high dimensionality and high sparsity, accurate cell clustering is still challengeable. Here, we present DeepCI, a new clustering approach for scRNA-seq data. Using two autoencoders to obtain cell embedding and gene embedding, DeepCI can simultaneously learn cell low-dimensional representation and clustering. In addition, the recovered gene expression matrix can be obtained by the matrix multiplication of cell and gene embedding. To evaluate the performance of DeepCI, we performed it on several real scRNA-seq datasets for clustering and visualization analysis. The experimental results show that DeepCI obtains the overall better performance than several popular single cell analysis methods. We also evaluated the imputation performance of DeepCI by a dedicated experiment. The corresponding results show that the imputed gene expression of known specific marker genes can greatly improve the accuracy of cell type classification.
单细胞测序技术的快速发展使得在单细胞水平上分析细胞异质性成为可能。细胞聚类是异质性分析中最基本和最常见的步骤之一。然而,由于噪声水平高、维度高和稀疏性高,准确的细胞聚类仍然具有挑战性。在这里,我们提出了 DeepCI,这是一种用于 scRNA-seq 数据的新聚类方法。使用两个自动编码器来获得细胞嵌入和基因嵌入,DeepCI 可以同时学习细胞的低维表示和聚类。此外,通过细胞和基因嵌入的矩阵乘法可以获得恢复的基因表达矩阵。为了评估 DeepCI 的性能,我们在几个真实的 scRNA-seq 数据集上进行了聚类和可视化分析。实验结果表明,DeepCI 的整体性能优于几种流行的单细胞分析方法。我们还通过专门的实验评估了 DeepCI 的插补性能。相应的结果表明,已知特定标记基因的插补基因表达可以大大提高细胞类型分类的准确性。