Key Laboratory of Biorheological Science and Technology, Ministry of Education, Bioengineering College, Chongqing University, Chongqing, 400044, China; College of Life Sciences, Chongqing Normal University, Chongqing, 400044, PR China.
Department of Biology, College of Science, Northeastern University, Boston, MA, 02115, USA.
Comput Biol Med. 2023 Jun;159:106939. doi: 10.1016/j.compbiomed.2023.106939. Epub 2023 Apr 15.
With the rapid development of single-cell RNA-sequencing techniques, various computational methods and tools were proposed to analyze these high-throughput data, which led to an accelerated reveal of potential biological information. As one of the core steps of single-cell transcriptome data analysis, clustering plays a crucial role in identifying cell types and interpreting cellular heterogeneity. However, the results generated by different clustering methods showed distinguishing, and those unstable partitions can affect the accuracy of the analysis to a certain extent. To overcome this challenge and obtain more accurate results, currently clustering ensemble is frequently applied to cluster analysis of single-cell transcriptome datasets, and the results generated by all clustering ensembles are nearly more reliable than those from most of the single clustering partitions. In this review, we summarize applications and challenges of the clustering ensemble method in single-cell transcriptome data analysis, and provide constructive thoughts and references for researchers in this field.
随着单细胞 RNA 测序技术的快速发展,提出了各种计算方法和工具来分析这些高通量数据,从而加速了潜在生物学信息的揭示。作为单细胞转录组数据分析的核心步骤之一,聚类在识别细胞类型和解释细胞异质性方面起着至关重要的作用。然而,不同聚类方法产生的结果存在明显差异,这些不稳定的分区在一定程度上会影响分析的准确性。为了克服这一挑战并获得更准确的结果,目前聚类集成经常应用于单细胞转录组数据集的聚类分析,所有聚类集成产生的结果几乎比大多数单个聚类分区更可靠。在本文中,我们总结了聚类集成方法在单细胞转录组数据分析中的应用和挑战,并为该领域的研究人员提供了建设性的思路和参考。