SC3s: efficient scaling of single cell consensus clustering to millions of cells.

Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, CB10 1SA, UK.

The Gurdon Institute, University of Cambridge, Tennis Court Road, Cambridge, CB2 1QN, UK.

BMC Bioinformatics. 2022 Dec 12;23(1):536. doi: 10.1186/s12859-022-05085-z.

BACKGROUND

Today it is possible to profile the transcriptome of individual cells, and a key step in the analysis of these datasets is unsupervised clustering. For very large datasets, efficient algorithms are required to ensure that analyses can be conducted with reasonable time and memory requirements.

RESULTS

Here, we present a highly efficient k-means-based approach, and we demonstrate that it scales favorably with the number of cells with regard to time and memory.

CONCLUSIONS

We have demonstrated that our streaming k-means clustering algorithm gives state-of-the-art performance while resource requirements scale favorably for up to 2 million cells.
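To illustrate why a streaming formulation keeps memory use low, the following is a minimal sketch of sequential (online) k-means in plain Python. It is not the SC3s implementation: the function names, the synthetic two-group data, and the fixed starting centroids are all assumptions made for illustration. Each batch is read once, every point is assigned to its nearest centroid, and that centroid is nudged toward the point with a running-mean update, so the full dataset never needs to be held in memory.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(point, centroids):
    """Index of the centroid closest to the given point."""
    return min(range(len(centroids)),
               key=lambda i: squared_distance(point, centroids[i]))


def streaming_kmeans(batches, initial_centroids):
    """Sequential k-means: one pass over the batches, running-mean updates.

    `batches` can be any iterable (for example a generator), so only one
    batch is in memory at a time. This is an illustrative sketch, not the
    algorithm as implemented in SC3s.
    """
    centroids = [list(c) for c in initial_centroids]
    counts = [0] * len(centroids)
    for batch in batches:
        for point in batch:
            j = nearest(point, centroids)
            counts[j] += 1
            rate = 1.0 / counts[j]  # running-mean step size
            centroids[j] = [c + rate * (x - c)
                            for c, x in zip(centroids[j], point)]
    return centroids


def make_batches(n_batches, batch_size, rng):
    """Hypothetical data source: two well-separated 2-D groups."""
    for _ in range(n_batches):
        batch = []
        for _ in range(batch_size):
            cx, cy = rng.choice([(0.0, 0.0), (10.0, 10.0)])
            batch.append((rng.gauss(cx, 1.0), rng.gauss(cy, 1.0)))
        yield batch


rng = random.Random(0)
centroids = streaming_kmeans(make_batches(20, 50, rng),
                             [(1.0, 1.0), (9.0, 9.0)])
labels = [nearest(p, centroids) for p in [(0.2, -0.1), (9.8, 10.3)]]
```

Because the centroids and their counts are the only state carried between batches, memory grows with the number of clusters and features rather than with the number of cells, which is the property that makes streaming approaches attractive for very large datasets.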
