Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD 21287, United States.
Department of Biomedical Engineering, Johns Hopkins University School of Medicine, Baltimore, MD 21218, United States.
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad521.
Single-cell sequencing technology has become a routine in studying many biological problems. A core step of analyzing single-cell data is the assignment of cell clusters to specific cell types. Reference-based methods are proposed for predicting cell types for single-cell clusters. However, the scalability and lack of preprocessed reference datasets prevent them from being practical and easy to use.
Here, we introduce a reference-based cell annotation web server, CellAnn, which is super-fast and easy to use. CellAnn contains a comprehensive reference database with 204 human and 191 mouse single-cell datasets. These reference datasets cover 32 organs. Furthermore, we developed a cluster-to-cluster alignment method to transfer cell labels from the reference to the query datasets, which is superior to the existing methods with higher accuracy and higher scalability. Finally, CellAnn is an online tool that integrates all the procedures in cell annotation, including reference searching, transferring cell labels, visualizing results, and harmonizing cell annotation labels. Through the user-friendly interface, users can identify the best annotation by cross-validating with multiple reference datasets. We believe that CellAnn can greatly facilitate single-cell sequencing data analysis.
The web server is available at www.cellann.io, and the source code is available at https://github.com/Pinlyu3/CellAnn_shinyapp.
单细胞测序技术已成为研究许多生物学问题的常规手段。分析单细胞数据的核心步骤是将细胞簇分配到特定的细胞类型。已经提出了基于参考的方法来预测单细胞簇的细胞类型。然而,可扩展性和缺乏预处理的参考数据集妨碍了它们的实际应用和易用性。
在这里,我们介绍了一个基于参考的细胞注释网络服务器 CellAnn,它快速且易于使用。CellAnn 包含一个全面的参考数据库,其中包含 204 个人类和 191 个小鼠单细胞数据集。这些参考数据集涵盖 32 个器官。此外,我们开发了一种簇到簇的对齐方法,将细胞标签从参考数据集转移到查询数据集,该方法在准确性和可扩展性方面都优于现有方法。最后,CellAnn 是一个集成了细胞注释所有步骤的在线工具,包括参考搜索、转移细胞标签、可视化结果和协调细胞注释标签。通过用户友好的界面,用户可以通过交叉验证多个参考数据集来识别最佳注释。我们相信 CellAnn 可以极大地促进单细胞测序数据分析。
该网络服务器可在 www.cellann.io 上使用,源代码可在 https://github.com/Pinlyu3/CellAnn_shinyapp 上获得。