Boufea Katerina, Seth Sohan, Batada Nizar N
Institute for Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, UK.
School of Informatics, University of Edinburgh, Edinburgh Eh8 9AB, UK.
iScience. 2020 Mar 27;23(3):100914. doi: 10.1016/j.isci.2020.100914. Epub 2020 Feb 14.
The power of single-cell RNA sequencing (scRNA-seq) stems from its ability to uncover cell type-dependent phenotypes, which rests on the accuracy of cell type identification. However, resolving cell types within and, thus, comparison of scRNA-seq data across conditions is challenging owing to technical factors such as sparsity, low number of cells, and batch effect. To address these challenges, we developed scID (Single Cell IDentification), which uses the Fisher's Linear Discriminant Analysis-like framework to identify transcriptionally related cell types between scRNA-seq datasets. We demonstrate the accuracy and performance of scID relative to existing methods on several published datasets. By increasing power to identify transcriptionally similar cell types across datasets with batch effect, scID enhances investigator's ability to integrate and uncover development-, disease-, and perturbation-associated changes in scRNA-seq data.
单细胞RNA测序(scRNA-seq)的强大之处源于其揭示细胞类型依赖性表型的能力,而这又依赖于细胞类型识别的准确性。然而,由于稀疏性、细胞数量少和批次效应等技术因素,解析scRNA-seq数据中的细胞类型以及跨条件比较scRNA-seq数据具有挑战性。为应对这些挑战,我们开发了scID(单细胞识别),它使用类似Fisher线性判别分析的框架来识别scRNA-seq数据集之间转录相关的细胞类型。我们在几个已发表的数据集上证明了scID相对于现有方法的准确性和性能。通过提高识别具有批次效应的跨数据集转录相似细胞类型的能力,scID增强了研究人员整合和揭示scRNA-seq数据中与发育、疾病和扰动相关变化的能力。