Faculty of Engineering and IT in the University of Technology Sydney.
School of Biomedical Engineering, University of Technology Sydney.
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa248.
Single-cell mRNA sequencing has been adopted as a powerful technique for understanding gene expression profiles at the single-cell level. However, challenges remain due to factors such as the inefficiency of mRNA molecular capture, technical noises and separate sequencing of cells in different batches. Normalization methods have been developed to ensure a relatively accurate analysis. This work presents a survey on 10 tools specifically designed for single-cell mRNA sequencing data preprocessing steps, among which 6 tools are used for dropout normalization and 4 tools are for batch effect correction. In this survey, we outline the main methodology for each of these tools, and we also compare these tools to evaluate their normalization performance on datasets which are simulated under the constraints of dropout inefficiency, batch effect or their combined effects. We found that Saver and Baynorm performed better than other methods in dropout normalization, in most cases. Beer and Batchelor performed better in the batch effect normalization, and the Saver-Beer tool combination and the Baynorm-Beer combination performed better in the mixed dropout-and-batch effect normalization. Over-normalization is a common issue occurred to these dropout normalization tools that is worth of future investigation. For the batch normalization tools, the capability of retaining heterogeneity between different groups of cells after normalization can be another direction for future improvement.
单细胞 mRNA 测序已被采纳为一种强大的技术,用于在单细胞水平上理解基因表达谱。然而,由于 mRNA 分子捕获效率低、技术噪声和不同批次细胞的分离测序等因素,仍然存在挑战。归一化方法已被开发出来以确保相对准确的分析。这项工作对专门设计用于单细胞 mRNA 测序数据预处理步骤的 10 个工具进行了调查,其中 6 个工具用于随机缺失归一化,4 个工具用于批处理效应校正。在这项调查中,我们概述了这些工具中的每一个的主要方法,并对这些工具进行了比较,以评估它们在模拟随机缺失效率、批处理效应或其组合效应的约束下的数据集上的归一化性能。我们发现,在大多数情况下,Saver 和 Baynorm 在随机缺失归一化方面的性能优于其他方法。Beer 和 Batchelor 在批处理效应归一化方面表现更好,而 Saver-Beer 工具组合和 Baynorm-Beer 组合在混合随机缺失和批处理效应归一化方面表现更好。过度归一化是这些随机缺失归一化工具中常见的问题,值得未来进一步研究。对于批处理归一化工具,在归一化后保留不同细胞群之间异质性的能力可能是未来改进的另一个方向。