Kim Beomseok, Lee Eunmin, Kim Jong Kyoung
Department of New Biology, DGIST, Daegu, Republic of Korea.
Methods Mol Biol. 2019;1935:25-43. doi: 10.1007/978-1-4939-9057-3_3.
Profiling the transcriptomes of individual cells with single-cell RNA sequencing (scRNA-seq) has been widely applied to provide a detailed molecular characterization of cellular heterogeneity within a population of cells. Despite recent technological advances of scRNA-seq, technical variability of gene expression in scRNA-seq is still much higher than that in bulk RNA-seq. Accounting for technical variability is therefore a prerequisite for correctly analyzing single-cell data. This chapter describes a computational pipeline for detecting highly variable genes exhibiting higher cell-to-cell variability than expected by technical noise. The basic pipeline using the scater and scran R/Bioconductor packages includes deconvolution-based normalization, fitting the mean-variance trend, testing for nonzero biological variability, and visualization with highly variable genes. An outline of the underlying theory of detecting highly variable genes is also presented. We illustrate how the pipeline works by using two case studies, one from mouse embryonic stem cells with external RNA spike-ins, and the other from mouse dentate gyrus cells without spike-ins.
通过单细胞RNA测序(scRNA-seq)对单个细胞的转录组进行分析已被广泛应用,以详细分子表征细胞群体内的细胞异质性。尽管scRNA-seq最近有技术进步,但scRNA-seq中基因表达的技术变异性仍远高于批量RNA测序中的变异性。因此,考虑技术变异性是正确分析单细胞数据的先决条件。本章描述了一个计算流程,用于检测表现出比技术噪声预期更高的细胞间变异性的高变异性基因。使用scater和scran R/Bioconductor软件包的基本流程包括基于反卷积的归一化、拟合均值-方差趋势、测试非零生物学变异性以及用高变异性基因进行可视化。还介绍了检测高变异性基因的基础理论概述。我们通过两个案例研究来说明该流程的工作方式,一个来自带有外部RNA加标的小鼠胚胎干细胞,另一个来自没有加标的小鼠齿状回细胞。