Brief Bioinform. 2019 Jul 19;20(4):1583-1589. doi: 10.1093/bib/bby011.
Traditional RNA sequencing (RNA-seq) allows the detection of gene expression variations between two or more cell populations through differentially expressed gene (DEG) analysis. However, genes that contribute to cell-to-cell differences are not discoverable with RNA-seq because RNA-seq samples are obtained from a mixture of cells. Single-cell RNA-seq (scRNA-seq) allows the detection of gene expression in each cell. With scRNA-seq, highly variable gene (HVG) discovery allows the detection of genes that contribute strongly to cell-to-cell variation within a homogeneous cell population, such as a population of embryonic stem cells. This analysis is implemented in many software packages. In this study, we compare seven HVG methods from six software packages, including BASiCS, Brennecke, scLVM, scran, scVEGs and Seurat. Our results demonstrate that reproducibility in HVG analysis requires a larger sample size than DEG analysis. Discrepancies between methods and potential issues in these tools are discussed and recommendations are made.
传统的 RNA 测序(RNA-seq)通过差异表达基因(DEG)分析允许检测两个或更多细胞群体之间的基因表达变化。然而,由于 RNA-seq 样品是从细胞混合物中获得的,因此无法发现导致细胞间差异的基因。单细胞 RNA-seq(scRNA-seq)允许检测每个细胞中的基因表达。通过 scRNA-seq,高变异基因(HVG)发现允许检测到对同质细胞群体(如胚胎干细胞群体)内细胞间变异有强烈贡献的基因。这种分析在许多软件包中都有实现。在这项研究中,我们比较了来自六个软件包的七种 HVG 方法,包括 BASiCS、Brennecke、scLVM、scran、scVEGs 和 Seurat。我们的结果表明,HVG 分析的重现性需要比 DEG 分析更大的样本量。讨论了方法之间的差异以及这些工具中的潜在问题,并提出了建议。