Telethon Institute of Genetics and Medicine (TIGEM), Armenise/Harvard Laboratory of Integrative Genomics, Pozzuoli, Italy.
Istituto per le Applicazioni del Calcolo "Mauro Picone", Consiglio Nazionale delle Ricerche, Naples, Italy.
Methods Mol Biol. 2021;2284:343-365. doi: 10.1007/978-1-0716-1307-8_19.
Thanks to innovative sample-preparation and sequencing technologies, gene expression in individual cells can now be measured for thousands of cells in a single experiment. Since its introduction, single-cell RNA sequencing (scRNA-seq) approaches have revolutionized the genomics field as they created unprecedented opportunities for resolving cell heterogeneity by exploring gene expression profiles at a single-cell resolution. However, the rapidly evolving field of scRNA-seq invoked the emergence of various analytics approaches aimed to maximize the full potential of this novel strategy. Unlike population-based RNA sequencing approaches, scRNA seq necessitates comprehensive computational tools to address high data complexity and keep up with the emerging single-cell associated challenges. Despite the vast number of analytical methods, a universal standardization is lacking. While this reflects the fields' immaturity, it may also encumber a newcomer to blend in.In this review, we aim to bridge over the abovementioned hurdle and propose four ready-to-use pipelines for scRNA-seq analysis easily accessible by a newcomer, that could fit various biological data types. Here we provide an overview of the currently available single-cell technologies for cell isolation and library preparation and a step by step guide that covers the entire canonical analytic workflow to analyse scRNA-seq data including read mapping, quality controls, gene expression quantification, normalization, feature selection, dimensionality reduction, and cell clustering useful for trajectory inference and differential expression. Such workflow guidelines will escort novices as well as expert users in the analysis of complex scRNA-seq datasets, thus further expanding the research potential of single-cell approaches in basic science, and envisaging its future implementation as best practice in the field.
由于创新的样本制备和测序技术,现在可以在单个实验中测量数千个细胞中的单个细胞基因表达。自推出以来,单细胞 RNA 测序 (scRNA-seq) 方法彻底改变了基因组学领域,因为它们通过探索单细胞分辨率的基因表达谱,为解析细胞异质性创造了前所未有的机会。然而,scRNA-seq 这一快速发展的领域引发了各种分析方法的出现,旨在最大限度地发挥这一新策略的潜力。与基于群体的 RNA 测序方法不同,scRNA-seq 需要全面的计算工具来解决高数据复杂性问题,并跟上新兴的单细胞相关挑战。尽管有大量的分析方法,但缺乏普遍的标准化。虽然这反映了该领域的不成熟,但也可能会阻碍新手融入其中。在这篇综述中,我们旨在克服上述障碍,提出四个易于新手使用的 scRNA-seq 分析即用型管道,它们可以适用于各种生物数据类型。在这里,我们概述了目前用于细胞分离和文库制备的单细胞技术,并提供了一个逐步指南,涵盖了分析 scRNA-seq 数据的整个标准分析工作流程,包括读取映射、质量控制、基因表达定量、归一化、特征选择、降维和细胞聚类,这些都有助于轨迹推断和差异表达。这些工作流程指南将为新手和专家用户在分析复杂的 scRNA-seq 数据集时提供指导,从而进一步扩大单细胞方法在基础科学中的研究潜力,并设想将其作为该领域的最佳实践进行未来的实施。