Department of Veterinary Integrative Biosciences, Texas A&M University, College Station, TX 77843, USA.
Department of Veterinary Pathobiology, Texas A&M University, College Station, TX 77843, USA.
Cells. 2019 Dec 19;9(1):14. doi: 10.3390/cells9010014.
As single-cell RNA sequencing (scRNA-seq) data become widely available, cell-to-cell variability in gene expression, or scEV, has been increasingly appreciated. However, it remains unclear whether this variability is functionally important and, if so, what its implications are for multicellular organisms. Here, we analyzed multiple scRNA-seq data sets from lymphoblastoid cell lines (LCLs), lung airway epithelial cells (LAECs), and dermal fibroblasts (DFs) and, for each cell type, selected a group of homogeneous cells with highly similar expression profiles. We estimated the scEV level of each gene after correcting for the mean-variance dependency in the data and identified 465, 466, and 364 highly variable genes (HVGs) in LCLs, LAECs, and DFs, respectively. These HVGs were enriched for biological processes precisely relevant to the function of the cell type from which the scRNA-seq data were generated; e.g., cytokine signaling pathways were enriched in HVGs identified in LCLs, collagen formation in DFs, and keratinization in LAECs. We repeated the same analysis with scRNA-seq data from induced pluripotent stem cells (iPSCs) and identified only 79 HVGs with no statistically significant enriched functions; the overall scEV in iPSCs was of negligible magnitude. Our results support the "variation is function" hypothesis, which argues that scEV is required for cell type-specific, higher-level system function. Thus, quantifying and characterizing scEV are important for our understanding of normal and pathological cellular processes.
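To make the idea of estimating scEV "after correcting for the mean-variance dependency" concrete, here is a minimal Python sketch. It is not the authors' pipeline: the abstract does not specify the fitting procedure, so the polynomial trend fitted in log-log space, the function names `find_hvgs` and `simulate_counts`, and all parameter values are assumptions made for illustration. The sketch computes the squared coefficient of variation (CV²) per gene, fits its expected value as a function of mean expression, and ranks genes by how far they sit above that trend.

```python
import numpy as np


def find_hvgs(counts, n_top=5, degree=2):
    """Rank genes by excess variability over a fitted mean-CV^2 trend.

    counts: genes x cells array (hypothetical input layout).
    Returns the indices of the n_top genes and the per-gene residuals.
    """
    counts = np.asarray(counts, dtype=float)
    mean = counts.mean(axis=1)
    var = counts.var(axis=1, ddof=1)

    # CV^2 is only defined for genes with a non-zero mean.
    cv2 = np.full(mean.shape, np.nan)
    expressed = mean > 0
    cv2[expressed] = var[expressed] / mean[expressed] ** 2

    # Fit the trend only on genes whose log-transform is finite.
    fit_mask = expressed & (cv2 > 0)
    log_mean = np.log10(mean[fit_mask])
    log_cv2 = np.log10(cv2[fit_mask])

    # Assumption: a low-degree polynomial in log-log space stands in
    # for whatever trend model a real analysis would use.
    coeffs = np.polyfit(log_mean, log_cv2, deg=degree)

    # Residual = observed variability minus what the mean alone predicts.
    residual = np.full(mean.shape, -np.inf)
    residual[fit_mask] = log_cv2 - np.polyval(coeffs, log_mean)

    top = np.argsort(residual)[::-1][:n_top]
    return top, residual


def simulate_counts(n_genes=200, n_cells=300, seed=0):
    """Toy data: Poisson genes plus a few genes with extra cell-to-cell noise."""
    rng = np.random.default_rng(seed)
    means = np.logspace(0.5, 2, n_genes)
    rates = np.tile(means[:, None], (1, n_cells))
    noisy = np.array([50, 90, 120, 150, 180])
    # Per-cell gamma multipliers add variability beyond Poisson sampling.
    rates[noisy] *= rng.gamma(shape=1.0, scale=1.0, size=(noisy.size, n_cells))
    return rng.poisson(rates), noisy


counts, noisy = simulate_counts()
top, residual = find_hvgs(counts, n_top=5)
print(sorted(top.tolist()), sorted(noisy.tolist()))
```

Ranking by residual rather than by raw variance or raw CV² is the point of the correction: lowly expressed genes have high CV² from sampling noise alone, so only the variability left after removing the mean-dependent trend is treated as a candidate signal.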