Cordero Pablo, Stuart Joshua M
UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California, USA.
Pac Symp Biocomput. 2017;22:576-587. doi: 10.1142/9789813207813_0053.
The availability of gene expression data at the single cell level makes it possible to probe the molecular underpinnings of complex biological processes such as differentiation and oncogenesis. Promising new methods have emerged for reconstructing a progression 'trajectory' from static single-cell transcriptome measurements. However, it remains unclear how to adequately model the appreciable level of noise in these data to elucidate gene regulatory network rewiring. Here, we present a framework called Single Cell Inference of MorphIng Trajectories and their Associated Regulation (SCIMITAR) that infers progressions from static single-cell transcriptomes by employing a continuous parametrization of Gaussian mixtures in high-dimensional curves. SCIMITAR yields rich models from the data that highlight genes with expression and co-expression patterns that are associated with the inferred progression. Further, SCIMITAR extracts regulatory states from the implicated trajectory-evolvingco-expression networks. We benchmark the method on simulated data to show that it yields accurate cell ordering and gene network inferences. Applied to the interpretation of a single-cell human fetal neuron dataset, SCIMITAR finds progression-associated genes in cornerstone neural differentiation pathways missed by standard differential expression tests. Finally, by leveraging the rewiring of gene-gene co-expression relations across the progression, the method reveals the rise and fall of co-regulatory states and trajectory-dependent gene modules. These analyses implicate new transcription factors in neural differentiation including putative co-factors for the multi-functional NFAT pathway.
单细胞水平上基因表达数据的可用性使得探究诸如分化和肿瘤发生等复杂生物过程的分子基础成为可能。从静态单细胞转录组测量中重建进展“轨迹”的有前景的新方法已经出现。然而,目前尚不清楚如何充分模拟这些数据中相当程度的噪声,以阐明基因调控网络的重新布线。在此,我们提出了一个名为“形态发生轨迹及其相关调控的单细胞推断”(SCIMITAR)的框架,该框架通过在高维曲线中采用高斯混合的连续参数化,从静态单细胞转录组中推断进展。SCIMITAR从数据中生成丰富的模型,突出显示具有与推断进展相关的表达和共表达模式的基因。此外,SCIMITAR从涉及的轨迹演化共表达网络中提取调控状态。我们在模拟数据上对该方法进行基准测试,以表明它能产生准确的细胞排序和基因网络推断。应用于对单细胞人类胎儿神经元数据集的解释时,SCIMITAR在标准差异表达测试遗漏的基石神经分化途径中发现了与进展相关的基因。最后,通过利用进展过程中基因-基因共表达关系的重新布线,该方法揭示了共调控状态和轨迹依赖性基因模块的兴衰。这些分析表明新的转录因子参与神经分化,包括多功能NFAT途径的假定辅助因子。