Biotech Research and Innovation Centre (BRIC), University of Copenhagen, 2200 Copenhagen N, Denmark.
Novo Nordisk Foundation Center for Stem Cell Biology, DanStem, Faculty of Health and Medical Sciences, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen N, Denmark.
Nucleic Acids Res. 2021 Jan 11;49(1):e1. doi: 10.1093/nar/gkaa1014.
Accurate prediction of gene regulatory rules is important towards understanding of cellular processes. Existing computational algorithms devised for bulk transcriptomics typically require a large number of time points to infer gene regulatory networks (GRNs), are applicable for a small number of genes and fail to detect potential causal relationships effectively. Here, we propose a novel approach 'TENET' to reconstruct GRNs from single cell RNA sequencing (scRNAseq) datasets. Employing transfer entropy (TE) to measure the amount of causal relationships between genes, TENET predicts large-scale gene regulatory cascades/relationships from scRNAseq data. TENET showed better performance than other GRN reconstructors, in identifying key regulators from public datasets. Specifically from scRNAseq, TENET identified key transcriptional factors in embryonic stem cells (ESCs) and during direct cardiomyocytes reprogramming, where other predictors failed. We further demonstrate that known target genes have significantly higher TE values, and TENET predicted higher TE genes were more influenced by the perturbation of their regulator. Using TENET, we identified and validated that Nme2 is a culture condition specific stem cell factor. These results indicate that TENET is uniquely capable of identifying key regulators from scRNAseq data.
准确预测基因调控规则对于理解细胞过程非常重要。现有的用于批量转录组学的计算算法通常需要大量的时间点来推断基因调控网络(GRN),适用于少数基因,并且无法有效地检测潜在的因果关系。在这里,我们提出了一种从单细胞 RNA 测序(scRNAseq)数据集中重建 GRN 的新方法 'TENET'。TENET 采用传递熵(TE)来衡量基因之间因果关系的数量,从 scRNAseq 数据中预测大规模基因调控级联/关系。TENET 在从公共数据集识别关键调节剂方面的表现优于其他 GRN 重构器。具体来说,从 scRNAseq 中,TENET 确定了胚胎干细胞(ESCs)和直接心肌细胞重编程过程中的关键转录因子,而其他预测器则无法做到这一点。我们进一步证明,已知的靶基因具有显著更高的 TE 值,并且 TENET 预测的具有更高 TE 值的基因受到其调节剂的干扰更大。使用 TENET,我们鉴定并验证了 Nme2 是一种特定于培养条件的干细胞因子。这些结果表明,TENET 能够独特地从 scRNAseq 数据中识别关键调节剂。