Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Corneel Heymanslaan 10, 9000 Ghent, Belgium.
Department of Biomedical Molecular Biology, Ghent University, Zwijnaarde-Technologiepark 71, 9052 Ghent, Belgium.
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae382.
Eukaryotic gene regulation is a combinatorial, dynamic, and quantitative process that plays a vital role in development and disease and can be modeled at a systems level in gene regulatory networks (GRNs). The wealth of multi-omics data measured on the same samples and even on the same cells has lifted the field of GRN inference to the next stage. Combinations of (single-cell) transcriptomics and chromatin accessibility allow the prediction of fine-grained regulatory programs that go beyond mere correlation of transcription factor and target gene expression, with enhancer GRNs (eGRNs) modeling molecular interactions between transcription factors, regulatory elements, and target genes. In this review, we highlight the key components for successful (e)GRN inference from (sc)RNA-seq and (sc)ATAC-seq data exemplified by state-of-the-art methods as well as open challenges and future developments. Moreover, we address preprocessing strategies, metacell generation and computational omics pairing, transcription factor binding site detection, and linear and three-dimensional approaches to identify chromatin interactions as well as dynamic and causal eGRN inference. We believe that the integration of transcriptomics together with epigenomics data at a single-cell level is the new standard for mechanistic network inference, and that it can be further advanced with integrating additional omics layers and spatiotemporal data, as well as with shifting the focus towards more quantitative and causal modeling strategies.
真核基因调控是一个组合、动态和定量的过程,在发育和疾病中起着至关重要的作用,可以在基因调控网络 (GRN) 中进行系统水平的建模。在同一样本甚至同一细胞上测量的大量多组学数据,使 GRN 推断领域提升到了一个新的阶段。(单细胞)转录组学和染色质可及性的组合,使得可以预测精细的调控程序,超越了转录因子和靶基因表达的简单相关性,增强子 GRN (eGRN) 对转录因子、调控元件和靶基因之间的分子相互作用进行建模。在这篇综述中,我们强调了从(sc)RNA-seq 和(sc)ATAC-seq 数据中成功推断(e)GRN 的关键组成部分,以最先进的方法为例,以及存在的挑战和未来的发展。此外,我们还讨论了预处理策略、元细胞生成和计算组学配对、转录因子结合位点检测以及用于识别染色质相互作用的线性和三维方法,以及动态和因果 eGRN 推断。我们相信,在单细胞水平上将转录组学与表观基因组学数据进行整合是机制网络推断的新标准,并且通过整合更多的组学层和时空数据,以及将重点转移到更定量和因果建模策略上,可以进一步推进这一标准。