Division of Automatic Control, Department of Electrical Engineering, Linköping University, SE-58183 Linköping, Sweden.
Bioinformatics, Department of Physics, Chemistry and Biology, Linköping University, SE-58183 Linköping, Sweden.
Bioinformatics. 2021 Dec 22;38(1):173-178. doi: 10.1093/bioinformatics/btab577.
The simultaneous availability of ATAC-seq and RNA-seq experiments allows to obtain a more in-depth knowledge on the regulatory mechanisms occurring in gene regulatory networks. In this article, we highlight and analyze two novel aspects that leverage on the possibility of pairing RNA-seq and ATAC-seq data. Namely we investigate the causality of the relationships between transcription factors, chromatin and target genes and the internal consistency between the two omics, here measured in terms of structural balance in the sample correlations along elementary length-3 cycles.
We propose a framework that uses the a priori knowledge on the data to infer elementary causal regulatory motifs (namely chains and forks) in the network. It is based on the notions of conditional independence and partial correlation, and can be applied to both longitudinal and non-longitudinal data. Our analysis highlights a strong connection between the causal regulatory motifs that are selected by the data and the structural balance of the underlying sample correlation graphs: strikingly, >97% of the selected regulatory motifs belong to a balanced subgraph. This result shows that internal consistency, as measured by structural balance, is close to a necessary condition for 3-node regulatory motifs to satisfy causality rules.
The analysis was carried out in MATLAB and the code can be found at https://github.com/albertozenere/Multi-omics-elementary-regulatory-motifs.
Supplementary data are available at Bioinformatics online.
ATAC-seq 和 RNA-seq 实验的同时可用性允许获得关于基因调控网络中发生的调控机制的更深入的知识。在本文中,我们强调并分析了利用 RNA-seq 和 ATAC-seq 数据配对的可能性的两个新方面。即,我们研究了转录因子、染色质和靶基因之间关系的因果关系,以及两种组学之间的内在一致性,这里通过样本相关沿基本长度-3 循环的结构平衡来衡量。
我们提出了一个框架,该框架使用关于数据的先验知识来推断网络中的基本因果调节基序(即链和叉)。它基于条件独立性和偏相关的概念,可应用于纵向和非纵向数据。我们的分析强调了由数据选择的因果调节基序与基础样本相关图的结构平衡之间的强烈联系:惊人的是,>97%选择的调节基序属于平衡子图。这一结果表明,结构平衡所衡量的内在一致性接近于满足因果规则的 3 节点调节基序的必要条件。
分析是在 MATLAB 中进行的,代码可在 https://github.com/albertozenere/Multi-omics-elementary-regulatory-motifs 上找到。
补充数据可在生物信息学在线获得。