Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
VIB Center for Plant Systems Biology, Ghent, Belgium.
Nat Plants. 2021 Apr;7(4):500-513. doi: 10.1038/s41477-021-00894-1. Epub 2021 Apr 12.
Gene regulation is a dynamic process in which transcription factors (TFs) play an important role in controlling spatiotemporal gene expression. To enhance our global understanding of regulatory interactions in Arabidopsis thaliana, different regulatory input networks capturing complementary information about DNA motifs, open chromatin, TF-binding and expression-based regulatory interactions were combined using a supervised learning approach, resulting in an integrated gene regulatory network (iGRN) covering 1,491 TFs and 31,393 target genes (1.7 million interactions). This iGRN outperforms the different input networks to predict known regulatory interactions and has a similar performance to recover functional interactions compared to state-of-the-art experimental methods. The iGRN correctly inferred known functions for 681 TFs and predicted new gene functions for hundreds of unknown TFs. For regulators predicted to be involved in reactive oxygen species (ROS) stress regulation, we confirmed in total 75% of TFs with a function in ROS and/or physiological stress responses. This includes 13 ROS regulators, previously not connected to any ROS or stress function, that were experimentally validated in our ROS-specific phenotypic assays of loss- or gain-of-function lines. In conclusion, the presented iGRN offers a high-quality starting point to enhance our understanding of gene regulation in plants by integrating different experimental data types.
基因调控是一个动态的过程,转录因子(TFs)在控制时空基因表达方面起着重要作用。为了增强我们对拟南芥调控相互作用的全局理解,使用监督学习方法将不同的调控输入网络结合起来,这些网络捕捉了关于 DNA 基序、开放染色质、TF 结合和基于表达的调控相互作用的互补信息,从而构建了一个包含 1491 个 TF 和 31393 个靶基因(170 万个相互作用)的综合基因调控网络(iGRN)。该 iGRN 优于不同的输入网络,可预测已知的调控相互作用,并且与最先进的实验方法相比,其恢复功能相互作用的性能相似。iGRN 正确推断了 681 个 TF 的已知功能,并预测了数百个未知 TF 的新基因功能。对于预测参与活性氧(ROS)应激调节的调节剂,我们总共验证了 75%具有 ROS 和/或生理应激反应功能的 TF。这包括 13 个 ROS 调节剂,以前与任何 ROS 或应激功能都没有联系,它们在我们的 ROS 特异性表型缺失或功能获得系的功能验证实验中得到了实验验证。总之,所提出的 iGRN 通过整合不同的实验数据类型,为我们理解植物中的基因调控提供了一个高质量的起点。