全局转录调控网络将基因表达与转录因子活性紧密地连接起来。

Global transcriptional regulatory network for robustly connects gene expression to transcription factor activities.

机构信息

Department of Bioengineering, University of California at San Diego, La Jolla, CA 92093.

Bioinformatics and Systems Biology Program, University of California at San Diego, La Jolla, CA 92093.

出版信息

Proc Natl Acad Sci U S A. 2017 Sep 19;114(38):10286-10291. doi: 10.1073/pnas.1702581114. Epub 2017 Sep 5.

DOI:10.1073/pnas.1702581114

PMID:28874552

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5617254/

Abstract

Transcriptional regulatory networks (TRNs) have been studied intensely for >25 y. Yet, even for the TRN-probably the best characterized TRN-several questions remain. Here, we address three questions: () How complete is our knowledge of the TRN; () how well can we predict gene expression using this TRN; and () how robust is our understanding of the TRN? First, we reconstructed a high-confidence TRN (hiTRN) consisting of 147 transcription factors (TFs) regulating 1,538 transcription units (TUs) encoding 1,764 genes. The 3,797 high-confidence regulatory interactions were collected from published, validated chromatin immunoprecipitation (ChIP) data and RegulonDB. For 21 different TF knockouts, up to 63% of the differentially expressed genes in the hiTRN were traced to the knocked-out TF through regulatory cascades. Second, we trained supervised machine learning algorithms to predict the expression of 1,364 TUs given TF activities using 441 samples. The algorithms accurately predicted condition-specific expression for 86% (1,174 of 1,364) of the TUs, while 193 TUs (14%) were predicted better than random TRNs. Third, we identified 10 regulatory modules whose definitions were robust against changes to the TRN or expression compendium. Using surrogate variable analysis, we also identified three unmodeled factors that systematically influenced gene expression. Our computational workflow comprehensively characterizes the predictive capabilities and systems-level functions of an organism's TRN from disparate data types.

摘要

转录调控网络 (TRN) 的研究已经进行了超过 25 年。然而，即使对于 TRN——可能是特征研究得最好的 TRN——仍有几个问题悬而未决。在这里，我们提出了三个问题：（1）我们对 TRN 的了解有多完整；（2）我们使用这个 TRN 预测基因表达的能力有多好；（3）我们对 TRN 的理解有多稳健？首先，我们构建了一个由 147 个转录因子（TFs）调控 1,538 个转录单元（TUs）的高可信度 TRN（hiTRN），这些 TUs 编码 1,764 个基因。从已发表的、经过验证的染色质免疫沉淀（ChIP）数据和 RegulonDB 中收集了 3,797 个高可信度的调控相互作用。对于 21 个不同的 TF 敲除，通过调控级联，在 hiTRN 中多达 63%的差异表达基因可以追溯到被敲除的 TF。其次，我们使用 441 个样本训练了监督机器学习算法，根据 TF 活性预测 1,364 个 TU 的表达。对于 1,364 个 TU 中的 86%（1,174 个），算法可以准确地预测特定条件下的表达，而 193 个 TU（14%）的预测比随机 TRN 更好。第三，我们鉴定了 10 个调控模块，它们的定义在 TRN 或表达综合数据库发生变化时具有稳健性。通过替代变量分析，我们还鉴定了三个未建模的因素，这些因素系统地影响基因表达。我们的计算工作流程全面描述了从不同数据类型中获得的生物体 TRN 的预测能力和系统级功能。

相似文献

Global transcriptional regulatory network for robustly connects gene expression to transcription factor activities.全局转录调控网络将基因表达与转录因子活性紧密地连接起来。

Proc Natl Acad Sci U S A. 2017 Sep 19;114(38):10286-10291. doi: 10.1073/pnas.1702581114. Epub 2017 Sep 5.

Interplay between network structures, regulatory modes and sensing mechanisms of transcription factors in the transcriptional regulatory network of E. coli.大肠杆菌转录调控网络中网络结构、调控模式及转录因子传感机制之间的相互作用

J Mol Biol. 2007 Sep 28;372(4):1108-1122. doi: 10.1016/j.jmb.2007.06.084. Epub 2007 Jul 3.

The Escherichia coli transcriptome mostly consists of independently regulated modules.大肠杆菌转录组主要由独立调控的模块组成。

Nat Commun. 2019 Dec 4;10(1):5536. doi: 10.1038/s41467-019-13483-w.

An integrated approach to reconstructing genome-scale transcriptional regulatory networks.一种重建全基因组规模转录调控网络的综合方法。

PLoS Comput Biol. 2015 Feb 27;11(2):e1004103. doi: 10.1371/journal.pcbi.1004103. eCollection 2015 Feb.

Hierarchical structure and modules in the Escherichia coli transcriptional regulatory network revealed by a new top-down approach.一种新的自上而下方法揭示的大肠杆菌转录调控网络中的层次结构和模块

BMC Bioinformatics. 2004 Dec 16;5:199. doi: 10.1186/1471-2105-5-199.

Analysis of the hierarchical structure of the B. subtilis transcriptional regulatory network.枯草芽孢杆菌转录调控网络的层次结构分析。

Mol Biosyst. 2015 Mar;11(3):930-41. doi: 10.1039/c4mb00298a. Epub 2015 Jan 19.

Predicting transcriptional regulatory interactions with artificial neural networks applied to E. coli multidrug resistance efflux pumps.应用人工神经网络预测大肠杆菌多药耐药外排泵的转录调控相互作用。

BMC Microbiol. 2008 Jun 19;8:101. doi: 10.1186/1471-2180-8-101.

Transcriptional regulatory networks via gene ontology and expression data.通过基因本体论和表达数据构建转录调控网络。

In Silico Biol. 2007;7(1):21-34.

Machine Learning of All Mycobacterium tuberculosis H37Rv RNA-seq Data Reveals a Structured Interplay between Metabolism, Stress Response, and Infection.基于 H37Rv 全转录组 RNA-seq 数据的机器学习揭示了代谢、应激反应和感染之间的结构化相互作用。

mSphere. 2022 Apr 27;7(2):e0003322. doi: 10.1128/msphere.00033-22. Epub 2022 Mar 21.

Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data.通过整合多源生物数据基于网络基序识别转录因子-靶基因关系

BMC Bioinformatics. 2008 Apr 21;9:203. doi: 10.1186/1471-2105-9-203.

引用本文的文献

How Klebsiella pneumoniae controls its virulence.肺炎克雷伯菌如何控制其毒力。

PLoS Pathog. 2025 Sep 15;21(9):e1013499. doi: 10.1371/journal.ppat.1013499. eCollection 2025 Sep.

Predicting input signals of transcription factors in Escherichia coli.预测大肠杆菌中转录因子的输入信号。

Mol Syst Biol. 2025 Jul 16. doi: 10.1038/s44320-025-00132-2.

Gene network centrality analysis identifies key regulators coordinating day-night metabolic transitions in PCC 7942 despite limited accuracy in predicting direct regulator-gene interactions.基因网络中心性分析确定了协调集胞藻7942昼夜代谢转变的关键调节因子，尽管在预测直接调节因子-基因相互作用方面准确性有限。

Front Microbiol. 2025 Mar 26;16:1569559. doi: 10.3389/fmicb.2025.1569559. eCollection 2025.

Selective phase separation of transcription factors is driven by orthogonal molecular grammar.转录因子的选择性相分离由正交分子语法驱动。

Nat Commun. 2025 Mar 31;16(1):3087. doi: 10.1038/s41467-025-58445-7.

Bimodality in E. coli gene expression: Sources and robustness to genome-wide stresses.大肠杆菌基因表达的双峰性：全基因组应激的来源及稳健性

PLoS Comput Biol. 2025 Feb 13;21(2):e1012817. doi: 10.1371/journal.pcbi.1012817. eCollection 2025 Feb.

Constructing the dynamic transcriptional regulatory networks to identify phenotype-specific transcription regulators.构建动态转录调控网络以识别表型特异性转录调控因子。

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae542.

identification and functional characterization of common genes associated with type 2 diabetes and hypertension.2型糖尿病和高血压相关常见基因的鉴定与功能表征

Heliyon. 2024 Aug 21;10(16):e36546. doi: 10.1016/j.heliyon.2024.e36546. eCollection 2024 Aug 30.

Reconstructing the transcriptional regulatory network of probiotic is enabled by transcriptomics and machine learning.基于转录组学和机器学习来重建益生菌的转录调控网络。

mSystems. 2024 Mar 19;9(3):e0125723. doi: 10.1128/msystems.01257-23. Epub 2024 Feb 13.

Competition and evolutionary selection among core regulatory motifs in gene expression control.基因表达调控中核心调控基序的竞争与进化选择。

Nat Commun. 2023 Dec 13;14(1):8266. doi: 10.1038/s41467-023-43327-7.

Evolutionary innovation through transcription factor rewiring in microbes is shaped by levels of transcription factor activity, expression, and existing connectivity.转录因子重布线在微生物中的进化创新受转录因子活性、表达水平和现有连接性的影响。

PLoS Biol. 2023 Oct 23;21(10):e3002348. doi: 10.1371/journal.pbio.3002348. eCollection 2023 Oct.

本文引用的文献

Few regulatory metabolites coordinate expression of central metabolic genes in Escherichia coli.在大肠杆菌中，很少有调节性代谢物能协调中心代谢基因的表达。

Mol Syst Biol. 2017 Jan 3;13(1):903. doi: 10.15252/msb.20167402.

Multi-omics integration accurately predicts cellular state in unexplored conditions for Escherichia coli.多组学整合能够准确预测未探索条件下大肠杆菌的细胞状态。

Nat Commun. 2016 Oct 7;7:13090. doi: 10.1038/ncomms13090.

Local and global regulation of transcription initiation in bacteria.细菌中转录起始的局部和全局调控。

Nat Rev Microbiol. 2016 Oct;14(10):638-50. doi: 10.1038/nrmicro.2016.103. Epub 2016 Aug 8.

Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks.基于稳定性驱动的非负矩阵分解用于解释空间基因表达并构建局部基因网络。

Proc Natl Acad Sci U S A. 2016 Apr 19;113(16):4290-5. doi: 10.1073/pnas.1521171113. Epub 2016 Apr 6.

COLOMBOS v3.0: leveraging gene expression compendia for cross-species analyses.COLOMBOS v3.0：利用基因表达综合数据集进行跨物种分析。

Nucleic Acids Res. 2016 Jan 4;44(D1):D620-3. doi: 10.1093/nar/gkv1251. Epub 2015 Nov 19.

An experimentally supported model of the Bacillus subtilis global transcriptional regulatory network.一个经实验支持的枯草芽孢杆菌全局转录调控网络模型。

Mol Syst Biol. 2015 Nov 17;11(11):839. doi: 10.15252/msb.20156236.

RegulonDB version 9.0: high-level integration of gene regulation, coexpression, motif clustering and beyond.RegulonDB 9.0版本：基因调控、共表达、基序聚类及其他方面的高级整合。

Nucleic Acids Res. 2016 Jan 4;44(D1):D133-43. doi: 10.1093/nar/gkv1156. Epub 2015 Nov 2.

Genome-wide Reconstruction of OxyR and SoxRS Transcriptional Regulatory Networks under Oxidative Stress in Escherichia coli K-12 MG1655.在大肠杆菌 K-12 MG1655 中，氧化应激下的 OxyR 和 SoxRS 转录调控网络的全基因组重建。

Cell Rep. 2015 Aug 25;12(8):1289-99. doi: 10.1016/j.celrep.2015.07.043. Epub 2015 Aug 13.

Systems biology definition of the core proteome of metabolism and expression is consistent with high-throughput data.代谢和表达核心蛋白质组的系统生物学定义与高通量数据一致。

Proc Natl Acad Sci U S A. 2015 Aug 25;112(34):10810-5. doi: 10.1073/pnas.1501384112. Epub 2015 Aug 10.

Decoding genome-wide GadEWX-transcriptional regulatory networks reveals multifaceted cellular responses to acid stress in Escherichia coli.解码全基因组GadEWX转录调控网络揭示了大肠杆菌对酸应激的多方面细胞反应。

Nat Commun. 2015 Aug 10;6:7970. doi: 10.1038/ncomms8970.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验