• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大肠杆菌体内转录因子DNA结合谱汇编的预测性生物物理神经网络建模

Predictive biophysical neural network modeling of a compendium of in vivo transcription factor DNA binding profiles for Escherichia coli.

作者信息

Lally Patrick, Gómez-Romero Laura, Tierrafría Víctor H, Aquino Patricia, Rioualen Claire, Zhang Xiaoman, Kim Sunyoung, Baniulyte Gabriele, Plitnick Jonathan, Smith Carol, Babu Mohan, Collado-Vides Julio, Wade Joseph T, Galagan James E

机构信息

Department of Biomedical Engineering, Boston University, 44 Cummington Mall, Boston, MA, USA.

Instituto Nacional de Medicina Genómica, Periférico Sur 4809, Arenal Tepepan, Ciudad de México, México, México.

出版信息

Nat Commun. 2025 May 7;16(1):4255. doi: 10.1038/s41467-025-58862-8.

DOI:10.1038/s41467-025-58862-8
PMID:40335485
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12059191/
Abstract

The DNA binding of most Escherichia coli Transcription Factors (TFs) has not been comprehensively mapped, and few have models that can quantitatively predict binding affinity. We report the global mapping of in vivo DNA binding for 139 E. coli TFs using ChIP-Seq. We use these data to train BoltzNet, a novel neural network that predicts TF binding energy from DNA sequence. BoltzNet mirrors a quantitative biophysical model and provides directly interpretable predictions genome-wide at nucleotide resolution. We use BoltzNet to quantitatively design novel binding sites, which we validate with biophysical experiments on purified protein. We generate models for 124 TFs that provide insight into global features of TF binding, including clustering of sites, the role of accessory bases, the relevance of weak sites, and the background affinity of the genome. Our paper provides new paradigms for studying TF-DNA binding and for the development of biophysically motivated neural networks.

摘要

大多数大肠杆菌转录因子(TFs)的DNA结合情况尚未得到全面测绘,并且很少有能够定量预测结合亲和力的模型。我们报告了使用ChIP-Seq对139种大肠杆菌TFs进行体内DNA结合的全局测绘。我们利用这些数据训练了BoltzNet,这是一种能从DNA序列预测TF结合能的新型神经网络。BoltzNet反映了一个定量生物物理模型,并在全基因组范围内以核苷酸分辨率提供可直接解释的预测。我们使用BoltzNet定量设计新的结合位点,并通过对纯化蛋白的生物物理实验进行验证。我们为124种TFs生成了模型,这些模型有助于深入了解TF结合的全局特征,包括位点聚类、辅助碱基的作用、弱位点的相关性以及基因组的背景亲和力。我们的论文为研究TF-DNA结合以及开发基于生物物理的神经网络提供了新的范例。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/cedaa858e835/41467_2025_58862_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/14be590808c9/41467_2025_58862_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/f3bd22cebc6d/41467_2025_58862_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/4c1a01daa016/41467_2025_58862_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/1927ce664005/41467_2025_58862_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/9b226cc38e3a/41467_2025_58862_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/cedaa858e835/41467_2025_58862_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/14be590808c9/41467_2025_58862_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/f3bd22cebc6d/41467_2025_58862_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/4c1a01daa016/41467_2025_58862_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/1927ce664005/41467_2025_58862_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/9b226cc38e3a/41467_2025_58862_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3fde/12059191/cedaa858e835/41467_2025_58862_Fig6_HTML.jpg

相似文献

1
Predictive biophysical neural network modeling of a compendium of in vivo transcription factor DNA binding profiles for Escherichia coli.大肠杆菌体内转录因子DNA结合谱汇编的预测性生物物理神经网络建模
Nat Commun. 2025 May 7;16(1):4255. doi: 10.1038/s41467-025-58862-8.
2
Predictive Biophysical Neural Network Modeling of a Compendium of Transcription Factor DNA Binding Profiles for .用于……的转录因子DNA结合谱汇编的预测性生物物理神经网络建模
bioRxiv. 2024 May 24:2024.05.23.594371. doi: 10.1101/2024.05.23.594371.
3
Widespread Strain-Specific Distinctions in Chromosomal Binding Dynamics of a Highly Conserved Escherichia coli Transcription Factor.广泛存在于高度保守的大肠杆菌转录因子的染色体结合动力学中的菌株特异性差异。
mBio. 2020 Jun 23;11(3):e01058-20. doi: 10.1128/mBio.01058-20.
4
Benchmarking and building DNA binding affinity models using allele-specific and allele-agnostic transcription factor binding data.使用等位基因特异性和等位基因非特异性转录因子结合数据进行基准测试和构建 DNA 结合亲和力模型。
Genome Biol. 2024 Oct 31;25(1):284. doi: 10.1186/s13059-024-03424-2.
5
CacPred: a cascaded convolutional neural network for TF-DNA binding prediction.CacPred:用于转录因子-脱氧核糖核酸结合预测的级联卷积神经网络
BMC Genomics. 2025 Mar 18;26(Suppl 2):264. doi: 10.1186/s12864-025-11399-y.
6
A biophysical model for analysis of transcription factor interaction and binding site arrangement from genome-wide binding data.基于全基因组结合数据的转录因子相互作用和结合位点排列的生物物理模型分析。
PLoS One. 2009 Dec 1;4(12):e8155. doi: 10.1371/journal.pone.0008155.
7
Transcription profile of Escherichia coli: genomic SELEX search for regulatory targets of transcription factors.大肠杆菌的转录谱:用于转录因子调控靶点的基因组SELEX筛选
Nucleic Acids Res. 2016 Mar 18;44(5):2058-74. doi: 10.1093/nar/gkw051. Epub 2016 Feb 3.
8
ChIPulate: A comprehensive ChIP-seq simulation pipeline.ChIPulate:一个全面的 ChIP-seq 模拟管道。
PLoS Comput Biol. 2019 Mar 21;15(3):e1006921. doi: 10.1371/journal.pcbi.1006921. eCollection 2019 Mar.
9
Using sequence-specific chemical and structural properties of DNA to predict transcription factor binding sites.利用 DNA 的序列特异性化学和结构特性来预测转录因子结合位点。
PLoS Comput Biol. 2010 Nov 18;6(11):e1001007. doi: 10.1371/journal.pcbi.1001007.
10
The architecture of ArgR-DNA complexes at the genome-scale in Escherichia coli.大肠杆菌基因组尺度下ArgR-DNA复合物的结构
Nucleic Acids Res. 2015 Mar 31;43(6):3079-88. doi: 10.1093/nar/gkv150. Epub 2015 Mar 3.

引用本文的文献

1
The Environment-Dependent Regulatory Landscape of the Genome.基因组的环境依赖性调控格局
ArXiv. 2025 May 13:arXiv:2505.08764v1.
2
The Environment-Dependent Regulatory Landscape of the Genome.基因组的环境依赖性调控格局
bioRxiv. 2025 May 15:2025.05.13.653802. doi: 10.1101/2025.05.13.653802.
3
A Cryptic Prophage Transcription Factor Drives Phenotypic Changes via Host Gene Regulation.一种隐秘原噬菌体转录因子通过宿主基因调控驱动表型变化。

本文引用的文献

1
RegulonDB v12.0: a comprehensive resource of transcriptional regulation in E. coli K-12.RegulonDB v12.0:大肠杆菌 K-12 转录调控的综合资源。
Nucleic Acids Res. 2024 Jan 5;52(D1):D255-D264. doi: 10.1093/nar/gkad1072.
2
Genomic footprinting uncovers global transcription factor responses to amino acids in Escherichia coli.基因组足迹分析揭示了大肠杆菌中全局转录因子对氨基酸的反应。
Cell Syst. 2023 Oct 18;14(10):860-871.e4. doi: 10.1016/j.cels.2023.09.003. Epub 2023 Oct 10.
3
Noncanonical binding of transcription factors: time to revisit ?
bioRxiv. 2024 Sep 21:2024.09.21.614188. doi: 10.1101/2024.09.21.614188.
转录因子的非规范结合:是时候重新审视了吗?
Mol Biol Cell. 2023 Aug 1;34(9). doi: 10.1091/mbc.E22-08-0325.
4
Systematic analysis of low-affinity transcription factor binding site clusters in vitro and in vivo establishes their functional relevance.系统分析体外和体内低亲和力转录因子结合位点簇,确定其功能相关性。
Nat Commun. 2022 Sep 7;13(1):5273. doi: 10.1038/s41467-022-32971-0.
5
Prediction of protein-ligand binding affinity from sequencing data with interpretable machine learning.基于可解释机器学习的测序数据预测蛋白-配体结合亲和力。
Nat Biotechnol. 2022 Oct;40(10):1520-1527. doi: 10.1038/s41587-022-01307-0. Epub 2022 May 23.
6
An Allosteric Transcription Factor DNA-Binding Electrochemical Biosensor for Progesterone.一种用于孕酮的变构转录因子 DNA 结合电化学生物传感器。
ACS Sens. 2022 Apr 22;7(4):1132-1137. doi: 10.1021/acssensors.2c00133. Epub 2022 Apr 12.
7
Base-resolution models of transcription-factor binding reveal soft motif syntax.基于分辨率的转录因子结合模型揭示了软基序语法。
Nat Genet. 2021 Mar;53(3):354-366. doi: 10.1038/s41588-021-00782-6. Epub 2021 Feb 18.
8
Redefining fundamental concepts of transcription initiation in bacteria.重新定义细菌中转录起始的基本概念。
Nat Rev Genet. 2020 Nov;21(11):699-714. doi: 10.1038/s41576-020-0254-8. Epub 2020 Jul 14.
9
A progesterone biosensor derived from microbial screening.一种源自微生物筛选的孕激素生物传感器。
Nat Commun. 2020 Mar 9;11(1):1276. doi: 10.1038/s41467-020-14942-5.
10
Logomaker: beautiful sequence logos in Python.Logomaker:用 Python 绘制优美的序列 logo。
Bioinformatics. 2020 Apr 1;36(7):2272-2274. doi: 10.1093/bioinformatics/btz921.