• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用家族水平的具有生物物理可解释性的机器学习预测转录因子突变体的DNA结合特异性。

Predicting the DNA binding specificity of transcription factor mutants using family-level biophysically interpretable machine learning.

作者信息

Liu Shaoxun, Gomez-Alcala Pilar, Leemans Christ, Glassford William J, Melo Lucas A N, Lu Xiang-Jun, Mann Richard S, Bussemaker Harmen J

机构信息

Department of Biological Sciences, Columbia University, New York, NY 10027, United States.

Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, United States.

出版信息

Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf831.

DOI:10.1093/nar/gkaf831
PMID:40874594
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12392098/
Abstract

Sequence-specific interactions of transcription factors (TFs) with genomic DNA underlie many cellular processes. High-throughput in vitro binding assays coupled with machine learning have made it possible to accurately define such molecular recognition in a biophysically interpretable way for hundreds of TFs across many structural families, providing new avenues for predicting how the sequence preference of a TF is impacted by disease-associated mutations in its DNA binding domain. We developed a method based on a reference-free tetrahedral representation of variation in base preference within a given structural family that can be used to accurately predict the effect of mutations in the protein sequence of the TF. Using the basic helix-loop-helix (bHLH) and homeodomain (HD) families as test cases, our results demonstrate the feasibility of accurately predicting the shifts (ΔΔΔG/RT) in binding free energy associated with TF mutants by leveraging high-quality DNA binding models for sets of homologous wild-type TFs.

摘要

转录因子(TFs)与基因组DNA的序列特异性相互作用是许多细胞过程的基础。高通量体外结合测定与机器学习相结合,使得以生物物理可解释的方式准确界定许多结构家族中数百种TFs的这种分子识别成为可能,为预测TF的序列偏好如何受到其DNA结合结构域中疾病相关突变的影响提供了新途径。我们开发了一种基于给定结构家族内碱基偏好变化的无参考四面体表示法的方法,该方法可用于准确预测TF蛋白质序列中突变的影响。以基本螺旋-环-螺旋(bHLH)和同源结构域(HD)家族作为测试案例,我们的结果证明了通过利用同源野生型TFs集合的高质量DNA结合模型,准确预测与TF突变体相关的结合自由能变化(ΔΔΔG/RT)的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/e438caeeb319/gkaf831fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/8fe7b0deee3b/gkaf831figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/e8b845c907e8/gkaf831fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/785642e26949/gkaf831fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/75888b9e8e5a/gkaf831fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/ed59bdcd4be3/gkaf831fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/747a2a548fca/gkaf831fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/e438caeeb319/gkaf831fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/8fe7b0deee3b/gkaf831figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/e8b845c907e8/gkaf831fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/785642e26949/gkaf831fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/75888b9e8e5a/gkaf831fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/ed59bdcd4be3/gkaf831fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/747a2a548fca/gkaf831fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/56ec/12392098/e438caeeb319/gkaf831fig6.jpg

相似文献

1
Predicting the DNA binding specificity of transcription factor mutants using family-level biophysically interpretable machine learning.利用家族水平的具有生物物理可解释性的机器学习预测转录因子突变体的DNA结合特异性。
Nucleic Acids Res. 2025 Aug 27;53(16). doi: 10.1093/nar/gkaf831.
2
Predicting the DNA binding specificity of transcription factor mutants using family-level biophysically interpretable machine learning.利用家族水平的具有生物物理可解释性的机器学习预测转录因子突变体的DNA结合特异性
bioRxiv. 2025 Apr 2:2024.01.24.577115. doi: 10.1101/2024.01.24.577115.
3
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Interpretable predictive models of genome-wide aryl hydrocarbon receptor-DNA binding reveal tissue-specific binding determinants.可解释的全基因组芳香烃受体-DNA 结合预测模型揭示了组织特异性结合决定因素。
Toxicol Sci. 2023 Nov 28;196(2):170-186. doi: 10.1093/toxsci/kfad094.
6
Molecular insights into DNA recognition by HD-Zip transcription factors.HD-Zip转录因子对DNA识别的分子见解。
Plant Physiol. 2025 Jul 3;198(3). doi: 10.1093/plphys/kiaf292.
7
UV irradiation remodels the specificity landscape of transcription factors.紫外线照射重塑了转录因子的特异性景观。
Proc Natl Acad Sci U S A. 2023 Mar 14;120(11):e2217422120. doi: 10.1073/pnas.2217422120. Epub 2023 Mar 8.
8
DNA-guided transcription factor cooperativity shapes face and limb mesenchyme.DNA 指导的转录因子协同作用塑造面部和肢体质间。
Cell. 2024 Feb 1;187(3):692-711.e26. doi: 10.1016/j.cell.2023.12.032. Epub 2024 Jan 22.
9
Specificity in clustering of gene-specific transcription factors is encoded in the genome.基因特异性转录因子聚类中的特异性是由基因组编码的。
Nucleic Acids Res. 2025 Jul 8;53(13). doi: 10.1093/nar/gkaf625.
10
Diagnostic test accuracy and cost-effectiveness of tests for codeletion of chromosomal arms 1p and 19q in people with glioma.染色体臂 1p 和 19q 缺失的检测在胶质瘤患者中的诊断准确性和成本效益。
Cochrane Database Syst Rev. 2022 Mar 2;3(3):CD013387. doi: 10.1002/14651858.CD013387.pub2.

本文引用的文献

1
Benchmarking and building DNA binding affinity models using allele-specific and allele-agnostic transcription factor binding data.使用等位基因特异性和等位基因非特异性转录因子结合数据进行基准测试和构建 DNA 结合亲和力模型。
Genome Biol. 2024 Oct 31;25(1):284. doi: 10.1186/s13059-024-03424-2.
2
Geometric deep learning of protein-DNA binding specificity.蛋白质-DNA 结合特异性的几何深度学习。
Nat Methods. 2024 Sep;21(9):1674-1683. doi: 10.1038/s41592-024-02372-w. Epub 2024 Aug 5.
3
Structure-based learning to predict and model protein-DNA interactions and transcription-factor co-operativity in -regulatory elements.
基于结构的学习,用于预测和建模调控元件中的蛋白质-DNA相互作用及转录因子协同作用。
NAR Genom Bioinform. 2024 Jun 12;6(2):lqae068. doi: 10.1093/nargab/lqae068. eCollection 2024 Jun.
4
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
5
DNA binding analysis of rare variants in homeodomains reveals homeodomain specificity-determining residues.在家蝶结构域中稀有变异的 DNA 结合分析揭示了决定同源结构域特异性的残基。
Nat Commun. 2024 Apr 10;15(1):3110. doi: 10.1038/s41467-024-47396-0.
6
A universal deep-learning model for zinc finger design enables transcription factor reprogramming.一种通用的深度学习模型可用于锌指设计,从而实现转录因子的重新编程。
Nat Biotechnol. 2023 Aug;41(8):1117-1129. doi: 10.1038/s41587-022-01624-4. Epub 2023 Jan 26.
7
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
8
DeepZF: improved DNA-binding prediction of C2H2-zinc-finger proteins by deep transfer learning.DeepZF:通过深度迁移学习改进 C2H2-锌指蛋白的 DNA 结合预测。
Bioinformatics. 2022 Sep 16;38(Suppl_2):ii62-ii67. doi: 10.1093/bioinformatics/btac469.
9
Learning probabilistic protein-DNA recognition codes from DNA-binding specificities using structural mappings.从 DNA 结合特异性中学习概率蛋白-DNA 识别码,使用结构映射。
Genome Res. 2022 Sep 27;32(9):1776-1786. doi: 10.1101/gr.276606.122.
10
Transcription factor paralogs orchestrate alternative gene regulatory networks by context-dependent cooperation with multiple cofactors.转录因子同源物通过与多个共因子的上下文相关合作来协调替代基因调控网络。
Nat Commun. 2022 Jul 1;13(1):3808. doi: 10.1038/s41467-022-31501-2.