De novo prediction of DNA-binding specificities for Cys2His2 zinc finger proteins.

Affiliation

Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton NJ 08544, USA and Department of Computer Science, Princeton University, Princeton NJ 08544, USA.

Publication information

Nucleic Acids Res. 2014 Jan;42(1):97-108. doi: 10.1093/nar/gkt890. Epub 2013 Oct 3.

DOI:10.1093/nar/gkt890
PMID:24097433
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3874201/
Abstract

Proteins with sequence-specific DNA binding function are important for a wide range of biological activities. De novo prediction of their DNA-binding specificities from sequence alone would be a great aid in inferring cellular networks. Here we introduce a method for predicting DNA-binding specificities for Cys2His2 zinc fingers (C2H2-ZFs), the largest family of DNA-binding proteins in metazoans. We develop a general approach, based on empirical calculations of pairwise amino acid-nucleotide interaction energies, for predicting position weight matrices (PWMs) representing DNA-binding specificities for C2H2-ZF proteins. We predict DNA-binding specificities on a per-finger basis and merge predictions for C2H2-ZF domains that are arrayed within sequences. We test our approach on a diverse set of natural C2H2-ZF proteins with known binding specificities and demonstrate that for >85% of the proteins, their predicted PWMs are accurate in 50% of their nucleotide positions. For proteins with several zinc finger isoforms, we show via case studies that this level of accuracy enables us to match isoforms with their known DNA-binding specificities. A web server for predicting a PWM given a protein containing C2H2-ZF domains is available online at http://zf.princeton.edu and can be used to aid in protein engineering applications and in genome-wide searches for transcription factor targets.
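The abstract describes predicting a PWM per finger from amino acid-nucleotide interaction energies and then merging the per-finger predictions across an array. The Python sketch below illustrates that general idea only; it is not the authors' implementation. It assumes each finger contributes three DNA positions, that per-base energies (lower is more favorable) are converted to probabilities with a Boltzmann weighting, and that fingers are merged by plain concatenation in reverse order, since the N-terminal finger of a C2H2-ZF array contacts the 3'-most triplet. All function names and energy values are hypothetical.

```python
import math

BASES = "ACGT"

def energies_to_column(energies, kT=1.0):
    """Turn four per-base energies (order A, C, G, T) into probabilities.

    Assumption: Boltzmann weighting, so lower energy means higher probability.
    """
    weights = [math.exp(-e / kT) for e in energies]
    total = sum(weights)
    return [w / total for w in weights]

def finger_pwm(finger_energies, kT=1.0):
    """Build a PWM (list of columns) for one finger.

    finger_energies holds one list of four energies per DNA position,
    given 5'->3' within the finger's triplet (an assumption of this sketch).
    """
    return [energies_to_column(column, kT) for column in finger_energies]

def merge_fingers(per_finger_pwms):
    """Concatenate per-finger PWMs into one PWM read 5'->3'.

    Fingers are supplied N-terminal to C-terminal and reversed here,
    because the N-terminal finger binds the 3'-most triplet.
    """
    merged = []
    for pwm in reversed(per_finger_pwms):
        merged.extend(pwm)
    return merged

def consensus(pwm):
    """Return the most probable base at each position."""
    return "".join(BASES[max(range(4), key=column.__getitem__)] for column in pwm)

# Hypothetical toy energies, not taken from the paper.
finger1 = [[0.0, 2.0, 2.0, 2.0],   # favors A
           [2.0, 0.0, 2.0, 2.0],   # favors C
           [2.0, 2.0, 0.0, 2.0]]   # favors G
finger2 = [[2.0, 2.0, 2.0, 0.0],   # favors T
           [2.0, 2.0, 0.0, 2.0],   # favors G
           [0.0, 2.0, 2.0, 2.0]]   # favors A

pwm = merge_fingers([finger_pwm(finger1), finger_pwm(finger2)])
print(consensus(pwm))
```

Each column of the merged matrix sums to 1, and the second finger's triplet comes first in the 5'->3' reading. A real model would also need to account for contacts that span neighboring triplets, which this concatenation ignores.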

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd56/3874201/45926c4e68e2/gkt890f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd56/3874201/41ca0e5347e5/gkt890f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd56/3874201/dd536d4452cf/gkt890f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd56/3874201/749720a1488a/gkt890f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd56/3874201/0eb251a07b91/gkt890f5p.jpg
