• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Residue-level prediction of DNA-binding sites and its application on DNA-binding protein predictions.DNA结合位点的残基水平预测及其在DNA结合蛋白预测中的应用。
FEBS Lett. 2007 Mar 6;581(5):1058-66. doi: 10.1016/j.febslet.2007.01.086. Epub 2007 Feb 7.
2
Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information.基于组成、序列和结构信息对DNA结合蛋白及其结合残基进行分析和预测。
Bioinformatics. 2004 Mar 1;20(4):477-86. doi: 10.1093/bioinformatics/btg432. Epub 2004 Jan 22.
3
BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequences.BindN:一种用于高效预测氨基酸序列中DNA和RNA结合位点的基于网络的工具。
Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W243-8. doi: 10.1093/nar/gkl298.
4
Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins.利用进化和结构信息预测DNA结合蛋白上的DNA结合位点。
Proteins. 2006 Jul 1;64(1):19-27. doi: 10.1002/prot.20977.
5
Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.基于序列的DNA中蛋白质结合位点预测:两种支持向量机模型的比较研究
Comput Methods Programs Biomed. 2014 Nov;117(2):158-67. doi: 10.1016/j.cmpb.2014.07.009. Epub 2014 Aug 1.
6
Sequence-based prediction of DNA-binding residues in proteins with conservation and correlation information.基于序列的具有保守性和相关性信息的蛋白质 DNA 结合残基预测。
IEEE/ACM Trans Comput Biol Bioinform. 2012 Nov-Dec;9(6):1766-75. doi: 10.1109/TCBB.2012.106.
7
An accurate feature-based method for identifying DNA-binding residues on protein surfaces.一种用于鉴定蛋白质表面 DNA 结合残基的基于特征的精确方法。
Proteins. 2011 Feb;79(2):509-17. doi: 10.1002/prot.22898.
8
Kernel-based machine learning protocol for predicting DNA-binding proteins.用于预测DNA结合蛋白的基于核的机器学习协议。
Nucleic Acids Res. 2005 Nov 10;33(20):6486-93. doi: 10.1093/nar/gki949. Print 2005.
9
Automated structure-based prediction of functional sites in proteins: applications to assessing the validity of inheriting protein function from homology in genome annotation and to protein docking.基于结构的蛋白质功能位点自动预测:应用于评估基因组注释中同源蛋白功能继承的有效性及蛋白质对接。
J Mol Biol. 2001 Aug 10;311(2):395-408. doi: 10.1006/jmbi.2001.4870.
10
An overview of the prediction of protein DNA-binding sites.蛋白质DNA结合位点预测综述。
Int J Mol Sci. 2015 Mar 6;16(3):5194-215. doi: 10.3390/ijms16035194.

引用本文的文献

1
EGPDI: identifying protein-DNA binding sites based on multi-view graph embedding fusion.EGPDI:基于多视图图嵌入融合的蛋白质-DNA 结合位点识别。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae330.
2
ProkDBP: Toward more precise identification of prokaryotic DNA binding proteins.ProkDBP:致力于更精确地识别原核 DNA 结合蛋白。
Protein Sci. 2024 Jun;33(6):e5015. doi: 10.1002/pro.5015.
3
DeepDISE: DNA Binding Site Prediction Using a Deep Learning Method.DeepDISE:一种基于深度学习的 DNA 结合位点预测方法。
Int J Mol Sci. 2021 May 24;22(11):5510. doi: 10.3390/ijms22115510.
4
Functional Site Discovery From Incomplete Training Data: A Case Study With Nucleic Acid-Binding Proteins.从不完整训练数据中发现功能位点:以核酸结合蛋白为例的研究
Front Genet. 2019 Aug 30;10:729. doi: 10.3389/fgene.2019.00729. eCollection 2019.
5
Improved detection of DNA-binding proteins via compression technology on PSSM information.通过基于位置特异性得分矩阵(PSSM)信息的压缩技术改进DNA结合蛋白的检测。
PLoS One. 2017 Sep 29;12(9):e0185587. doi: 10.1371/journal.pone.0185587. eCollection 2017.
6
EL_PSSM-RT: DNA-binding residue prediction by integrating ensemble learning with PSSM Relation Transformation.EL_PSSM-RT:通过整合集成学习与PSSM关系转换进行DNA结合残基预测
BMC Bioinformatics. 2017 Aug 29;18(1):379. doi: 10.1186/s12859-017-1792-8.
7
PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context.PDNAsite:通过整合空间和序列上下文从蛋白质序列中识别 DNA 结合位点。
Sci Rep. 2016 Jun 10;6:27653. doi: 10.1038/srep27653.
8
A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs.核酸结合位点预测程序的大规模评估
PLoS Comput Biol. 2015 Dec 17;11(12):e1004639. doi: 10.1371/journal.pcbi.1004639. eCollection 2015 Dec.
9
nDNA-Prot: identification of DNA-binding proteins based on unbalanced classification.nDNA-Prot:基于不平衡分类的 DNA 结合蛋白识别。
BMC Bioinformatics. 2014 Sep 8;15(1):298. doi: 10.1186/1471-2105-15-298.
10
enDNA-Prot: identification of DNA-binding proteins by applying ensemble learning.enDNA-Prot:通过应用集成学习识别DNA结合蛋白。
Biomed Res Int. 2014;2014:294279. doi: 10.1155/2014/294279. Epub 2014 May 26.

本文引用的文献

1
Improved protein fold assignment using support vector machines.使用支持向量机改进蛋白质折叠分配
Int J Bioinform Res Appl. 2005;1(3):319-34. doi: 10.1504/IJBRA.2005.007909.
2
Predicting DNA-binding sites of proteins from amino acid sequence.从氨基酸序列预测蛋白质的DNA结合位点。
BMC Bioinformatics. 2006 May 19;7:262. doi: 10.1186/1471-2105-7-262.
3
Structural bioinformatics prediction of membrane-binding proteins.膜结合蛋白的结构生物信息学预测
J Mol Biol. 2006 Jun 2;359(2):486-95. doi: 10.1016/j.jmb.2006.03.039. Epub 2006 Mar 30.
4
Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins.利用进化和结构信息预测DNA结合蛋白上的DNA结合位点。
Proteins. 2006 Jul 1;64(1):19-27. doi: 10.1002/prot.20977.
5
Kernel-based machine learning protocol for predicting DNA-binding proteins.用于预测DNA结合蛋白的基于核的机器学习协议。
Nucleic Acids Res. 2005 Nov 10;33(20):6486-93. doi: 10.1093/nar/gki949. Print 2005.
6
Protein-DNA recognition patterns and predictions.蛋白质 - DNA 识别模式与预测
Annu Rev Biophys Biomol Struct. 2005;34:379-98. doi: 10.1146/annurev.biophys.34.040204.144537.
7
PSSM-based prediction of DNA binding sites in proteins.基于位置特异性得分矩阵的蛋白质中DNA结合位点预测
BMC Bioinformatics. 2005 Feb 19;6:33. doi: 10.1186/1471-2105-6-33.
8
Structure-based prediction of DNA-binding sites on proteins using the empirical preference of electrostatic potential and the shape of molecular surfaces.利用静电势的经验偏好和分子表面形状对蛋白质上的DNA结合位点进行基于结构的预测。
Proteins. 2004 Jun 1;55(4):885-94. doi: 10.1002/prot.20111.
9
Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information.基于组成、序列和结构信息对DNA结合蛋白及其结合残基进行分析和预测。
Bioinformatics. 2004 Mar 1;20(4):477-86. doi: 10.1093/bioinformatics/btg432. Epub 2004 Jan 22.
10
Using electrostatic potentials to predict DNA-binding sites on DNA-binding proteins.利用静电势预测DNA结合蛋白上的DNA结合位点。
Nucleic Acids Res. 2003 Dec 15;31(24):7189-98. doi: 10.1093/nar/gkg922.

DNA结合位点的残基水平预测及其在DNA结合蛋白预测中的应用。

Residue-level prediction of DNA-binding sites and its application on DNA-binding protein predictions.

作者信息

Bhardwaj Nitin, Lu Hui

机构信息

Bioinformatics Program, Department of Bioengineering, University of Illinois at Chicago, Chicago, IL 60607, USA.

出版信息

FEBS Lett. 2007 Mar 6;581(5):1058-66. doi: 10.1016/j.febslet.2007.01.086. Epub 2007 Feb 7.

DOI:10.1016/j.febslet.2007.01.086
PMID:17316627
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1993824/
Abstract

Protein-DNA interactions are crucial to many cellular activities such as expression-control and DNA-repair. These interactions between amino acids and nucleotides are highly specific and any aberrance at the binding site can render the interaction completely incompetent. In this study, we have three aims focusing on DNA-binding residues on the protein surface: to develop an automated approach for fast and reliable recognition of DNA-binding sites; to improve the prediction by distance-dependent refinement; use these predictions to identify DNA-binding proteins. We use a support vector machines (SVM)-based approach to harness the features of the DNA-binding residues to distinguish them from non-binding residues. Features used for distinction include the residue's identity, charge, solvent accessibility, average potential, the secondary structure it is embedded in, neighboring residues, and location in a cationic patch. These features collected from 50 proteins are used to train SVM. Testing is then performed on another set of 37 proteins, much larger than any testing set used in previous studies. The testing set has no more than 20% sequence identity not only among its pairs, but also with the proteins in the training set, thus removing any undesired redundancy due to homology. This set also has proteins with an unseen DNA-binding structural class not present in the training set. With the above features, an accuracy of 66% with balanced sensitivity and specificity is achieved without relying on homology or evolutionary information. We then develop a post-processing scheme to improve the prediction using the relative location of the predicted residues. Balanced success is then achieved with average sensitivity, specificity and accuracy pegged at 71.3%, 69.3% and 70.5%, respectively. Average net prediction is also around 70%. Finally, we show that the number of predicted DNA-binding residues can be used to differentiate DNA-binding proteins from non-DNA-binding proteins with an accuracy of 78%. Results presented here demonstrate that machine-learning can be applied to automated identification of DNA-binding residues and that the success rate can be ameliorated as more features are added. Such functional site prediction protocols can be useful in guiding consequent works such as site-directed mutagenesis and macromolecular docking.

摘要

蛋白质与DNA的相互作用对许多细胞活动至关重要,如表达调控和DNA修复。氨基酸与核苷酸之间的这些相互作用具有高度特异性,结合位点的任何异常都可能导致相互作用完全失效。在本研究中,我们有三个目标聚焦于蛋白质表面的DNA结合残基:开发一种自动方法以快速可靠地识别DNA结合位点;通过距离依赖性优化来改进预测;利用这些预测来识别DNA结合蛋白。我们使用基于支持向量机(SVM)的方法来利用DNA结合残基的特征,将它们与非结合残基区分开来。用于区分的特征包括残基的身份、电荷、溶剂可及性、平均电势、其所在的二级结构、相邻残基以及在阳离子斑块中的位置。从50种蛋白质中收集的这些特征用于训练支持向量机。然后在另一组37种蛋白质上进行测试,该测试集比以往研究中使用的任何测试集都要大得多。该测试集不仅其配对之间的序列同一性不超过20%,而且与训练集中的蛋白质也不超过20%,从而消除了由于同源性导致的任何不必要的冗余。该测试集还包含训练集中不存在的具有未见DNA结合结构类别的蛋白质。利用上述特征,在不依赖同源性或进化信息的情况下,实现了平衡敏感性和特异性的66%的准确率。然后我们开发了一种后处理方案,利用预测残基的相对位置来改进预测。随后实现了平衡成功,平均敏感性、特异性和准确率分别为71.3%、69.3%和70.5%。平均净预测也约为70%。最后,我们表明预测的DNA结合残基数量可用于区分DNA结合蛋白和非DNA结合蛋白,准确率为78%。这里呈现的结果表明机器学习可应用于DNA结合残基的自动识别,并且随着添加更多特征成功率可以得到改善。这种功能位点预测方案可用于指导后续工作,如定点诱变和大分子对接。