
Prediction of DNA-binding protein based on statistical and geometric features and support vector machines.

Affiliation

Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong.

Publication information

Proteome Sci. 2011 Oct 14;9 Suppl 1(Suppl 1):S1. doi: 10.1186/1477-5956-9-S1-S1.

DOI:10.1186/1477-5956-9-S1-S1
PMID:22166014
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3289070/
Abstract

BACKGROUND

Previous studies of protein-DNA interaction have mostly focused on the bound structures of DNA-binding proteins, and few have paid enough attention to unbound structures. As more new proteins are discovered, it is useful and imperative to develop algorithms for the functional prediction of unbound proteins. In our work, we apply an alpha shape model to represent the surface structure of the protein-DNA complex, extract useful statistical and geometric features from it, and use structural alignment and support vector machines to predict unbound DNA-binding proteins.
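To make the classification step concrete, here is a minimal sketch of a soft-margin linear support vector machine trained by batch subgradient descent on the hinge loss. It is not the authors' implementation: the abstract does not state the kernel, the feature set, or the training procedure, so the two-dimensional feature vectors, the hyperparameters (`lam`, `lr`, `epochs`), and the function names below are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def train_linear_svm(
    X: List[Vector],
    y: List[int],
    lam: float = 0.01,   # L2 regularisation strength (assumed value)
    lr: float = 0.1,     # learning rate (assumed value)
    epochs: int = 200,
) -> Tuple[List[float], float]:
    """Minimise lam/2 * |w|^2 + mean(hinge loss) by batch subgradient descent.

    Labels must be +1 (binding) or -1 (non-binding).
    """
    n = len(X)
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]
        grad_b = 0.0
        for xi, yi in zip(X, y):
            # Only samples inside the margin contribute to the hinge subgradient.
            if yi * (dot(w, xi) + b) < 1.0:
                for j, xj in enumerate(xi):
                    grad_w[j] -= yi * xj / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def predict(w: Vector, b: float, x: Vector) -> int:
    return 1 if dot(w, x) + b >= 0.0 else -1


# Hypothetical, already-scaled feature vectors (NOT data from the paper).
X_train = [(2.0, 2.0), (2.5, 1.5), (3.0, 2.5), (1.5, 3.0),
           (-2.0, -1.5), (-1.0, -2.5), (-2.5, -2.0), (-1.5, -1.0)]
y_train = [1, 1, 1, 1, -1, -1, -1, -1]

w, b = train_linear_svm(X_train, y_train)
train_predictions = [predict(w, b, x) for x in X_train]
```

In practice the feature vectors would come from the statistical and geometric descriptors of the protein surface, and a library implementation with a tuned kernel would normally replace this hand-written optimiser.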

RESULTS

The performance of our method is evaluated by discriminating a set of 104 DNA-binding proteins from 401 non-DNA-binding proteins. In the same test, the proposed method outperforms a comparison method based on conditional probability. The results achieved by the proposed method (precision, 83.33%; accuracy, 86.53%; MCC, 0.5368) demonstrate its good performance.
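The three reported metrics can be tied back to a single confusion matrix. The sketch below computes precision, accuracy, and the Matthews correlation coefficient (MCC) from true/false positive and negative counts. The counts themselves are an assumption: the abstract does not report them, and they are inferred here as one set of integers that is consistent with 104 positives, 401 negatives, and the rounded figures quoted above, assuming a single evaluation over all 505 proteins.

```python
import math
from typing import Tuple


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> Tuple[float, float, float]:
    """Return (precision, accuracy, MCC) for a 2x2 confusion matrix."""
    precision = tp / (tp + fp)
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return precision, accuracy, mcc


# Inferred counts (assumption, not stated in the abstract).
TP, FP, TN, FN = 45, 9, 392, 59
assert TP + FN == 104   # DNA-binding proteins in the test set
assert FP + TN == 401   # non-DNA-binding proteins in the test set

precision, accuracy, mcc = binary_metrics(TP, FP, TN, FN)
print(f"precision={precision:.2%} accuracy={accuracy:.2%} MCC={mcc:.4f}")
# prints: precision=83.33% accuracy=86.53% MCC=0.5368

# Accuracy of always predicting "non-binding" on this imbalanced set.
baseline_accuracy = 401 / (104 + 401)
print(f"majority-class baseline accuracy={baseline_accuracy:.2%}")
# prints: majority-class baseline accuracy=79.41%
```

Because the set is imbalanced, a trivial majority-class predictor already reaches about 79% accuracy, which is one reason MCC is a useful companion metric here.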

CONCLUSIONS

In this study we develop an effective method for predicting protein-DNA interactions based on statistical and geometric features and support vector machines. Our results show that interface surface features play an important role in protein-DNA interaction. Our technique is able to predict unbound DNA-binding proteins and to discriminate DNA-binding proteins from proteins that bind other molecules.
