• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用残基水平和序列轮廓水平的界面倾向用于蛋白质结合位点预测。

Exploiting residue-level and profile-level interface propensities for usage in binding sites prediction of proteins.

作者信息

Dong Qiwen, Wang Xiaolong, Lin Lei, Guan Yi

机构信息

School of Computer Science and Technology, Harbin Institute of Technology, Harbin, China.

出版信息

BMC Bioinformatics. 2007 May 5;8:147. doi: 10.1186/1471-2105-8-147.

DOI:10.1186/1471-2105-8-147
PMID:17480235
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1885810/
Abstract

BACKGROUND

Recognition of binding sites in proteins is a direct computational approach to the characterization of proteins in terms of biological and biochemical function. Residue preferences have been widely used in many studies but the results are often not satisfactory. Although different amino acid compositions among the interaction sites of different complexes have been observed, such differences have not been integrated into the prediction process. Furthermore, the evolution information has not been exploited to achieve a more powerful propensity.

RESULT

In this study, the residue interface propensities of four kinds of complexes (homo-permanent complexes, homo-transient complexes, hetero-permanent complexes and hetero-transient complexes) are investigated. These propensities, combined with sequence profiles and accessible surface areas, are inputted to the support vector machine for the prediction of protein binding sites. Such propensities are further improved by taking evolutional information into consideration, which results in a class of novel propensities at the profile level, i.e. the binary profiles interface propensities. Experiment is performed on the 1139 non-redundant protein chains. Although different residue interface propensities among different complexes are observed, the improvement of the classifier with residue interface propensities can be negligible in comparison with that without propensities. The binary profile interface propensities can significantly improve the performance of binding sites prediction by about ten percent in term of both precision and recall.

CONCLUSION

Although there are minor differences among the four kinds of complexes, the residue interface propensities cannot provide efficient discrimination for the complicated interfaces of proteins. The binary profile interface propensities can significantly improve the performance of binding sites prediction of protein, which indicates that the propensities at the profile level are more accurate than those at the residue level.

摘要

背景

识别蛋白质中的结合位点是一种直接的计算方法,可从生物学和生化功能方面对蛋白质进行表征。残基偏好性已在许多研究中广泛使用,但结果往往不尽人意。尽管已观察到不同复合物相互作用位点之间存在不同的氨基酸组成,但这种差异尚未整合到预测过程中。此外,尚未利用进化信息来获得更强有力的倾向。

结果

在本研究中,对四种复合物(同型永久复合物、同型瞬时复合物、异型永久复合物和异型瞬时复合物)的残基界面倾向进行了研究。这些倾向与序列谱和可及表面积相结合,输入到支持向量机中用于预测蛋白质结合位点。通过考虑进化信息进一步改进了这些倾向,从而在谱水平上产生了一类新的倾向,即二元谱界面倾向。对1139条非冗余蛋白质链进行了实验。尽管观察到不同复合物之间存在不同的残基界面倾向,但与不使用倾向的情况相比,使用残基界面倾向的分类器的改进可以忽略不计。二元谱界面倾向在精度和召回率方面都能显著提高结合位点预测的性能约10%。

结论

尽管四种复合物之间存在细微差异,但残基界面倾向不能为蛋白质复杂界面提供有效的区分。二元谱界面倾向可以显著提高蛋白质结合位点预测的性能,这表明谱水平的倾向比残基水平的倾向更准确。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/02c38512a0bb/1471-2105-8-147-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/1698b13ef16e/1471-2105-8-147-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/993e689899d4/1471-2105-8-147-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/ea2c9fe7c790/1471-2105-8-147-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/02c38512a0bb/1471-2105-8-147-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/1698b13ef16e/1471-2105-8-147-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/993e689899d4/1471-2105-8-147-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/ea2c9fe7c790/1471-2105-8-147-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a15/1885810/02c38512a0bb/1471-2105-8-147-4.jpg

相似文献

1
Exploiting residue-level and profile-level interface propensities for usage in binding sites prediction of proteins.利用残基水平和序列轮廓水平的界面倾向用于蛋白质结合位点预测。
BMC Bioinformatics. 2007 May 5;8:147. doi: 10.1186/1471-2105-8-147.
2
Exploiting three kinds of interface propensities to identify protein binding sites.利用三种界面倾向来识别蛋白质结合位点。
Comput Biol Chem. 2009 Aug;33(4):303-11. doi: 10.1016/j.compbiolchem.2009.07.001. Epub 2009 Jul 9.
3
Amino acid residue doublet propensity in the protein-RNA interface and its application to RNA interface prediction.蛋白质-RNA界面中的氨基酸残基双峰倾向及其在RNA界面预测中的应用。
Nucleic Acids Res. 2006;34(22):6450-60. doi: 10.1093/nar/gkl819. Epub 2006 Nov 27.
4
Prediction of protein-protein interaction sites using support vector machines.使用支持向量机预测蛋白质-蛋白质相互作用位点。
Protein Eng Des Sel. 2004 Feb;17(2):165-73. doi: 10.1093/protein/gzh020. Epub 2004 Jan 20.
5
Predicting RNA-binding sites in proteins using the interaction propensity of amino acid triplets.利用氨基酸三联体的相互作用倾向预测蛋白质中的RNA结合位点。
Protein Pept Lett. 2010 Sep;17(9):1102-10. doi: 10.2174/092986610791760388.
6
Intramolecular surface contacts contain information about protein-protein interface regions.分子内表面接触包含有关蛋白质-蛋白质界面区域的信息。
Bioinformatics. 2006 Sep 1;22(17):2094-8. doi: 10.1093/bioinformatics/btl275. Epub 2006 Jun 9.
7
How proteins get in touch: interface prediction in the study of biomolecular complexes.蛋白质如何相互作用:生物分子复合物研究中的界面预测
Curr Protein Pept Sci. 2008 Aug;9(4):394-406. doi: 10.2174/138920308785132712.
8
A quantitative analysis of interfacial amino acid conservation in protein-protein hetero complexes.蛋白质-蛋白质异源复合物中界面氨基酸保守性的定量分析。
J Bioinform Comput Biol. 2005 Oct;3(5):1137-50. doi: 10.1142/s0219720005001429.
9
Oligomerisation status and evolutionary conservation of interfaces of protein structural domain superfamilies.蛋白质结构域超家族界面的寡聚化状态与进化保守性
Mol Biosyst. 2013 Jul;9(7):1652-61. doi: 10.1039/c3mb25484d. Epub 2013 Mar 27.
10
Improving protein-protein interaction prediction using evolutionary information from low-quality MSAs.利用来自低质量多序列比对的进化信息改进蛋白质-蛋白质相互作用预测。
PLoS One. 2017 Feb 6;12(2):e0169356. doi: 10.1371/journal.pone.0169356. eCollection 2017.

引用本文的文献

1
Prediction of Protein-Protein Interaction Sites Using Convolutional Neural Network and Improved Data Sets.利用卷积神经网络和改进数据集预测蛋白质-蛋白质相互作用位点。
Int J Mol Sci. 2020 Jan 11;21(2):467. doi: 10.3390/ijms21020467.
2
Prediction of Protein-Protein Interactions from Amino Acid Sequences Based on Continuous and Discrete Wavelet Transform Features.基于连续和离散小波变换特征的氨基酸序列蛋白质-蛋白质相互作用预测。
Molecules. 2018 Apr 4;23(4):823. doi: 10.3390/molecules23040823.
3
Modeling protein quaternary structure of homo- and hetero-oligomers beyond binary interactions by homology.

本文引用的文献

1
Protein-protein interaction site prediction based on conditional random fields.基于条件随机场的蛋白质-蛋白质相互作用位点预测
Bioinformatics. 2007 Mar 1;23(5):597-604. doi: 10.1093/bioinformatics/btl660. Epub 2007 Jan 18.
2
Using local gene expression similarities to discover regulatory binding site modules.利用局部基因表达相似性发现调控结合位点模块。
BMC Bioinformatics. 2006 Nov 17;7:505. doi: 10.1186/1471-2105-7-505.
3
A machine learning strategy to identify candidate binding sites in human protein-coding sequence.一种用于识别人类蛋白质编码序列中候选结合位点的机器学习策略。
通过同源建模来模拟同聚体和异聚体的蛋白质四级结构,超越二元相互作用。
Sci Rep. 2017 Sep 5;7(1):10480. doi: 10.1038/s41598-017-09654-8.
4
Improved protein-protein interactions prediction via weighted sparse representation model combining continuous wavelet descriptor and PseAA composition.通过结合连续小波描述符和伪氨基酸组成的加权稀疏表示模型改进蛋白质-蛋白质相互作用预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):120. doi: 10.1186/s12918-016-0360-6.
5
Sequence-based prediction of protein-protein interactions using weighted sparse representation model combined with global encoding.基于序列的蛋白质-蛋白质相互作用预测:结合全局编码的加权稀疏表示模型
BMC Bioinformatics. 2016 Apr 26;17(1):184. doi: 10.1186/s12859-016-1035-4.
6
Progress and challenges in predicting protein interfaces.预测蛋白质界面的进展与挑战。
Brief Bioinform. 2016 Jan;17(1):117-31. doi: 10.1093/bib/bbv027. Epub 2015 May 13.
7
Algorithmic approaches to protein-protein interaction site prediction.蛋白质-蛋白质相互作用位点预测的算法方法。
Algorithms Mol Biol. 2015 Feb 15;10:7. doi: 10.1186/s13015-015-0033-9. eCollection 2015.
8
Principal components analysis of protein sequence clusters.蛋白质序列簇的主成分分析。
J Struct Funct Genomics. 2014 Mar;15(1):1-11. doi: 10.1007/s10969-014-9173-2. Epub 2014 Feb 5.
9
Specialized Dynamical Properties of Promiscuous Residues Revealed by Simulated Conformational Ensembles.通过模拟构象集合揭示的混杂残基的特殊动力学性质。
J Chem Theory Comput. 2013 Nov 12;9(11):5127-5147. doi: 10.1021/ct400486p. Epub 2013 Sep 27.
10
SVM based model generation for binding site prediction on helix turn helix motif type of transcription factors in eukaryotes.基于支持向量机的真核生物中螺旋-转角-螺旋基序型转录因子结合位点预测模型生成。
Bioinformation. 2013 Jun 8;9(10):500-5. doi: 10.6026/97320630009500. Print 2013.
BMC Bioinformatics. 2006 Sep 26;7:419. doi: 10.1186/1471-2105-7-419.
4
ROSETTALIGAND: protein-small molecule docking with full side-chain flexibility.罗塞塔配体:具有全侧链灵活性的蛋白质-小分子对接
Proteins. 2006 Nov 15;65(3):538-48. doi: 10.1002/prot.21086.
5
Localization of binding sites in protein structures by optimization of a composite scoring function.通过复合评分函数的优化对蛋白质结构中结合位点进行定位。
Protein Sci. 2006 Oct;15(10):2366-80. doi: 10.1110/ps.062247506. Epub 2006 Sep 8.
6
Insights into protein-protein interfaces using a Bayesian network prediction method.使用贝叶斯网络预测方法洞察蛋白质-蛋白质相互作用界面
J Mol Biol. 2006 Sep 15;362(2):365-86. doi: 10.1016/j.jmb.2006.07.028. Epub 2006 Jul 21.
7
Rank information: a structure-independent measure of evolutionary trace quality that improves identification of protein functional sites.排名信息:一种与结构无关的进化踪迹质量度量,可改进蛋白质功能位点的识别。
Proteins. 2006 Oct 1;65(1):111-23. doi: 10.1002/prot.21101.
8
Protein binding site prediction using an empirical scoring function.使用经验评分函数预测蛋白质结合位点。
Nucleic Acids Res. 2006 Aug 7;34(13):3698-707. doi: 10.1093/nar/gkl454. Print 2006.
9
ET viewer: an application for predicting and visualizing functional sites in protein structures.ET查看器:一种用于预测和可视化蛋白质结构中功能位点的应用程序。
Bioinformatics. 2006 Aug 15;22(16):2049-50. doi: 10.1093/bioinformatics/btl285. Epub 2006 Jun 29.
10
Novel knowledge-based mean force potential at the profile level.轮廓水平上基于新知识的平均力势。
BMC Bioinformatics. 2006 Jun 27;7:324. doi: 10.1186/1471-2105-7-324.