• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于结构的蛋白质界面整体预测。

Integrated structure-based protein interface prediction.

机构信息

Department of Chemistry, Yeshiva College, Yeshiva University, New York, NY, 10033, USA.

Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY, 10461, USA.

出版信息

BMC Bioinformatics. 2022 Jul 25;23(1):301. doi: 10.1186/s12859-022-04852-2.

DOI:10.1186/s12859-022-04852-2
PMID:35879651
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9316365/
Abstract

BACKGROUND

Identifying protein interfaces can inform how proteins interact with their binding partners, uncover the regulatory mechanisms that control biological functions and guide the development of novel therapeutic agents. A variety of computational approaches have been developed for predicting a protein's interfacial residues from its known sequence and structure. Methods using the known three-dimensional structures of proteins can be template-based or template-free. Template-based methods have limited success in predicting interfaces when homologues with known complex structures are not available to use as templates. The prediction performance of template-free methods that only rely only upon proteins' intrinsic properties is limited by the amount of biologically relevant features that can be included in an interface prediction model.

RESULTS

We describe the development of an integrated method for protein interface prediction (ISPIP) to explore the hypothesis that the efficacy of a computational prediction method of protein binding sites can be enhanced by using a combination of methods that rely on orthogonal structure-based properties of a query protein, combining and balancing both template-free and template-based features. ISPIP is a method that integrates these approaches through simple linear or logistic regression models and more complex decision tree models. On a diverse test set of 156 query proteins, ISPIP outperforms each of its individual classifiers in identifying protein binding interfaces.

CONCLUSIONS

The integrated method captures the best performance of individual classifiers and delivers an improved interface prediction. The method is robust and performs well even when one of the individual classifiers performs poorly on a particular query protein. This work demonstrates that integrating orthogonal methods that depend on different structural properties of proteins performs better at interface prediction than any individual classifier alone.

摘要

背景

鉴定蛋白质界面可以揭示蛋白质与其结合伴侣相互作用的方式,揭示控制生物功能的调控机制,并指导新型治疗剂的开发。已经开发了多种从已知序列和结构预测蛋白质界面残基的计算方法。使用蛋白质已知三维结构的方法可以是基于模板的或无模板的。当没有已知复合物结构的同源物可用作模板时,基于模板的方法在预测界面方面的成功率有限。仅依赖蛋白质固有特性的无模板方法的预测性能受到可以包含在接口预测模型中的生物学相关特征的数量的限制。

结果

我们描述了一种用于蛋白质界面预测(ISPIP)的综合方法的开发,以探索以下假设:通过使用组合方法来增强计算蛋白质结合位点预测方法的功效,这些方法依赖于查询蛋白质的正交基于结构的特性,结合和平衡无模板和基于模板的特征。ISPIP 通过简单的线性或逻辑回归模型以及更复杂的决策树模型来集成这些方法。在包含 156 个查询蛋白的多样化测试集中,ISPIP 在识别蛋白质结合界面方面优于其各个分类器。

结论

综合方法捕获了各个分类器的最佳性能,并提供了改进的接口预测。该方法稳健,即使个别分类器在特定查询蛋白上表现不佳,它也能很好地执行。这项工作表明,集成依赖于蛋白质不同结构特性的正交方法在界面预测方面的性能优于任何单个分类器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/855eba452121/12859_2022_4852_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/94fd0c7b5c12/12859_2022_4852_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/f7d1f1e215bc/12859_2022_4852_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/0263bbde5c93/12859_2022_4852_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/02d60bdb20f1/12859_2022_4852_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/855eba452121/12859_2022_4852_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/94fd0c7b5c12/12859_2022_4852_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/f7d1f1e215bc/12859_2022_4852_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/0263bbde5c93/12859_2022_4852_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/02d60bdb20f1/12859_2022_4852_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c3/9316365/855eba452121/12859_2022_4852_Fig5_HTML.jpg

相似文献

1
Integrated structure-based protein interface prediction.基于结构的蛋白质界面整体预测。
BMC Bioinformatics. 2022 Jul 25;23(1):301. doi: 10.1186/s12859-022-04852-2.
2
DNABind: a hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning- and template-based approaches.DNABind:一种基于机器学习和模板的混合算法,用于预测基于结构的 DNA 结合残基。
Proteins. 2013 Nov;81(11):1885-99. doi: 10.1002/prot.24330. Epub 2013 Aug 16.
3
Ab initio and template-based prediction of multi-class distance maps by two-dimensional recursive neural networks.基于二维递归神经网络的多类别距离图的从头预测和基于模板的预测。
BMC Struct Biol. 2009 Jan 30;9:5. doi: 10.1186/1472-6807-9-5.
4
RBRDetector: improved prediction of binding residues on RNA-binding protein structures using complementary feature- and template-based strategies.RBRDetector:利用基于互补特征和模板的策略改进对RNA结合蛋白结构上结合残基的预测。
Proteins. 2014 Oct;82(10):2455-71. doi: 10.1002/prot.24610. Epub 2014 Jun 9.
5
PCI-SS: MISO dynamic nonlinear protein secondary structure prediction.PCI-SS:MISO动态非线性蛋白质二级结构预测
BMC Bioinformatics. 2009 Jul 17;10:222. doi: 10.1186/1471-2105-10-222.
6
Protein-RNA interface residue prediction using machine learning: an assessment of the state of the art.基于机器学习的蛋白质-RNA 界面残基预测:现状评估。
BMC Bioinformatics. 2012 May 10;13:89. doi: 10.1186/1471-2105-13-89.
7
Prediction of protein-protein interaction sites from weakly homologous template structures using meta-threading and machine learning.利用元线程和机器学习从弱同源模板结构预测蛋白质-蛋白质相互作用位点。
J Mol Recognit. 2015 Jan;28(1):35-48. doi: 10.1002/jmr.2410.
8
A hybrid method for protein-protein interface prediction.一种用于蛋白质-蛋白质界面预测的混合方法。
Protein Sci. 2016 Jan;25(1):159-65. doi: 10.1002/pro.2744. Epub 2015 Jul 21.
9
Ensemble of Template-Free and Template-Based Classifiers for Protein Secondary Structure Prediction.无模板和基于模板的分类器集成方法用于蛋白质二级结构预测。
Int J Mol Sci. 2021 Oct 23;22(21):11449. doi: 10.3390/ijms222111449.
10
APIS: accurate prediction of hot spots in protein interfaces by combining protrusion index with solvent accessibility.APIS:通过结合突出指数和溶剂可及性来准确预测蛋白质界面热点。
BMC Bioinformatics. 2010 Apr 8;11:174. doi: 10.1186/1471-2105-11-174.

引用本文的文献

1
Advances in Computational Intelligence-Based Methods of Structure and Function Prediction of Proteins.基于计算智能的蛋白质结构与功能预测方法的进展。
Biomolecules. 2024 Aug 29;14(9):1083. doi: 10.3390/biom14091083.
2
Computational prediction of multiple antigen epitopes.计算预测多种抗原表位。
Bioinformatics. 2024 Oct 1;40(10). doi: 10.1093/bioinformatics/btae556.
3
Computational Methods to Predict Conformational B-Cell Epitopes.计算方法预测构象 B 细胞表位。

本文引用的文献

1
Hydrogen-Deuterium Exchange and Hydroxyl Radical Footprinting for Mapping Hydrophobic Interactions of Human Bromodomain with a Small Molecule Inhibitor.氢氘交换和羟基自由基足迹法用于绘制小分子抑制剂与人溴结构域的疏水相互作用。
J Am Soc Mass Spectrom. 2019 Dec;30(12):2795-2804. doi: 10.1007/s13361-019-02316-1. Epub 2019 Nov 12.
2
The EMBL-EBI search and sequence analysis tools APIs in 2019.2019 年的 EMBL-EBI 搜索和序列分析工具 API。
Nucleic Acids Res. 2019 Jul 2;47(W1):W636-W641. doi: 10.1093/nar/gkz268.
3
Protein-protein binding supersites.
Biomolecules. 2024 Aug 10;14(8):983. doi: 10.3390/biom14080983.
4
Assessing the functional impact of protein binding site definition.评估蛋白质结合位点定义的功能影响。
Protein Sci. 2024 Jun;33(6):e5026. doi: 10.1002/pro.5026.
5
Optimal selection of suitable templates in protein interface prediction.蛋白质界面预测中合适模板的最优选择。
Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad510.
蛋白质-蛋白质结合超位。
PLoS Comput Biol. 2019 Jan 7;15(1):e1006704. doi: 10.1371/journal.pcbi.1006704. eCollection 2019 Jan.
4
The choice of sequence homologs included in multiple sequence alignments has a dramatic impact on evolutionary conservation analysis.多序列比对中所包含的序列同源物的选择对进化保守性分析有显著影响。
Bioinformatics. 2019 Jan 1;35(1):12-19. doi: 10.1093/bioinformatics/bty523.
5
Exploring the potential of 3D Zernike descriptors and SVM for protein-protein interface prediction.探索 3D Zernike 描述符和 SVM 在蛋白质-蛋白质界面预测中的应用潜力。
BMC Bioinformatics. 2018 Feb 6;19(1):35. doi: 10.1186/s12859-018-2043-3.
6
ISPRED4: interaction sites PREDiction in protein structures with a refining grammar model.ISPRED4:使用优化语法模型预测蛋白质结构中的相互作用位点
Bioinformatics. 2017 Jun 1;33(11):1656-1663. doi: 10.1093/bioinformatics/btx044.
7
iFrag: A Protein-Protein Interface Prediction Server Based on Sequence Fragments.iFrag:一个基于序列片段的蛋白质-蛋白质相互作用界面预测服务器。
J Mol Biol. 2017 Feb 3;429(3):382-389. doi: 10.1016/j.jmb.2016.11.034. Epub 2016 Dec 10.
8
New Parameters for Higher Accuracy in the Computation of Binding Free Energy Differences upon Alanine Scanning Mutagenesis on Protein-Protein Interfaces.用于在蛋白质-蛋白质界面进行丙氨酸扫描诱变时更精确计算结合自由能差异的新参数
J Chem Inf Model. 2017 Jan 23;57(1):60-72. doi: 10.1021/acs.jcim.6b00378. Epub 2016 Dec 22.
9
Computational prediction of protein interfaces: A review of data driven methods.蛋白质界面的计算预测:数据驱动方法综述
FEBS Lett. 2015 Nov 30;589(23):3516-26. doi: 10.1016/j.febslet.2015.10.003. Epub 2015 Oct 13.
10
The revolution will not be crystallized: a new method sweeps through structural biology.这场革命不会被固化:一种新方法席卷结构生物学领域。
Nature. 2015 Sep 10;525(7568):172-4. doi: 10.1038/525172a.