• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习算法的蛋白质-蛋白质相互作用界面分类与预测。

Classification and prediction of protein-protein interaction interface using machine learning algorithm.

机构信息

Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, WB, India.

出版信息

Sci Rep. 2021 Jan 19;11(1):1761. doi: 10.1038/s41598-020-80900-2.

DOI:10.1038/s41598-020-80900-2
PMID:33469042
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7815773/
Abstract

Structural insight of the protein-protein interaction (PPI) interface can provide knowledge about the kinetics, thermodynamics and molecular functions of the complex while elucidating its role in diseases and further enabling it as a potential therapeutic target. However, owing to experimental lag in solving protein-protein complex structures, three-dimensional (3D) knowledge of the PPI interfaces can be gained via computational approaches like molecular docking and post-docking analyses. Despite development of numerous docking tools and techniques, success in identification of native like interfaces based on docking score functions is limited. Hence, we employed an in-depth investigation of the structural features of the interface that might successfully delineate native complexes from non-native ones. We identify interface properties, which show statistically significant difference between native and non-native interfaces belonging to homo and hetero, protein-protein complexes. Utilizing these properties, a support vector machine (SVM) based classification scheme has been implemented to differentiate native and non-native like complexes generated using docking decoys. Benchmarking and comparative analyses suggest very good performance of our SVM classifiers. Further, protein interactions, which are proven via experimental findings but not resolved structurally, were subjected to this approach where 3D-models of the complexes were generated and most likely interfaces were predicted. A web server called Protein Complex Prediction by Interface Properties (PCPIP) is developed to predict whether interface of a given protein-protein dimer complex resembles known protein interfaces. The server is freely available at http://www.hpppi.iicb.res.in/pcpip/ .

摘要

蛋白质-蛋白质相互作用(PPI)界面的结构洞察力可以提供有关复合物的动力学、热力学和分子功能的知识,同时阐明其在疾病中的作用,并进一步将其作为潜在的治疗靶点。然而,由于解决蛋白质-蛋白质复合物结构的实验滞后,通过分子对接和对接后分析等计算方法可以获得 PPI 界面的三维(3D)知识。尽管已经开发了许多对接工具和技术,但基于对接评分函数成功识别天然类似界面的成功率有限。因此,我们深入研究了界面的结构特征,这些特征可能成功地区分天然复合物和非天然复合物。我们确定了界面特性,这些特性在属于同型和异型的天然和非天然界面之间表现出统计学上的显著差异,蛋白质-蛋白质复合物。利用这些特性,我们实施了基于支持向量机(SVM)的分类方案,以区分使用对接诱饵生成的天然和非天然类似复合物。基准测试和比较分析表明,我们的 SVM 分类器表现非常出色。此外,我们还对经过实验验证但尚未通过结构解析的蛋白质相互作用进行了这种方法的处理,生成了复合物的 3D 模型,并预测了最有可能的界面。我们开发了一个名为“通过界面特性预测蛋白质复合物(PCPIP)”的网络服务器,用于预测给定蛋白质-蛋白质二聚体复合物的界面是否类似于已知的蛋白质界面。该服务器可免费在 http://www.hpppi.iicb.res.in/pcpip/ 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/6b08cf17ed02/41598_2020_80900_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/d729fe5e5b12/41598_2020_80900_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/e17429694c38/41598_2020_80900_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/1bdb1e92853b/41598_2020_80900_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/6b08cf17ed02/41598_2020_80900_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/d729fe5e5b12/41598_2020_80900_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/e17429694c38/41598_2020_80900_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/1bdb1e92853b/41598_2020_80900_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e75b/7815773/6b08cf17ed02/41598_2020_80900_Fig4_HTML.jpg

相似文献

1
Classification and prediction of protein-protein interaction interface using machine learning algorithm.基于机器学习算法的蛋白质-蛋白质相互作用界面分类与预测。
Sci Rep. 2021 Jan 19;11(1):1761. doi: 10.1038/s41598-020-80900-2.
2
Predicted binding site information improves model ranking in protein docking using experimental and computer-generated target structures.利用实验性和计算机生成的靶标结构时,预测的结合位点信息可改善蛋白质对接中的模型排序。
BMC Struct Biol. 2015 Nov 23;15:23. doi: 10.1186/s12900-015-0050-4.
3
A simple reference state makes a significant improvement in near-native selections from structurally refined docking decoys.一个简单的参考状态显著改善了从结构优化的对接诱饵中进行的近天然选择。
Proteins. 2007 Nov 1;69(2):244-53. doi: 10.1002/prot.21498.
4
Nonlinear scoring functions for similarity-based ligand docking and binding affinity prediction.基于相似性的配体对接和结合亲和力预测的非线性评分函数。
J Chem Inf Model. 2013 Nov 25;53(11):3097-112. doi: 10.1021/ci400510e. Epub 2013 Nov 11.
5
Binding interface prediction by combining protein-protein docking results.通过结合蛋白质-蛋白质对接结果进行结合界面预测。
Proteins. 2014 Jan;82(1):57-66. doi: 10.1002/prot.24354. Epub 2013 Aug 31.
6
CLUB-MARTINI: Selecting Favourable Interactions amongst Available Candidates, a Coarse-Grained Simulation Approach to Scoring Docking Decoys.CLUB-MARTINI:在现有候选物中选择有利相互作用,一种用于对接诱饵评分的粗粒度模拟方法。
PLoS One. 2016 May 11;11(5):e0155251. doi: 10.1371/journal.pone.0155251. eCollection 2016.
7
How to choose templates for modeling of protein complexes: Insights from benchmarking template-based docking.如何为蛋白质复合物建模选择模板:基于基准测试的模板对接的见解。
Proteins. 2020 Aug;88(8):1070-1081. doi: 10.1002/prot.25875. Epub 2020 Feb 7.
8
Energy-based graph convolutional networks for scoring protein docking models.基于能量的图卷积网络在蛋白质对接模型评分中的应用。
Proteins. 2020 Aug;88(8):1091-1099. doi: 10.1002/prot.25888. Epub 2020 Mar 16.
9
Feature-based classification of native and non-native protein-protein interactions: Comparing supervised and semi-supervised learning approaches.基于特征的天然和非天然蛋白质-蛋白质相互作用分类:比较监督学习和半监督学习方法。
Proteomics. 2011 Nov;11(22):4321-30. doi: 10.1002/pmic.201100217. Epub 2011 Oct 17.
10
An SVM-based method for assessment of transcription factor-DNA complex models.基于支持向量机的转录因子-DNA 复合物模型评估方法。
BMC Bioinformatics. 2018 Dec 21;19(Suppl 20):506. doi: 10.1186/s12859-018-2538-y.

引用本文的文献

1
MESM: integrating multi-source data for high-accuracy protein-protein interactions prediction through multimodal language models.MESM:通过多模态语言模型整合多源数据以进行高精度蛋白质-蛋白质相互作用预测
BMC Biol. 2025 Aug 11;23(1):253. doi: 10.1186/s12915-025-02356-y.
2
Unravelling the human taste receptor interactome: machine learning and molecular modelling insights into protein-protein interactions.解析人类味觉受体相互作用组:机器学习与蛋白质-蛋白质相互作用的分子建模见解
NPJ Sci Food. 2025 Jul 1;9(1):113. doi: 10.1038/s41538-025-00478-9.
3
Negative sampling strategies impact the prediction of scale-free biomolecular network interactions with machine learning.

本文引用的文献

1
Protein-protein binding supersites.蛋白质-蛋白质结合超位。
PLoS Comput Biol. 2019 Jan 7;15(1):e1006704. doi: 10.1371/journal.pcbi.1006704. eCollection 2019 Jan.
2
Protein-protein interaction specificity is captured by contact preferences and interface composition.蛋白质-蛋白质相互作用的特异性由接触偏好和界面组成来捕获。
Bioinformatics. 2018 Feb 1;34(3):459-468. doi: 10.1093/bioinformatics/btx584.
3
Computational modeling of protein assemblies.蛋白质组装的计算建模。
负采样策略会影响利用机器学习对无标度生物分子网络相互作用的预测。
BMC Biol. 2025 May 9;23(1):123. doi: 10.1186/s12915-025-02231-w.
4
Learning Universal Representations of Intermolecular Interactions with ATOMICA.利用ATOMICA学习分子间相互作用的通用表示。
bioRxiv. 2025 Jul 15:2025.04.02.646906. doi: 10.1101/2025.04.02.646906.
5
Network pharmacology and AI in cancer research uncovering biomarkers and therapeutic targets for RALGDS mutations.癌症研究中的网络药理学与人工智能:揭示RALGDS突变的生物标志物和治疗靶点
Sci Rep. 2025 Mar 29;15(1):10938. doi: 10.1038/s41598-025-91568-x.
6
Structure-Based Deep Learning Framework for Modeling Human-Gut Bacterial Protein Interactions.用于模拟人肠道细菌蛋白质相互作用的基于结构的深度学习框架
Proteomes. 2025 Feb 17;13(1):10. doi: 10.3390/proteomes13010010.
7
PIPENN-EMB ensemble net and protein embeddings generalise protein interface prediction beyond homology.PIPENN-EMB集成网络和蛋白质嵌入技术将蛋白质界面预测推广到同源性之外。
Sci Rep. 2025 Feb 5;15(1):4391. doi: 10.1038/s41598-025-88445-y.
8
Structure-Based Approaches for Protein-Protein Interaction Prediction Using Machine Learning and Deep Learning.基于结构的机器学习和深度学习蛋白质-蛋白质相互作用预测方法
Biomolecules. 2025 Jan 17;15(1):141. doi: 10.3390/biom15010141.
9
A comprehensive survey of scoring functions for protein docking models.蛋白质对接模型评分函数的全面综述。
BMC Bioinformatics. 2025 Jan 22;26(1):25. doi: 10.1186/s12859-024-05991-4.
10
Interactomic Analyses and a Reverse Engineering Study Identify Specific Functional Activities of One-to-One Interactions of the S1 Subunit of the SARS-CoV-2 Spike Protein with the Human Proteome.相互作用组分析和逆向工程研究确定了新冠病毒刺突蛋白S1亚基与人蛋白质组一对一相互作用的特定功能活性。
Biomolecules. 2024 Dec 3;14(12):1549. doi: 10.3390/biom14121549.
Curr Opin Struct Biol. 2017 Jun;44:179-189. doi: 10.1016/j.sbi.2017.04.006. Epub 2017 May 12.
4
Improved prediction of protein-protein interactions using novel negative samples, features, and an ensemble classifier.利用新型负样本、特征和集成分类器提高蛋白质-蛋白质相互作用预测。
Artif Intell Med. 2017 Nov;83:67-74. doi: 10.1016/j.artmed.2017.03.001. Epub 2017 Mar 4.
5
Protein-protein interface prediction based on hexagon structure similarity.基于六边形结构相似性的蛋白质-蛋白质界面预测
Comput Biol Chem. 2016 Aug;63:83-88. doi: 10.1016/j.compbiolchem.2016.02.008. Epub 2016 Feb 12.
6
Integrative Modeling of Macromolecular Assemblies from Low to Near-Atomic Resolution.从低分辨率到近原子分辨率的大分子组装体的整合建模
Comput Struct Biotechnol J. 2015 Sep 3;13:492-503. doi: 10.1016/j.csbj.2015.08.005. eCollection 2015.
7
PPCheck: A Webserver for the Quantitative Analysis of Protein-Protein Interfaces and Prediction of Residue Hotspots.PPCheck:用于蛋白质-蛋白质相互作用定量分析和残基热点预测的网络服务器。
Bioinform Biol Insights. 2015 Sep 21;9:141-51. doi: 10.4137/BBI.S25928. eCollection 2015.
8
Structural neighboring property for identifying protein-protein binding sites.用于识别蛋白质-蛋白质结合位点的结构邻接属性
BMC Syst Biol. 2015;9 Suppl 5(Suppl 5):S3. doi: 10.1186/1752-0509-9-S5-S3. Epub 2015 Sep 1.
9
Application of Enhanced Sampling Monte Carlo Methods for High-Resolution Protein-Protein Docking in Rosetta.增强采样蒙特卡罗方法在Rosetta中用于高分辨率蛋白质-蛋白质对接的应用
PLoS One. 2015 Jun 8;10(6):e0125941. doi: 10.1371/journal.pone.0125941. eCollection 2015.
10
Progress and challenges in predicting protein interfaces.预测蛋白质界面的进展与挑战。
Brief Bioinform. 2016 Jan;17(1):117-31. doi: 10.1093/bib/bbv027. Epub 2015 May 13.