• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MetaScore:一种改进基于传统打分函数的蛋白质-蛋白质对接构象打分方法的新型机器学习方法。

MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations.

机构信息

Bioinformatics & Genomics Graduate Program, Pennsylvania State University, University Park, PA 16802, USA.

Artificial Intelligence Research Laboratory, Pennsylvania State University, University Park, PA 16802, USA.

出版信息

Biomolecules. 2023 Jan 6;13(1):121. doi: 10.3390/biom13010121.

DOI:10.3390/biom13010121
PMID:36671507
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9855734/
Abstract

Protein-protein interactions play a ubiquitous role in biological function. Knowledge of the three-dimensional (3D) structures of the complexes they form is essential for understanding the structural basis of those interactions and how they orchestrate key cellular processes. Computational docking has become an indispensable alternative to the expensive and time-consuming experimental approaches for determining the 3D structures of protein complexes. Despite recent progress, identifying near-native models from a large set of conformations sampled by docking-the so-called scoring problem-still has considerable room for improvement. We present MetaScore, a new machine-learning-based approach to improve the scoring of docked conformations. MetaScore utilizes a random forest (RF) classifier trained to distinguish near-native from non-native conformations using their protein-protein interfacial features. The features include physicochemical properties, energy terms, interaction-propensity-based features, geometric properties, interface topology features, evolutionary conservation, and also scores produced by traditional scoring functions (SFs). MetaScore scores docked conformations by simply averaging the score produced by the RF classifier with that produced by any traditional SF. We demonstrate that (i) MetaScore consistently outperforms each of the nine traditional SFs included in this work in terms of success rate and hit rate evaluated over conformations ranked among the top 10; (ii) an ensemble method, MetaScore-Ensemble, that combines 10 variants of MetaScore obtained by combining the RF score with each of the traditional SFs outperforms each of the MetaScore variants. We conclude that the performance of traditional SFs can be improved upon by using machine learning to judiciously leverage protein-protein interfacial features and by using ensemble methods to combine multiple scoring functions.

摘要

蛋白质-蛋白质相互作用在生物功能中起着普遍的作用。了解它们形成的复合物的三维(3D)结构对于理解这些相互作用的结构基础以及它们如何协调关键细胞过程是至关重要的。计算对接已成为确定蛋白质复合物 3D 结构的昂贵且耗时的实验方法的不可或缺的替代方法。尽管最近取得了进展,但从对接采样的大量构象中识别接近天然的模型-即所谓的评分问题-仍然有很大的改进空间。我们提出了 MetaScore,这是一种基于机器学习的新方法,可以提高对接构象的评分。MetaScore 利用随机森林(RF)分类器,该分类器经过训练,可以使用其蛋白质-蛋白质界面特征来区分近天然和非天然构象。特征包括物理化学性质、能量项、基于相互作用倾向的特征、几何性质、界面拓扑特征、进化保守性,以及传统评分函数(SFs)产生的分数。MetaScore 通过简单地将 RF 分类器产生的分数与任何传统 SF 产生的分数相加,对对接构象进行评分。我们证明:(i)MetaScore 在成功率和命中率方面始终优于本工作中包含的九个传统 SF 中的每一个,评估的构象排在前 10 名;(ii)一种集成方法,MetaScore-Ensemble,通过将 RF 分数与每个传统 SF 结合,组合了 10 种 MetaScore 变体,优于每种 MetaScore 变体。我们得出结论,通过使用机器学习明智地利用蛋白质-蛋白质界面特征,并使用集成方法结合多个评分函数,可以提高传统 SF 的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df28/9855734/28a89fa46e3b/biomolecules-13-00121-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df28/9855734/f5cc20ea8a42/biomolecules-13-00121-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df28/9855734/28a89fa46e3b/biomolecules-13-00121-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df28/9855734/f5cc20ea8a42/biomolecules-13-00121-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df28/9855734/28a89fa46e3b/biomolecules-13-00121-g002.jpg

相似文献

1
MetaScore: A Novel Machine-Learning-Based Approach to Improve Traditional Scoring Functions for Scoring Protein-Protein Docking Conformations.MetaScore:一种改进基于传统打分函数的蛋白质-蛋白质对接构象打分方法的新型机器学习方法。
Biomolecules. 2023 Jan 6;13(1):121. doi: 10.3390/biom13010121.
2
Boosted neural networks scoring functions for accurate ligand docking and ranking.用于精确配体对接和排序的增强神经网络评分函数。
J Bioinform Comput Biol. 2018 Apr;16(2):1850004. doi: 10.1142/S021972001850004X. Epub 2018 Feb 4.
3
iScore: a novel graph kernel-based function for scoring protein-protein docking models.iScore:一种用于评估蛋白质-蛋白质对接模型的基于新型图核的函数。
Bioinformatics. 2020 Jan 1;36(1):112-121. doi: 10.1093/bioinformatics/btz496.
4
Machine learning in computational docking.计算对接中的机器学习。
Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.
5
Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.用于预测配体结合构象和亲和力以及进行筛选富集的任务特定评分函数。
J Chem Inf Model. 2018 Jan 22;58(1):119-133. doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.
6
Machine-learning scoring functions for identifying native poses of ligands docked to known and novel proteins.用于识别对接至已知和新型蛋白质的配体天然构象的机器学习评分函数。
BMC Bioinformatics. 2015;16 Suppl 6(Suppl 6):S3. doi: 10.1186/1471-2105-16-S6-S3. Epub 2015 Apr 17.
7
Systematic Improvement of the Performance of Machine Learning Scoring Functions by Incorporating Features of Protein-Bound Water Molecules.通过纳入结合蛋白结合水分子的特征来系统提高机器学习打分函数的性能。
J Chem Inf Model. 2022 Sep 26;62(18):4369-4379. doi: 10.1021/acs.jcim.2c00916. Epub 2022 Sep 9.
8
Can machine learning consistently improve the scoring power of classical scoring functions? Insights into the role of machine learning in scoring functions.机器学习能否持续提高经典评分函数的评分能力?深入探讨机器学习在评分函数中的作用。
Brief Bioinform. 2021 Jan 18;22(1):497-514. doi: 10.1093/bib/bbz173.
9
Scoring a diverse set of high-quality docked conformations: a metascore based on electrostatic and desolvation interactions.对一系列多样的高质量对接构象进行评分:基于静电和去溶剂化相互作用的元评分。
Proteins. 2006 Jun 1;63(4):868-77. doi: 10.1002/prot.20932.
10
DockRank: ranking docked conformations using partner-specific sequence homology-based protein interface prediction.DockRank:利用基于特定伙伴序列同源性的蛋白质界面预测对对接构象进行排名。
Proteins. 2014 Feb;82(2):250-67. doi: 10.1002/prot.24370. Epub 2013 Oct 17.

引用本文的文献

1
Compact Assessment of Molecular Surface Complementarities Enhances Neural Network-Aided Prediction of Key Binding Residues.分子表面互补性的紧凑评估增强了神经网络辅助的关键结合残基预测。
J Chem Inf Model. 2025 Mar 10;65(5):2695-2709. doi: 10.1021/acs.jcim.4c02286. Epub 2025 Feb 21.
2
A comprehensive survey of scoring functions for protein docking models.蛋白质对接模型评分函数的全面综述。
BMC Bioinformatics. 2025 Jan 22;26(1):25. doi: 10.1186/s12859-024-05991-4.
3
DeepRank-GNN-esm: a graph neural network for scoring protein-protein models using protein language model.

本文引用的文献

1
Developing an effective polarizable bond method for small molecules with application to optimized molecular docking.开发一种适用于小分子的有效极化键方法,并应用于优化分子对接。
RSC Adv. 2020 Apr 20;10(26):15530-15540. doi: 10.1039/d0ra01483d. eCollection 2020 Apr 16.
2
A Novel Knowledge Distillation-Based Feature Selection for the Classification of ADHD.一种基于知识蒸馏的用于注意缺陷多动障碍分类的新型特征选择方法。
Biomolecules. 2021 Jul 23;11(8):1093. doi: 10.3390/biom11081093.
3
PASSer: Prediction of Allosteric Sites Server.PASSer:变构位点预测服务器。
DeepRank-GNN-esm:一种使用蛋白质语言模型对蛋白质-蛋白质模型进行评分的图神经网络。
Bioinform Adv. 2024 Jan 5;4(1):vbad191. doi: 10.1093/bioadv/vbad191. eCollection 2024.
4
Computational methods in glaucoma research: Current status and future outlook.青光眼研究中的计算方法:现状与展望。
Mol Aspects Med. 2023 Dec;94:101222. doi: 10.1016/j.mam.2023.101222. Epub 2023 Nov 3.
5
The Molecular Docking of MAX Fungal Effectors with Plant HMA Domain-Binding Proteins.MAX 真菌效应物与植物 HMA 结构域结合蛋白的分子对接。
Int J Mol Sci. 2023 Oct 16;24(20):15239. doi: 10.3390/ijms242015239.
Mach Learn Sci Technol. 2021 Sep;2(3). doi: 10.1088/2632-2153/abe6d6. Epub 2021 May 13.
4
Integrative Modelling of Biomolecular Complexes.生物分子复合物的综合建模。
J Mol Biol. 2020 Apr 17;432(9):2861-2881. doi: 10.1016/j.jmb.2019.11.009. Epub 2019 Nov 26.
5
iScore: a novel graph kernel-based function for scoring protein-protein docking models.iScore:一种用于评估蛋白质-蛋白质对接模型的基于新型图核的函数。
Bioinformatics. 2020 Jan 1;36(1):112-121. doi: 10.1093/bioinformatics/btz496.
6
Partner-specific prediction of RNA-binding residues in proteins: A critical assessment.蛋白质中 RNA 结合残基的伴侣特异性预测:一项批判性评估。
Proteins. 2019 Mar;87(3):198-211. doi: 10.1002/prot.25639. Epub 2018 Dec 30.
7
BIPSPI: a method for the prediction of partner-specific protein-protein interfaces.BIPSPI:一种预测伴侣特异性蛋白质-蛋白质界面的方法。
Bioinformatics. 2019 Feb 1;35(3):470-477. doi: 10.1093/bioinformatics/bty647.
8
The challenge of modeling protein assemblies: the CASP12-CAPRI experiment.蛋白质组装体建模的挑战:CASP12-CAPRI实验
Proteins. 2018 Mar;86 Suppl 1:257-273. doi: 10.1002/prot.25419. Epub 2017 Nov 26.
9
Protein-protein interaction specificity is captured by contact preferences and interface composition.蛋白质-蛋白质相互作用的特异性由接触偏好和界面组成来捕获。
Bioinformatics. 2018 Feb 1;34(3):459-468. doi: 10.1093/bioinformatics/btx584.
10
LightDock: a new multi-scale approach to protein-protein docking.LightDock:一种新的蛋白质-蛋白质对接多尺度方法。
Bioinformatics. 2018 Jan 1;34(1):49-55. doi: 10.1093/bioinformatics/btx555.