• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于集成的方法估计预测蛋白配体结合亲和力值的置信度。

An ensemble-based approach to estimate confidence of predicted protein-ligand binding affinity values.

机构信息

Applied Biotechnology Research Center, Baqiyatallah University of Medical Sciences, Tehran, Iran.

出版信息

Mol Inform. 2024 Apr;43(4):e202300292. doi: 10.1002/minf.202300292. Epub 2024 Feb 15.

DOI:10.1002/minf.202300292
PMID:38358080
Abstract

When designing a machine learning-based scoring function, we access a limited number of protein-ligand complexes with experimentally determined binding affinity values, representing only a fraction of all possible protein-ligand complexes. Consequently, it is crucial to report a measure of confidence and quantify the uncertainty in the model's predictions during test time. Here, we adopt the conformal prediction technique to evaluate the confidence of a prediction for each member of the core set of the CASF 2016 benchmark. The conformal prediction technique requires a diverse ensemble of predictors for uncertainty estimation. To this end, we introduce ENS-Score as an ensemble predictor, which includes 30 models with different protein-ligand representation approaches and achieves Pearson's correlation of 0.842 on the core set of the CASF 2016 benchmark. Also, we comprehensively investigate the residual error of each data point to assess the normality behavior of the distribution of the residual errors and their correlation to the structural features of the ligands, such as hydrophobic interactions and halogen bonding. In the end, we provide a local host web application to facilitate the usage of ENS-Score. All codes to repeat results are provided at https://github.com/miladrayka/ENS_Score.

摘要

在设计基于机器学习的评分函数时,我们可以访问具有实验确定的结合亲和力值的有限数量的蛋白质-配体复合物,这些复合物仅代表所有可能的蛋白质-配体复合物的一部分。因此,在测试时报告置信度度量并量化模型预测的不确定性至关重要。在这里,我们采用一致预测技术来评估 CASF 2016 基准核心集中每个成员的预测置信度。一致预测技术需要使用多样化的预测器集合进行不确定性估计。为此,我们引入了 ENS-Score 作为一个集成预测器,它包括 30 种具有不同蛋白质-配体表示方法的模型,在 CASF 2016 基准核心集上实现了 0.842 的皮尔逊相关系数。此外,我们还全面研究了每个数据点的残差,以评估残差分布的正态性行为及其与配体结构特征(如疏水相互作用和卤键)的相关性。最后,我们提供了一个本地主机网络应用程序,以方便使用 ENS-Score。重复结果的所有代码都可在 https://github.com/miladrayka/ENS_Score 上找到。

相似文献

1
An ensemble-based approach to estimate confidence of predicted protein-ligand binding affinity values.基于集成的方法估计预测蛋白配体结合亲和力值的置信度。
Mol Inform. 2024 Apr;43(4):e202300292. doi: 10.1002/minf.202300292. Epub 2024 Feb 15.
2
GB-score: Minimally designed machine learning scoring function based on distance-weighted interatomic contact features.GB评分:基于距离加权原子间接触特征的最小化设计机器学习评分函数。
Mol Inform. 2023 Mar;42(3):e2200135. doi: 10.1002/minf.202200135. Epub 2023 Feb 1.
3
ET-score: Improving Protein-ligand Binding Affinity Prediction Based on Distance-weighted Interatomic Contact Features Using Extremely Randomized Trees Algorithm.ET-得分:利用极端随机树算法基于距离加权原子间接触特征改进蛋白质-配体结合亲和力预测。
Mol Inform. 2021 Aug;40(8):e2060084. doi: 10.1002/minf.202060084. Epub 2021 May 21.
4
BgN-Score and BsN-Score: bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes.BgN分数和BsN分数:基于装袋法和提升法的集成神经网络评分函数,用于准确预测蛋白质-配体复合物的结合亲和力。
BMC Bioinformatics. 2015;16 Suppl 4(Suppl 4):S8. doi: 10.1186/1471-2105-16-S4-S8. Epub 2015 Feb 23.
5
Delta Machine Learning to Improve Scoring-Ranking-Screening Performances of Protein-Ligand Scoring Functions.利用 Delta 机器学习改进蛋白质配体打分函数的评分-排名-筛选性能。
J Chem Inf Model. 2022 Jun 13;62(11):2696-2712. doi: 10.1021/acs.jcim.2c00485. Epub 2022 May 17.
6
Protein-ligand binding affinity prediction exploiting sequence constituent homology.利用序列组成成分同源性预测蛋白质-配体结合亲和力。
Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad502.
7
PharmRF: A machine-learning scoring function to identify the best protein-ligand complexes for structure-based pharmacophore screening with high enrichments.PharmRF:一种机器学习评分函数,用于识别具有高富集度的基于结构的药效团筛选的最佳蛋白质-配体复合物。
J Comput Chem. 2022 May 5;43(12):847-863. doi: 10.1002/jcc.26840. Epub 2022 Mar 18.
8
Comparative assessment of scoring functions on an updated benchmark: 2. Evaluation methods and general results.更新后的基准上评分函数的比较评估:2. 评估方法与总体结果。
J Chem Inf Model. 2014 Jun 23;54(6):1717-36. doi: 10.1021/ci500081m. Epub 2014 Jun 2.
9
Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.为开发蛋白质-配体相互作用评分函数奠定基础。
Acc Chem Res. 2017 Feb 21;50(2):302-309. doi: 10.1021/acs.accounts.6b00491. Epub 2017 Feb 9.
10
AK-Score: Accurate Protein-Ligand Binding Affinity Prediction Using an Ensemble of 3D-Convolutional Neural Networks.AK-Score:使用 3D 卷积神经网络集成进行准确的蛋白质-配体结合亲和力预测。
Int J Mol Sci. 2020 Nov 10;21(22):8424. doi: 10.3390/ijms21228424.

引用本文的文献

1
Achieving well-informed decision-making in drug discovery: a comprehensive calibration study using neural network-based structure-activity models.在药物发现中实现明智的决策:一项使用基于神经网络的构效模型的全面校准研究。
J Cheminform. 2025 Mar 5;17(1):29. doi: 10.1186/s13321-025-00964-y.