用于预测配体结合构象和亲和力以及进行筛选富集的任务特定评分函数。

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.

机构信息

Department of Electrical and Computer Engineering, Michigan State University , East Lansing, Michigan 48824-1226, United States.

出版信息

J Chem Inf Model. 2018 Jan 22;58(1):119-133. doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.

DOI:10.1021/acs.jcim.7b00309

PMID:29190087

Abstract

Molecular docking, scoring, and virtual screening play an increasingly important role in computer-aided drug discovery. Scoring functions (SFs) are typically employed to predict the binding conformation (docking task), binding affinity (scoring task), and binary activity level (screening task) of ligands against a critical protein target in a disease's pathway. In most molecular docking software packages available today, a generic binding affinity-based (BA-based) SF is invoked for all three tasks to solve three different, but related, prediction problems. The limited predictive accuracies of such SFs in these three tasks has been a major roadblock toward cost-effective drug discovery. Therefore, in this work, we develop BT-Score, an ensemble machine-learning (ML) SF of boosted decision trees and thousands of predictive descriptors to estimate BA. BT-Score reproduced BA of out-of-sample test complexes with correlation of 0.825. Even with this high accuracy in the scoring task, we demonstrate that the docking and screening performance of BT-Score and other BA-based SFs is far from ideal. This has motivated us to build two task-specific ML SFs for the docking and screening problems. We propose BT-Dock, a boosted-tree ensemble model trained on a large number of native and computer-generated ligand conformations and optimized to predict binding poses explicitly. This model has shown an average improvement of 25% over its BA-based counterparts in different ligand pose prediction scenarios. Similar improvement has also been obtained by our screening-based SF, BT-Screen, which directly models the ligand activity labeling task as a classification problem. BT-Screen is trained on thousands of active and inactive protein-ligand complexes to optimize it for finding real actives from databases of ligands not seen in its training set. In addition to the three task-specific SFs, we propose a novel multi-task deep neural network (MT-Net) that is trained on data from the three tasks to simultaneously predict binding poses, affinities, and activity levels. We show that the performance of MT-Net is superior to conventional SFs and on a par with or better than models based on single-task neural networks.

摘要

分子对接、评分和虚拟筛选在计算机辅助药物发现中发挥着越来越重要的作用。评分函数（SF）通常用于预测配体与疾病途径中关键蛋白靶标的结合构象（对接任务）、结合亲和力（评分任务）和二元活性水平（筛选任务）。在当今可用的大多数分子对接软件包中，针对所有三个任务调用通用基于结合亲和力的（BA 基）SF，以解决三个不同但相关的预测问题。在这三个任务中，此类 SF 的有限预测准确性一直是实现具有成本效益的药物发现的主要障碍。因此，在这项工作中，我们开发了 BT-Score，这是一种基于集成机器学习（ML）的决策树和数千个预测描述符的增强型 SF，用于估计 BA。BT-Score 对样本外测试复合物的 BA 进行了重现，相关系数为 0.825。即使在评分任务中具有如此高的准确性，我们也证明了 BT-Score 和其他 BA 基 SF 的对接和筛选性能远非理想。这促使我们为对接和筛选问题构建了两个特定于任务的 ML SF。我们提出了 BT-Dock，这是一种基于大量天然和计算机生成的配体构象的增强树集成模型，经过优化可明确预测结合构象。在不同的配体构象预测场景中，该模型与基于 BA 的对应模型相比平均提高了 25%。我们的基于筛选的 SF BT-Screen 也取得了类似的改进，该模型直接将配体活性标记任务建模为分类问题。BT-Screen 在数千个活性和非活性的蛋白质-配体复合物上进行训练，以优化其从其训练集中未见过的配体数据库中找到真实活性的能力。除了这三个特定于任务的 SF 之外，我们还提出了一种新颖的多任务深度神经网络（MT-Net），该网络基于三个任务的数据进行训练，以同时预测结合构象、亲和力和活性水平。我们表明，MT-Net 的性能优于传统 SF，并且与基于单任务神经网络的模型相当或更好。

相似文献

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.用于预测配体结合构象和亲和力以及进行筛选富集的任务特定评分函数。

J Chem Inf Model. 2018 Jan 22;58(1):119-133. doi: 10.1021/acs.jcim.7b00309. Epub 2017 Dec 20.

Boosted neural networks scoring functions for accurate ligand docking and ranking.用于精确配体对接和排序的增强神经网络评分函数。

J Bioinform Comput Biol. 2018 Apr;16(2):1850004. doi: 10.1142/S021972001850004X. Epub 2018 Feb 4.

BgN-Score and BsN-Score: bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes.BgN分数和BsN分数：基于装袋法和提升法的集成神经网络评分函数，用于准确预测蛋白质-配体复合物的结合亲和力。

BMC Bioinformatics. 2015;16 Suppl 4(Suppl 4):S8. doi: 10.1186/1471-2105-16-S4-S8. Epub 2015 Feb 23.

A Comparative Assessment of Predictive Accuracies of Conventional and Machine Learning Scoring Functions for Protein-Ligand Binding Affinity Prediction.传统评分函数与机器学习评分函数在蛋白质-配体结合亲和力预测中的预测准确性比较评估

IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):335-47. doi: 10.1109/TCBB.2014.2351824.

Machine-learning scoring functions for identifying native poses of ligands docked to known and novel proteins.用于识别对接至已知和新型蛋白质的配体天然构象的机器学习评分函数。

BMC Bioinformatics. 2015;16 Suppl 6(Suppl 6):S3. doi: 10.1186/1471-2105-16-S6-S3. Epub 2015 Apr 17.

A comparative assessment of ranking accuracies of conventional and machine-learning-based scoring functions for protein-ligand binding affinity prediction.常规与基于机器学习打分函数对蛋白质-配体结合亲和力预测的排序准确性比较评估。

IEEE/ACM Trans Comput Biol Bioinform. 2012 Sep-Oct;9(5):1301-13. doi: 10.1109/TCBB.2012.36.

Machine learning in computational docking.计算对接中的机器学习。

Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.

Rescoring of docking poses under Occam's Razor: are there simpler solutions?奥卡姆剃刀下对接构象的重评分：是否存在更简单的解决方案？

J Comput Aided Mol Des. 2018 Sep;32(9):877-888. doi: 10.1007/s10822-018-0155-5. Epub 2018 Sep 1.

Beware of machine learning-based scoring functions-on the danger of developing black boxes.警惕基于机器学习的评分函数——开发黑盒的危险。

J Chem Inf Model. 2014 Oct 27;54(10):2807-15. doi: 10.1021/ci500406k. Epub 2014 Sep 24.

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation.SCORCH：利用机器学习分类器、数据增强和不确定性估计改进基于结构的虚拟筛选。

J Adv Res. 2023 Apr;46:135-147. doi: 10.1016/j.jare.2022.07.001. Epub 2022 Jul 25.

引用本文的文献

Normalized Protein-Ligand Distance Likelihood Score for End-to-End Blind Docking and Virtual Screening.用于端到端盲对接和虚拟筛选的归一化蛋白质-配体距离似然得分

J Chem Inf Model. 2025 Feb 10;65(3):1101-1114. doi: 10.1021/acs.jcim.4c01014. Epub 2025 Jan 17.

RankMHC: Learning to Rank Class-I Peptide-MHC Structural Models.RankMHC：学习对I类肽-主要组织相容性复合体结构模型进行排序。

J Chem Inf Model. 2024 Dec 9;64(23):8729-8742. doi: 10.1021/acs.jcim.4c01278. Epub 2024 Nov 18.

A comprehensive review of artificial intelligence for pharmacology research.药理学研究中人工智能的全面综述。

Front Genet. 2024 Sep 3;15:1450529. doi: 10.3389/fgene.2024.1450529. eCollection 2024.

PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications.PLAS-20k：用于机器学习应用的 MD 模拟中蛋白质-配体亲和力的扩展数据集。

Sci Data. 2024 Feb 9;11(1):180. doi: 10.1038/s41597-023-02872-y.

Inactive-enriched machine-learning models exploiting patent data improve structure-based virtual screening for PDL1 dimerizers.利用专利数据的非活性增强型机器学习模型改进了基于结构的PDL1二聚体虚拟筛选。

J Adv Res. 2025 Jan;67:185-196. doi: 10.1016/j.jare.2024.01.024. Epub 2024 Jan 26.

Integrated Molecular Modeling and Machine Learning for Drug Design.基于分子模拟的药物设计与机器学习的整合。

J Chem Theory Comput. 2023 Nov 14;19(21):7478-7495. doi: 10.1021/acs.jctc.3c00814. Epub 2023 Oct 26.

A practical guide to machine-learning scoring for structure-based virtual screening.基于结构的虚拟筛选的机器学习评分实用指南。

Nat Protoc. 2023 Nov;18(11):3460-3511. doi: 10.1038/s41596-023-00885-w. Epub 2023 Oct 16.

Beware of Simple Methods for Structure-Based Virtual Screening: The Critical Importance of Broader Comparisons.警惕基于结构的虚拟筛选的简单方法：更广泛比较的至关重要性。

J Chem Inf Model. 2023 Mar 13;63(5):1401-1405. doi: 10.1021/acs.jcim.3c00218. Epub 2023 Feb 27.

Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review.基于结构的深度学习预测蛋白质-配体结合亲和力的评分函数综述

Front Bioinform. 2022 Jun 17;2. doi: 10.3389/fbinf.2022.885983.

A reinforcement learning approach for protein-ligand binding pose prediction.一种用于蛋白质-配体结合构象预测的强化学习方法。

BMC Bioinformatics. 2022 Sep 8;23(1):368. doi: 10.1186/s12859-022-04912-7.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于预测配体结合构象和亲和力以及进行筛选富集的任务特定评分函数。

Task-Specific Scoring Functions for Predicting Ligand Binding Poses and Affinity and for Screening Enrichment.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献