用于蛋白质-配体相互作用建模和排序的统计势能。

Statistical potential for modeling and ranking of protein-ligand interactions.

机构信息

Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, USA.

出版信息

J Chem Inf Model. 2011 Dec 27;51(12):3078-92. doi: 10.1021/ci200377u. Epub 2011 Nov 21.

DOI:10.1021/ci200377u

PMID:22014038

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3246566/

Abstract

Applications in structural biology and medicinal chemistry require protein-ligand scoring functions for two distinct tasks: (i) ranking different poses of a small molecule in a protein binding site and (ii) ranking different small molecules by their complementarity to a protein site. Using probability theory, we developed two atomic distance-dependent statistical scoring functions: PoseScore was optimized for recognizing native binding geometries of ligands from other poses and RankScore was optimized for distinguishing ligands from nonbinding molecules. Both scores are based on a set of 8,885 crystallographic structures of protein-ligand complexes but differ in the values of three key parameters. Factors influencing the accuracy of scoring were investigated, including the maximal atomic distance and non-native ligand geometries used for scoring, as well as the use of protein models instead of crystallographic structures for training and testing the scoring function. For the test set of 19 targets, RankScore improved the ligand enrichment (logAUC) and early enrichment (EF(1)) scores computed by DOCK 3.6 for 13 and 14 targets, respectively. In addition, RankScore performed better at rescoring than each of seven other scoring functions tested. Accepting both the crystal structure and decoy geometries with all-atom root-mean-square errors of up to 2 Å from the crystal structure as correct binding poses, PoseScore gave the best score to a correct binding pose among 100 decoys for 88% of all cases in a benchmark set containing 100 protein-ligand complexes. PoseScore accuracy is comparable to that of DrugScore(CSD) and ITScore/SE and superior to 12 other tested scoring functions. Therefore, RankScore can facilitate ligand discovery, by ranking complexes of the target with different small molecules; PoseScore can be used for protein-ligand complex structure prediction, by ranking different conformations of a given protein-ligand pair. The statistical potentials are available through the Integrative Modeling Platform (IMP) software package (http://salilab.org/imp) and the LigScore Web server (http://salilab.org/ligscore/).

摘要

应用于结构生物学和药物化学的蛋白质配体评分函数需要完成两个不同的任务

（i）对小分子在蛋白质结合部位的不同构象进行排序；（ii）根据小分子与蛋白质部位的互补性对小分子进行排序。我们使用概率论开发了两种原子距离相关的统计评分函数：PoseScore 旨在识别来自其他构象的配体的天然结合构象，而 RankScore 旨在区分配体和非结合分子。这两个分数都是基于 8885 个蛋白质-配体复合物的晶体结构，但在三个关键参数的值上有所不同。还研究了影响评分准确性的因素，包括用于评分的最大原子距离和非天然配体构象，以及使用蛋白质模型代替晶体结构进行评分函数的训练和测试。对于 19 个靶标测试集，RankScore 提高了 DOCK 3.6 计算的 13 个和 14 个靶标配体的富集（logAUC）和早期富集（EF(1)）分数。此外，RankScore 在重新评分方面的表现优于测试的其他七种评分函数中的每一种。在接受晶体结构和带有所有原子 RMSD 误差高达 2Å 的诱饵构象的情况下，PoseScore 在包含 100 个蛋白质-配体复合物的基准集中，在 100 个诱饵构象中，88%的情况下，对正确结合构象的评分优于其他 12 种测试的评分函数。PoseScore 的准确性可与 DrugScore(CSD)和 ITScore/SE 相媲美，优于其他 12 种测试的评分函数。因此，RankScore 可以通过对目标与不同小分子的复合物进行排序来促进配体发现；PoseScore 可以通过对给定蛋白质-配体对的不同构象进行排序来用于蛋白质-配体复合物结构预测。统计势可通过 Integrative Modeling Platform (IMP) 软件包（http://salilab.org/imp）和 LigScore Web 服务器（http://salilab.org/ligscore/）获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b25/3246566/7e9798a0abad/nihms340113f1.jpg

相似文献

Statistical potential for modeling and ranking of protein-ligand interactions.用于蛋白质-配体相互作用建模和排序的统计势能。

J Chem Inf Model. 2011 Dec 27;51(12):3078-92. doi: 10.1021/ci200377u. Epub 2011 Nov 21.

DrugScore(CSD)-knowledge-based scoring function derived from small molecule crystal data with superior recognition rate of near-native ligand poses and better affinity prediction.DrugScore（CSD）——一种基于小分子晶体数据的知识评分函数，对近天然配体构象具有卓越的识别率和更好的亲和力预测能力。

J Med Chem. 2005 Oct 6;48(20):6296-303. doi: 10.1021/jm050436v.

Boosted neural networks scoring functions for accurate ligand docking and ranking.用于精确配体对接和排序的增强神经网络评分函数。

J Bioinform Comput Biol. 2018 Apr;16(2):1850004. doi: 10.1142/S021972001850004X. Epub 2018 Feb 4.

A knowledge-based energy function for protein-ligand, protein-protein, and protein-DNA complexes.一种用于蛋白质-配体、蛋白质-蛋白质和蛋白质-DNA复合物的基于知识的能量函数。

J Med Chem. 2005 Apr 7;48(7):2325-35. doi: 10.1021/jm049314d.

Improving docking results via reranking of ensembles of ligand poses in multiple X-ray protein conformations with MM-GBSA.通过使用 MM-GBSA 对多个 X 射线蛋白质构象中的配体构象进行重新排序，从而提高对接结果。

J Chem Inf Model. 2014 Oct 27;54(10):2697-717. doi: 10.1021/ci5003735. Epub 2014 Sep 30.

Comparative assessment of scoring functions on a diverse test set.在多样化测试集上对评分函数的比较评估。

J Chem Inf Model. 2009 Apr;49(4):1079-93. doi: 10.1021/ci9000053.

Assessing scoring functions for protein-ligand interactions.评估蛋白质-配体相互作用的评分函数。

J Med Chem. 2004 Jun 3;47(12):3032-47. doi: 10.1021/jm030489h.

An evaluation of combined strategies for improving the performance of molecular docking.评价提高分子对接性能的联合策略。

J Bioinform Comput Biol. 2021 Apr;19(2):2150003. doi: 10.1142/S0219720021500037. Epub 2021 Feb 27.

Target-specific native/decoy pose classifier improves the accuracy of ligand ranking in the CSAR 2013 benchmark.靶点特异性天然/诱饵构象分类器提高了CSAR 2013基准测试中配体排名的准确性。

J Chem Inf Model. 2015 Jan 26;55(1):63-71. doi: 10.1021/ci500519w. Epub 2014 Dec 18.

The consequences of scoring docked ligand conformations using free energy correlations.使用自由能相关性对对接配体构象进行评分的后果。

Eur J Med Chem. 2007 Jul;42(7):921-33. doi: 10.1016/j.ejmech.2006.12.037. Epub 2007 Jan 21.

引用本文的文献

Identifying potent inhibitors for Mycobacterium tuberculosis MabA (FabG1).鉴定结核分枝杆菌MabA（FabG1）的有效抑制剂。

Mol Divers. 2025 May 13. doi: 10.1007/s11030-025-11205-7.

In Vitro Evaluation of the Antiviral Activity of Polyphenol (-)-Epigallocatechin-3-Gallate (EGCG) Against Mayaro Virus.表没食子儿茶素-3-没食子酸酯（EGCG）对马亚罗病毒抗病毒活性的体外评价

Viruses. 2025 Feb 14;17(2):258. doi: 10.3390/v17020258.

Normalized Protein-Ligand Distance Likelihood Score for End-to-End Blind Docking and Virtual Screening.用于端到端盲对接和虚拟筛选的归一化蛋白质-配体距离似然得分

J Chem Inf Model. 2025 Feb 10;65(3):1101-1114. doi: 10.1021/acs.jcim.4c01014. Epub 2025 Jan 17.

Detailed Analyses of Molecular Interactions between Favipiravir and RNA Viruses In Silico.计算机模拟分析非那韦与 RNA 病毒的分子相互作用

Viruses. 2022 Feb 7;14(2):338. doi: 10.3390/v14020338.

Target specificity of selective bioactive compounds in blocking α-dystroglycan receptor to suppress Lassa virus infection: an approach.选择性生物活性化合物阻断α- dystroglycan受体以抑制拉沙病毒感染的靶向特异性：一种方法。

J Biomed Res. 2021 Nov 6;35(6):459-473. doi: 10.7555/JBR.35.20210111.

Ligand Strain Energy in Large Library Docking.配体应变能在大型文库对接中的应用。

J Chem Inf Model. 2021 Sep 27;61(9):4331-4341. doi: 10.1021/acs.jcim.1c00368. Epub 2021 Sep 1.

Extension of an Atom-Atom Dispersion Function to Halogen Bonds and Its Use for Rational Design of Drugs and Biocatalysts.原子-原子色散函数向卤键的扩展及其在药物和生物催化剂合理设计中的应用。

J Phys Chem A. 2021 Mar 4;125(8):1787-1799. doi: 10.1021/acs.jpca.0c11347. Epub 2021 Feb 23.

How Far Are We from the Rapid Prediction of Drug Resistance Arising Due to Kinase Mutations?我们距离快速预测激酶突变引起的耐药性还有多远？

ACS Omega. 2021 Jan 4;6(2):1254-1265. doi: 10.1021/acsomega.0c04672. eCollection 2021 Jan 19.

Archiving and disseminating integrative structure models.整合结构模型的归档和传播。

J Biomol NMR. 2019 Jul;73(6-7):385-398. doi: 10.1007/s10858-019-00264-2. Epub 2019 Jul 5.

Integrative structure modeling with the Integrative Modeling Platform.使用整合建模平台进行整合结构建模。

Protein Sci. 2018 Jan;27(1):245-258. doi: 10.1002/pro.3311. Epub 2017 Oct 10.

本文引用的文献

Ligand discovery from a dopamine D3 receptor homology model and crystal structure.从多巴胺 D3 受体同源模型和晶体结构中发现配体。

Nat Chem Biol. 2011 Sep 18;7(11):769-78. doi: 10.1038/nchembio.662.

Structure-based discovery of prescription drugs that interact with the norepinephrine transporter, NET.基于结构的去甲肾上腺素转运体（NET）相互作用的处方药发现。

Proc Natl Acad Sci U S A. 2011 Sep 20;108(38):15810-5. doi: 10.1073/pnas.1106030108. Epub 2011 Sep 1.

Discovery of a cytokinin deaminase.细胞分裂素脱氨酶的发现。

ACS Chem Biol. 2011 Oct 21;6(10):1036-40. doi: 10.1021/cb200198c. Epub 2011 Aug 12.

Enzymatic deamination of the epigenetic base N-6-methyladenine.酶促脱氨作用对表观遗传碱基 N6-甲基腺嘌呤的影响。

J Am Chem Soc. 2011 Feb 23;133(7):2080-3. doi: 10.1021/ja110157u. Epub 2011 Jan 28.

Rapid context-dependent ligand desolvation in molecular docking.快速上下文相关配体去溶剂化在分子对接中的作用。

J Chem Inf Model. 2010 Sep 27;50(9):1561-73. doi: 10.1021/ci100214a.

New statistical potential for quality assessment of protein models and a survey of energy functions.新的蛋白质模型质量评估统计势函数和能量函数综述。

BMC Bioinformatics. 2010 Mar 12;11:128. doi: 10.1186/1471-2105-11-128.

Inclusion of solvation and entropy in the knowledge-based scoring function for protein-ligand interactions.将溶剂化和熵纳入基于知识的蛋白质-配体相互作用评分函数中。

J Chem Inf Model. 2010 Feb 22;50(2):262-73. doi: 10.1021/ci9002987.

Molecular docking screens using comparative models of proteins.利用蛋白质比较模型进行分子对接筛选。

J Chem Inf Model. 2009 Nov;49(11):2512-27. doi: 10.1021/ci9003706.

Automated docking screens: a feasibility study.自动对接筛选：一项可行性研究。

J Med Chem. 2009 Sep 24;52(18):5712-20. doi: 10.1021/jm9006966.

Evaluation of ligand-binding affinity using polynomial empirical scoring functions.使用多项式经验评分函数评估配体结合亲和力。

Bioorg Med Chem. 2008 Oct 15;16(20):9378-82. doi: 10.1016/j.bmc.2008.08.014. Epub 2008 Aug 7.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验