Consensus scoring criteria for improving enrichment in virtual screening.

Authors

Yang Jinn-Moon, Chen Yen-Fu, Shen Tsai-Wei, Kristal Bruce S, Hsu D Frank

Affiliation

Department of Biological Science and Technology, National Chiao Tung University, Hsinchu 30050, Taiwan.

Publication

J Chem Inf Model. 2005 Jul-Aug;45(4):1134-46. doi: 10.1021/ci050034w.

DOI:10.1021/ci050034w
PMID:16045308
Abstract

MOTIVATION

Virtual screening of molecular compound libraries is a potentially powerful and inexpensive method for the discovery of novel lead compounds for drug development. The major weakness of virtual screening, the inability to consistently identify true positives (leads), is likely due to our incomplete understanding of the chemistry involved in ligand binding and the consequently imprecise scoring algorithms. It has been demonstrated that combining multiple scoring functions (consensus scoring) improves the enrichment of true positives. Previous efforts at consensus scoring have largely focused on empirical results and have yet to provide a theoretical analysis that gives insight into the real features of combinations and data fusion for virtual screening.
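As an illustration of what consensus scoring means in practice, the following is a minimal sketch of rank combination followed by an enrichment calculation. It is not the authors' implementation: the function names, the averaging of ranks, the convention that a lower score is better, and the ten-compound toy data are all assumptions made for this example.

```python
def ranks(scores):
    """Rank compounds by score; the lowest (most favorable) score gets rank 1."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    result = [0] * len(scores)
    for position, idx in enumerate(order, start=1):
        result[idx] = position
    return result


def rank_combination(score_lists):
    """Consensus value per compound: its mean rank across all scoring functions."""
    all_ranks = [ranks(s) for s in score_lists]
    n = len(score_lists[0])
    return [sum(r[i] for r in all_ranks) / len(all_ranks) for i in range(n)]


def enrichment_factor(values, actives, top_fraction):
    """Hit rate in the top fraction divided by the hit rate in the whole library.

    `values` may be raw scores or consensus ranks; lower means better.
    """
    n = len(values)
    n_top = max(1, round(n * top_fraction))
    top = sorted(range(n), key=lambda i: values[i])[:n_top]
    hits = sum(1 for i in top if i in actives)
    return hits * n / (n_top * len(actives))


# Hypothetical toy data: 10 compounds, 2 scoring functions, actives at index 0 and 5.
scores_a = [-9.1, -3.0, -8.5, -2.2, -4.0, -7.9, -1.5, -3.3, -2.8, -5.0]
scores_b = [-35.0, -38.0, -12.0, -10.0, -30.0, -40.0, -8.0, -11.0, -9.0, -15.0]
actives = {0, 5}

consensus = rank_combination([scores_a, scores_b])
print(enrichment_factor(scores_a, actives, 0.2))
print(enrichment_factor(scores_b, actives, 0.2))
print(enrichment_factor(consensus, actives, 0.2))
```

In this toy data each scoring function alone places only one of the two actives in its top two compounds, while the averaged ranks place both there. Real libraries will not necessarily behave this way.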

RESULTS

We demonstrate that combining multiple scoring functions improves the enrichment of true positives only if (a) each of the individual scoring functions has relatively high performance and (b) the individual scoring functions are distinctive. Notably, these two prediction variables are previously established criteria for the performance of data fusion approaches using either rank or score combinations. This work thus establishes a potential theoretical basis for the probable success of data fusion approaches in improving yields in in silico screening experiments. Furthermore, we show that criterion (b) can, in at least some cases, be functionally defined as the area between the rank-versus-score plots generated by the two (or more) algorithms. Because rank-score plots are independent of the performance of the individual scoring functions, this establishes a second theoretically defined approach to determining the likely success of combining data from different predictive algorithms. This approach is therefore useful in practical virtual screening settings when at least two individual scoring functions can be estimated as likely to perform well (criterion a), even if no training sets are available. We provide initial validation of this theoretical approach using data from five scoring systems with two evolutionary docking algorithms on four targets: thymidine kinase, human dihydrofolate reductase, and the estrogen receptor for antagonists and for agonists. Our procedure is computationally efficient, adaptable to different situations, and scalable to a large number of compounds as well as to a greater number of combinations. The experiments show a fairly significant improvement over single algorithms in several measures of scoring quality, specifically "goodness-of-hit" scores, false positive rates, and "enrichment". This approach (available online at http://gemdock.life.nctu.edu.tw/dock/download.php) has practical utility for cases where the basic tools are known or believed to be generally applicable, but where specific training sets are absent.
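Criterion (b), the area between rank-versus-score plots, can be sketched as follows. This is one plausible reading of the abstract rather than the paper's exact definition: the min-max normalization, the convention that a lower raw score is better, the division by the number of compounds, and the function names are assumptions made for this example.

```python
def rank_score_function(scores):
    """Normalized scores listed in rank order (rank 1 = best = lowest raw score).

    Min-max normalization (an assumption here) maps the best raw score to 1.0
    and the worst to 0.0, so different scoring functions share one scale.
    """
    lo, hi = min(scores), max(scores)
    if hi == lo:
        raise ValueError("all scores are identical; cannot normalize")
    normalized = [(hi - s) / (hi - lo) for s in scores]
    return sorted(normalized, reverse=True)


def rank_score_diversity(scores_a, scores_b):
    """Mean absolute gap between two rank-score curves over all ranks."""
    f_a = rank_score_function(scores_a)
    f_b = rank_score_function(scores_b)
    if len(f_a) != len(f_b):
        raise ValueError("both scoring functions must score the same compounds")
    return sum(abs(x - y) for x, y in zip(f_a, f_b)) / len(f_a)


# Hypothetical scores for five compounds under three scoring functions.
linear = [-4.0, -3.0, -2.0, -1.0, 0.0]
rescaled = [-8.0, -6.0, -4.0, -2.0, 0.0]   # same curve shape as `linear`
skewed = [-4.0, -1.0, -1.0, 0.0, 0.0]      # one standout, the rest bunched

print(rank_score_diversity(linear, rescaled))
print(rank_score_diversity(linear, skewed))
```

Because the curve depends only on how scores are distributed over ranks, not on which compound receives which rank, it can be computed without knowing the true actives. This matches the abstract's point that rank-score plots are independent of the performance of the individual scoring functions.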

Similar articles

1
Consensus scoring criteria for improving enrichment in virtual screening.
J Chem Inf Model. 2005 Jul-Aug;45(4):1134-46. doi: 10.1021/ci050034w.
2
SeleX-CS: a new consensus scoring algorithm for hit discovery and lead optimization.
J Chem Inf Model. 2009 Mar;49(3):623-33. doi: 10.1021/ci800335j.
3
Evaluation of library ranking efficacy in virtual screening.
J Comput Chem. 2005 Jan 15;26(1):11-22. doi: 10.1002/jcc.20141.
4
Considerations in compound database preparation--"hidden" impact on virtual screening results.
J Chem Inf Model. 2005 Nov-Dec;45(6):1908-19. doi: 10.1021/ci050185z.
5
Protein flexibility in ligand docking and virtual screening to protein kinases.
J Mol Biol. 2004 Mar 12;337(1):209-25. doi: 10.1016/j.jmb.2004.01.003.
6
Binding energy landscape analysis helps to discriminate true hits from high-scoring decoys in virtual screening.
J Chem Inf Model. 2010 Oct 25;50(10):1855-64. doi: 10.1021/ci900463u.
7
Maximum common binding modes (MCBM): consensus docking scoring using multiple ligand information and interaction fingerprints.
J Chem Inf Model. 2008 Feb;48(2):319-32. doi: 10.1021/ci7003626. Epub 2008 Jan 23.
8
A pharmacophore-based evolutionary approach for screening selective estrogen receptor modulators.
Proteins. 2005 May 1;59(2):205-20. doi: 10.1002/prot.20387.
9
Comparative assessment of scoring functions on a diverse test set.
J Chem Inf Model. 2009 Apr;49(4):1079-93. doi: 10.1021/ci9000053.
10
Retrospective docking study of PDE4B ligands and an analysis of the behavior of selected scoring functions.
J Chem Inf Model. 2005 Jul-Aug;45(4):1061-74. doi: 10.1021/ci050044x.

Cited by

1
Blocking XIAP:CASP7-p19 selectively induces apoptosis of CASP3/DR malignancies by a novel reversible small molecule.
Cell Death Dis. 2025 Jun 18;16(1):459. doi: 10.1038/s41419-025-07774-y.
2
Consensus holistic virtual screening for drug discovery: a novel machine learning model approach.
J Cheminform. 2024 May 28;16(1):62. doi: 10.1186/s13321-024-00855-8.
3
Exploration of a Large Virtual Chemical Space: Identification of Potent Inhibitors of Lactate Dehydrogenase-A against Pancreatic Cancer.
J Chem Inf Model. 2023 Feb 13;63(3):1028-1043. doi: 10.1021/acs.jcim.2c01544. Epub 2023 Jan 16.
4
ES-Screen: A Novel Electrostatics-Driven Method for Drug Discovery Virtual Screening.
Int J Mol Sci. 2022 Nov 27;23(23):14830. doi: 10.3390/ijms232314830.
5
Predicting drug toxicity at the intersection of informatics and biology: DTox builds a foundation.
Patterns (N Y). 2022 Sep 9;3(9):100586. doi: 10.1016/j.patter.2022.100586.
6
Consensus scoring evaluated using the GPCR-Bench dataset: Reconsidering the role of MM/GBSA.
J Comput Aided Mol Des. 2022 Jun;36(6):427-441. doi: 10.1007/s10822-022-00456-3. Epub 2022 May 18.
7
Ranks underlie outcome of combining classifiers: Quantitative roles for and .
Patterns (N Y). 2021 Dec 22;3(2):100415. doi: 10.1016/j.patter.2021.100415. eCollection 2022 Feb 11.
8
Improving SDG Classification Precision Using Combinatorial Fusion.
Sensors (Basel). 2022 Jan 29;22(3):1067. doi: 10.3390/s22031067.
9
Identifying potential inhibitors of biofilm-antagonistic proteins to promote biofilm formation: a virtual screening and molecular dynamics simulations approach.
Mol Divers. 2022 Aug;26(4):2135-2147. doi: 10.1007/s11030-021-10320-5. Epub 2021 Sep 21.
10
Molecular insights on ABL kinase activation using tree-based machine learning models and molecular docking.
Mol Divers. 2021 Aug;25(3):1301-1314. doi: 10.1007/s11030-021-10261-z. Epub 2021 Jun 30.