Hangzhou Institute of Innovative Medicine, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, Zhejiang, P. R. China.
State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau, SAR, China.
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa410.
Machine-learning (ML)-based scoring functions (MLSFs) have gradually emerged as a promising alternative for protein-ligand binding affinity prediction and structure-based virtual screening. However, clouds of doubts have still been raised against the benefits of this novel type of scoring functions (SFs). In this study, to benchmark the performance of target-specific MLSFs on a relatively unbiased dataset, the MLSFs trained from three representative protein-ligand interaction representations were assessed on the LIT-PCBA dataset, and the classical Glide SP SF and three types of ligand-based quantitative structure-activity relationship (QSAR) models were also utilized for comparison. Two major aspects in virtual screening campaigns, including prediction accuracy and hit novelty, were systematically explored. The calculation results illustrate that the tested target-specific MLSFs yielded generally superior performance over the classical Glide SP SF, but they could hardly outperform the 2D fingerprint-based QSAR models. Although substantial improvements could be achieved by integrating multiple types of protein-ligand interaction features, the MLSFs were still not sufficient to exceed MACCS-based QSAR models. In terms of the correlations between the hit ranks or the structures of the top-ranked hits, the MLSFs developed by different featurization strategies would have the ability to identify quite different hits. Nevertheless, it seems that target-specific MLSFs do not have the intrinsic attributes of a traditional SF and may not be a substitute for classical SFs. In contrast, MLSFs can be regarded as a new derivative of ligand-based QSAR models. It is expected that our study may provide valuable guidance for the assessment and further development of target-specific MLSFs.
基于机器学习 (ML) 的打分函数 (MLSFs) 逐渐成为预测蛋白质-配体结合亲和力和基于结构的虚拟筛选的有前途的替代方法。然而,对于这种新型打分函数 (SFs) 的益处,仍然存在许多疑虑。在这项研究中,为了在相对无偏数据集上基准化针对特定目标的 MLSFs 的性能,评估了基于三种代表性蛋白质-配体相互作用表示的 MLSFs 在 LIT-PCBA 数据集上的性能,同时还利用了经典的 Glide SP SF 和三种类型的基于配体的定量构效关系 (QSAR) 模型进行比较。系统地探讨了虚拟筛选活动中的两个主要方面,包括预测准确性和命中新颖性。计算结果表明,测试的针对特定目标的 MLSFs 通常优于经典的 Glide SP SF,但很难超过基于 2D 指纹的 QSAR 模型。虽然通过整合多种类型的蛋白质-配体相互作用特征可以取得实质性的改进,但 MLSFs 仍然不足以超过基于 MACCS 的 QSAR 模型。就命中排名或排名靠前的命中结构之间的相关性而言,采用不同特征化策略开发的 MLSFs 有能力识别出截然不同的命中。然而,针对特定目标的 MLSFs 似乎没有传统 SF 的内在属性,并且可能不能替代经典 SF。相比之下,MLSFs 可以被视为基于配体的 QSAR 模型的一种新衍生。预计我们的研究可为针对特定目标的 MLSFs 的评估和进一步发展提供有价值的指导。