Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China.
Chem Biol Drug Des. 2020 Sep;96(3):973-983. doi: 10.1111/cbdd.13648.
Deep learning-based methods have been extensively developed to improve scoring performance in structure-based drug discovery. Extending multitask deep networks in addressing pharmaceutical problems shows remarkable improvements over single task network. Recently, grid featurization has been introduced to convert protein-ligand complex co-ordinates into fingerprints with the advantage of incorporating inter- and intra-molecular information. The combination of grid featurization with multitask deep networks would hold great potential to boost the scoring performance. We examined the performance of three novel multitask deep networks (standard multitask, bypass, and progressive network) in reproducing the binding affinities of protein-ligand complexes in comparison with AutoDock Vina docking and MM/GBSA method. Among five evaluated methods, progressive network combined with grid featurization provided the best Pearson correlation coefficient (0.74) and least mean absolute average error (0.98) for the overall scoring performance. Moreover, all networks increased screening ability for the re-docking pose and progressive network even achieved AUC of 0.87 over 0.52 of AutoDock Vina. Our results demonstrated that progressive network combined with grid featurization would be one powerful rescoring approach to strengthen screening results after obtaining protein-ligand complex in the conventional docking software.
基于深度学习的方法已经被广泛开发,以提高基于结构的药物发现中的评分性能。在解决药物问题时,扩展多任务深度网络显示出比单任务网络更显著的改进。最近,网格特征化已被引入,将蛋白质-配体复合物的坐标转换为指纹,具有合并分子内和分子间信息的优势。网格特征化与多任务深度网络的结合将具有很大的潜力来提高评分性能。我们研究了三种新型多任务深度网络(标准多任务、旁路和渐进网络)在与 AutoDock Vina 对接和 MM/GBSA 方法相比,重现蛋白质-配体复合物结合亲和力的性能。在评估的五种方法中,渐进网络与网格特征化相结合,在整体评分性能方面提供了最好的 Pearson 相关系数(0.74)和最小平均绝对误差(0.98)。此外,所有网络都提高了再对接构象的筛选能力,而渐进网络甚至在 AutoDock Vina 的 0.52 之上实现了 0.87 的 AUC。我们的结果表明,渐进网络与网格特征化相结合,将是一种强大的重新评分方法,可在传统对接软件中获得蛋白质-配体复合物后,加强筛选结果。