Department of Computational and Systems Biology, University of Pittsburgh, 3501 Fifth Ave, Pittsburgh, PA, 15260, United States.
Department of Computational and Systems Biology, University of Pittsburgh, 3501 Fifth Ave, Pittsburgh, PA, 15260, United States.
J Mol Graph Model. 2018 Sep;84:96-108. doi: 10.1016/j.jmgm.2018.06.005. Epub 2018 Jun 18.
Protein-ligand scoring is an important step in a structure-based drug design pipeline. Selecting a correct binding pose and predicting the binding affinity of a protein-ligand complex enables effective virtual screening. Machine learning techniques can make use of the increasing amounts of structural data that are becoming publicly available. Convolutional neural network (CNN) scoring functions in particular have shown promise in pose selection and affinity prediction for protein-ligand complexes. Neural networks are known for being difficult to interpret. Understanding the decisions of a particular network can help tune parameters and training data to maximize performance. Visualization of neural networks helps decompose complex scoring functions into pictures that are more easily parsed by humans. Here we present three methods for visualizing how individual protein-ligand complexes are interpreted by 3D convolutional neural networks. We also present a visualization of the convolutional filters and their weights. We describe how the intuition provided by these visualizations aids in network design.
蛋白质-配体评分是基于结构的药物设计流程中的重要步骤。选择正确的结合构象并预测蛋白质-配体复合物的结合亲和力,可以实现有效的虚拟筛选。机器学习技术可以利用越来越多的公开结构数据。特别是卷积神经网络 (CNN) 评分函数在蛋白质-配体复合物的构象选择和亲和力预测方面表现出了很大的潜力。神经网络的可解释性一直是一个难题。理解特定网络的决策有助于调整参数和训练数据,以实现最佳性能。神经网络的可视化可以将复杂的评分函数分解为更易于人类解析的图片。在这里,我们提出了三种方法来可视化 3D 卷积神经网络如何解释单个蛋白质-配体复合物。我们还展示了卷积滤波器及其权重的可视化。我们描述了这些可视化所提供的直观信息如何辅助网络设计。