Misiura Mikita, Shroff Raghav, Thyer Ross, Kolomeisky Anatoly B
Department of Chemistry, Center for Theoretical Biological Physics, Rice University, Houston, Texas, USA.
CCDC Army Research Lab, Austin, Texas, USA.
Proteins. 2022 Jun;90(6):1278-1290. doi: 10.1002/prot.26311. Epub 2022 Feb 22.
Prediction of side chain conformations of amino acids in proteins (also termed "packing") is an important and challenging part of protein structure prediction with many interesting applications in protein design. A variety of methods for packing have been developed but more accurate ones are still needed. Machine learning (ML) methods have recently become a powerful tool for solving various problems in diverse areas of science, including structural biology. In this study, we evaluate the potential of deep neural networks (DNNs) for prediction of amino acid side chain conformations. We formulate the problem as image-to-image transformation and train a U-net style DNN to solve the problem. We show that our method outperforms other physics-based methods by a significant margin: reconstruction RMSDs for most amino acids are about 20% smaller compared to SCWRL4 and Rosetta Packer with RMSDs for bulky hydrophobic amino acids Phe, Tyr, and Trp being up to 50% smaller.
预测蛋白质中氨基酸的侧链构象(也称为“堆积”)是蛋白质结构预测中一个重要且具有挑战性的部分,在蛋白质设计中有许多有趣的应用。已经开发了多种堆积方法,但仍需要更精确的方法。机器学习(ML)方法最近已成为解决包括结构生物学在内的不同科学领域中各种问题的强大工具。在本研究中,我们评估了深度神经网络(DNN)在预测氨基酸侧链构象方面的潜力。我们将该问题表述为图像到图像的转换,并训练一个U-net风格的DNN来解决该问题。我们表明,我们的方法比其他基于物理的方法有显著优势:与SCWRL4和Rosetta Packer相比,大多数氨基酸的重建均方根偏差(RMSD)小约20%,对于大体积疏水氨基酸苯丙氨酸(Phe)、酪氨酸(Tyr)和色氨酸(Trp),RMSD小高达50%。