a Department of Chemistry , Vanderbilt University , Nashville , TN , USA.
b Center for Structural Biology , Vanderbilt University , Nashville , TN , USA.
Crit Rev Biochem Mol Biol. 2018 Feb;53(1):1-28. doi: 10.1080/10409238.2017.1380596. Epub 2017 Oct 4.
Prediction of protein tertiary structures from amino acid sequence and understanding the mechanisms of how proteins fold, collectively known as "the protein folding problem," has been a grand challenge in molecular biology for over half a century. Theories have been developed that provide us with an unprecedented understanding of protein folding mechanisms. However, computational simulation of protein folding is still difficult, and prediction of protein tertiary structure from amino acid sequence is an unsolved problem. Progress toward a satisfying solution has been slow due to challenges in sampling the vast conformational space and deriving sufficiently accurate energy functions. Nevertheless, several techniques and algorithms have been adopted to overcome these challenges, and the last two decades have seen exciting advances in enhanced sampling algorithms, computational power and tertiary structure prediction methodologies. This review aims at summarizing these computational techniques, specifically conformational sampling algorithms and energy approximations that have been frequently used to study protein-folding mechanisms or to de novo predict protein tertiary structures. We hope that this review can serve as an overview on how the protein-folding problem can be studied computationally and, in cases where experimental approaches are prohibitive, help the researcher choose the most relevant computational approach for the problem at hand. We conclude with a summary of current challenges faced and an outlook on potential future directions.
从氨基酸序列预测蛋白质的三级结构并理解蛋白质折叠的机制,这一被称为“蛋白质折叠问题”的难题,在半个多世纪以来一直是分子生物学领域的重大挑战。已经有一些理论为我们提供了对蛋白质折叠机制的前所未有的理解。然而,蛋白质折叠的计算模拟仍然很困难,从氨基酸序列预测蛋白质的三级结构仍然是一个尚未解决的问题。由于在采样广阔构象空间和推导足够准确的能量函数方面存在挑战,因此朝着令人满意的解决方案取得进展一直很缓慢。尽管如此,已经采用了几种技术和算法来克服这些挑战,在过去的二十年中,增强采样算法、计算能力和三级结构预测方法方面取得了令人兴奋的进展。这篇综述旨在总结这些计算技术,特别是构象采样算法和能量近似,这些技术和算法经常被用于研究蛋白质折叠机制或从头预测蛋白质的三级结构。我们希望这篇综述可以作为一个概述,说明如何通过计算的方法来研究蛋白质折叠问题,并且在实验方法不可行的情况下,帮助研究人员为手头的问题选择最相关的计算方法。我们以总结当前面临的挑战和展望潜在的未来方向作为结束。