Liu Tianyun, Tang Grace W, Capriotti Emidio
Department of Bioengineering, Stanford University, 318 Campus Dr, Room S240 Mail code: 5448, Stanford, CA 94305, USA.
Comb Chem High Throughput Screen. 2011 Jul;14(6):532-47. doi: 10.2174/138620711795767811.
The goal of computational protein structure prediction is to provide three-dimensional (3D) structures with resolution comparable to experimental results. Comparative modeling, which predicts the 3D structure of a protein based on its sequence similarity to homologous structures, is the most accurate computational method for structure prediction. In the last two decades, significant progress has been made on comparative modeling methods. Using the large number of protein structures deposited in the Protein Data Bank (65,000), automatic prediction pipelines are generating a tremendous number of models (1.9 million) for sequences whose structures have not been experimentally determined. Accurate models are suitable for a wide range of applications, such as prediction of protein binding sites, prediction of the effect of protein mutations, and structure-guided virtual screening. In particular, comparative modeling has enabled structure-based drug design against protein targets with unknown structures. In this review, we describe the theoretical basis of comparative modeling, the available automatic methods and databases, and the algorithms to evaluate the accuracy of predicted structures. Finally, we discuss relevant applications in the prediction of important drug target proteins, focusing on the G protein-coupled receptor (GPCR) and protein kinase families.
计算蛋白质结构预测的目标是提供分辨率与实验结果相当的三维(3D)结构。比较建模是基于蛋白质与同源结构的序列相似性来预测其3D结构,是结构预测中最准确的计算方法。在过去二十年中,比较建模方法取得了显著进展。利用蛋白质数据库中大量的蛋白质结构(约65,000个),自动预测流程正在为那些结构尚未通过实验确定的序列生成大量模型(约190万个)。准确的模型适用于广泛的应用,如蛋白质结合位点预测、蛋白质突变效应预测以及基于结构的虚拟筛选。特别是,比较建模使得针对结构未知的蛋白质靶点进行基于结构的药物设计成为可能。在本综述中,我们描述了比较建模的理论基础、可用的自动方法和数据库,以及评估预测结构准确性的算法。最后,我们讨论在重要药物靶点蛋白质预测中的相关应用,重点关注G蛋白偶联受体(GPCR)和蛋白激酶家族。