Ma Jianpeng
Department of Biochemistry and Molecular Biology, Baylor College of Medicine, One Baylor Plaza, Houston, Texas 77030, USA.
Acc Chem Res. 2009 Aug 18;42(8):1087-96. doi: 10.1021/ar900009e.
Protein structure modeling and prediction have important applications throughout the biological sciences, from the design of pharmaceuticals to the elucidation of enzyme mechanisms. At the core of most protein modeling is an energy function, the minimum of which represents the free energy "cost" for forming a correct protein structure. The most commonly used energy functions are knowledge-based statistical potential functions; that is, they are empirically derived from statistical analysis of a set of high-resolution protein structures. When that kind of potential function is constructed, the anisotropic orientation dependence between the interacting groups is a critical component for accurately representing key molecular interactions, such as those involved in protein side-chain packing. In the literature, however, many potential functions are limited in their ability to describe orientation dependence. In all-atom potentials, they typically ignore heterogeneous chemical-bond connectivity. In coarse-grained potentials, such as (semi)-residue-based potentials, the simplified representation of residues often reduces the sensitivity of the potential to side-chain orientation. Recently, in an effort to maximally capture the orientation dependence in side-chain interactions, a new type of all-atom statistical potential was developed: OPUS-PSP (potential derived from side-chain packing). The key feature of this potential is its explicit description of orientation dependence in molecular interactions, which is achieved with a basis set of 19 rigid-body blocks extracted from the chemical structures of 20 amino acid residues. This basis set is specifically designed to maximally capture the essential elements of orientation dependence in molecular packing interactions. The potential is constructed from the orientation-specific packing statistics of pairs of those blocks in a nonredundant structural database. On decoy set tests, OPUS-PSP significantly outperforms most of the existing knowledge-based potentials in terms of both its ability to recognize native structures and its consistency in achieving high Z scores across decoy sets. The application of OPUS-PSP to conformational modeling of side chains has led to another method, called OPUS-Rota. In terms of combined speed and accuracy, OPUS-Rota outperforms all of the other methods in modeling side-chain conformation. In this Account, we briefly outline the basic scheme of the OPUS-PSP potential and its application to side-chain modeling via OPUS-Rota. Future perspectives on the modeling of orientation dependence are also discussed. The computer programs for OPUS-PSP and OPUS-Rota can be downloaded at http://sigler.bioch.bcm.tmc.edu/MaLab . They are free for academic users.
蛋白质结构建模与预测在整个生物科学领域都有重要应用,从药物设计到酶机制的阐明。大多数蛋白质建模的核心是一个能量函数,其最小值代表形成正确蛋白质结构的自由能“成本”。最常用的能量函数是基于知识的统计势函数;也就是说,它们是从一组高分辨率蛋白质结构的统计分析中经验性推导出来的。构建这类势函数时,相互作用基团之间的各向异性取向依赖性是准确表示关键分子相互作用(如蛋白质侧链堆积中涉及的相互作用)的关键组成部分。然而,在文献中,许多势函数描述取向依赖性的能力有限。在全原子势中,它们通常忽略异构化学键的连接性。在粗粒度势中,如(半)基于残基的势,残基的简化表示通常会降低势对侧链取向的敏感性。最近,为了最大程度地捕捉侧链相互作用中的取向依赖性,开发了一种新型的全原子统计势:OPUS - PSP(源自侧链堆积的势)。这种势的关键特征是其对分子相互作用中取向依赖性的明确描述,这是通过从20种氨基酸残基的化学结构中提取的19个刚体模块的基组来实现的。这个基组经过专门设计,以最大程度地捕捉分子堆积相互作用中取向依赖性的基本要素。该势是根据这些模块对在非冗余结构数据库中的特定取向堆积统计构建的。在诱饵集测试中,就识别天然结构的能力以及在各个诱饵集上获得高Z分数的一致性而言,OPUS - PSP显著优于大多数现有的基于知识的势。OPUS - PSP在侧链构象建模中的应用产生了另一种方法,称为OPUS - Rota。在速度和准确性方面,OPUS - Rota在侧链构象建模中优于所有其他方法。在本综述中,我们简要概述了OPUS - PSP势的基本方案及其通过OPUS - Rota在侧链建模中的应用。还讨论了取向依赖性建模的未来展望。OPUS - PSP和OPUS - Rota的计算机程序可从http://sigler.bioch.bcm.tmc.edu/MaLab下载。它们对学术用户免费。