Brandon Christopher J, Martin Benjamin P, McGee Kelly J, Stewart James J P, Braun-Sand Sonja B
Department of Chemistry and Biochemistry, University of Colorado, Colorado Springs, Colorado Springs, CO, 80918, USA,
J Mol Model. 2015 Jan;21(1):3. doi: 10.1007/s00894-014-2520-1. Epub 2015 Jan 22.
An accurate model of three-dimensional protein structure is important in a variety of fields such as structure-based drug design and mechanistic studies of enzymatic reactions. While the entries in the Protein Data Bank ( http://www.pdb.org ) provide valuable information about protein structures, a small fraction of the PDB structures were found to contain anomalies not reported in the PDB file. The semiempirical PM7 method in MOPAC2012 was used for identifying anomalously short hydrogen bonds, C-H⋯O/C-H⋯N interactions, non-bonding close contacts, and unrealistic covalent bond lengths in recently published Protein Data Bank files. It was also used to generate new structures with these faults removed. When the semiempirical models were compared to those of PDB_REDO (http://www.cmbi.ru.nl/pdb_redo/), the clashscores, as defined by MolProbity ( http://molprobity.biochem.duke.edu/), were better in about 50% of the structures. The semiempirical models also had a lower root-mean-square-deviation value in nearly all cases than those from PDB_REDO, indicative of a better conservation of the tertiary structure. Finally, the semiempirical models were found to have lower clashscores than the initial PDB file in all but one case. Because this approach maintains as much of the original tertiary structure as possible while improving anomalous interactions, it should be useful to theoreticians, experimentalists, and crystallographers investigating the structure and function of proteins.
精确的三维蛋白质结构模型在诸如基于结构的药物设计和酶促反应机理研究等多个领域都非常重要。虽然蛋白质数据库(http://www.pdb.org)中的条目提供了有关蛋白质结构的宝贵信息,但发现一小部分蛋白质数据库结构包含蛋白质数据库文件中未报告的异常情况。MOPAC2012中的半经验PM7方法用于识别最近发布的蛋白质数据库文件中异常短的氢键、C-H⋯O/C-H⋯N相互作用、非键紧密接触以及不切实际的共价键长度。它还用于生成去除这些错误的新结构。当将半经验模型与PDB_REDO(http://www.cmbi.ru.nl/pdb_redo/)的模型进行比较时,由MolProbity(http://molprobity.biochem.duke.edu/)定义的冲突分数在约50%的结构中更好。在几乎所有情况下,半经验模型的均方根偏差值也低于PDB_REDO的模型,这表明三级结构的保守性更好。最后,除了一个案例外,发现半经验模型的冲突分数低于初始蛋白质数据库文件。由于这种方法在改善异常相互作用的同时尽可能多地保留了原始三级结构,因此对于研究蛋白质结构和功能的理论学家、实验学家和晶体学家应该是有用的。