Jaskolski Mariusz, Gilski Miroslaw, Dauter Zbigniew, Wlodawer Alexander
Department of Crystallography, Faculty of Chemistry, A. Mickiewicz University and Center for Biocrystallographic Research, Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland.
Acta Crystallogr D Biol Crystallogr. 2007 May;63(Pt 5):611-20. doi: 10.1107/S090744490700978X. Epub 2007 Apr 21.
The Protein Data Bank and Cambridge Structural Database were analyzed with the aim of verifying whether the restraints that are most commonly used for protein structure refinement are still appropriate 15 years after their introduction. From an analysis of selected main-chain parameters in well ordered fragments of ten highest resolution protein structures, it was concluded that some of the currently used geometrical target values should be adjusted somewhat (the C-N bond and the N-C(alpha)-C angle) or applied with less emphasis (peptide planarity). It was also found that the weighting of stereochemical information in medium-resolution refinements is often overemphasized at the cost of the experimental information in the diffraction data. A correctly set balance will be reflected in root-mean-square deviations from ideal bond lengths in the range 0.015-0.020 A for structures refined to R factors of 0.15-0.20. At ultrahigh resolution, however, the diffraction terms should be allowed to dominate, with even higher acceptable deviations from idealized standards in the well defined fragments of the protein. It is postulated that modern refinement programs should accommodate variable restraint weights that are dependent on the occupancies and B factors of the atoms involved.
对蛋白质数据库(Protein Data Bank)和剑桥结构数据库(Cambridge Structural Database)进行了分析,目的是验证蛋白质结构优化中最常用的约束条件在引入15年后是否仍然适用。通过对十个最高分辨率蛋白质结构的有序片段中选定的主链参数进行分析,得出结论:一些当前使用的几何目标值应进行一定程度的调整(C-N键和N-C(α)-C角)或降低其权重(肽平面性)。还发现,在中等分辨率优化中,立体化学信息的权重往往被过度强调,而牺牲了衍射数据中的实验信息。对于精修至R因子为0.15 - 0.20的结构,正确设定的平衡将体现在与理想键长的均方根偏差在0.015 - 0.020 Å范围内。然而,在超高分辨率下,衍射项应占主导地位,在蛋白质明确界定的片段中,与理想化标准的可接受偏差甚至更高。据推测,现代优化程序应适应取决于所涉及原子占有率和B因子的可变约束权重。