Kleywegt G J
Biomedical Centre, Uppsala University, Uppsala, SE-751 24, Sweden.
J Mol Biol. 1997 Oct 24;273(2):371-6. doi: 10.1006/jmbi.1997.1309.
The Protein Data Bank contains a number of structures for which only the coordinates of the Calpha atoms have been deposited. Although many tools are available for the validation of all-atom protein models, hardly any of these can be used to assess the quality of models for which only Calpha coordinates are available. Two rapid and simple tests to assess the quality of the Calpha backbone of a protein model are described, one based on the distribution of Calpha-Calpha distances, and the other on the two-dimensional distribution of the angles and dihedrals formed by sequential Calpha atoms. Expected distributions were derived by analysing a set of 1343 high-resolution, all-atom protein models. The distance criterion is useful to discriminate between refined and unrefined models, whereas the angle/dihedral criterion can be used to discriminate between normal and possibly problematic Calpha models. The method has been applied to a set of 88 Calpha-only models from the Protein Data Bank. The tracing of two of the models that are outliers in this analysis has recently been shown to be incorrect. Other applications of the method are discussed.
蛋白质数据库包含许多仅沉积了α碳原子坐标的结构。尽管有许多工具可用于验证全原子蛋白质模型,但几乎没有任何一个工具可用于评估仅提供α碳坐标的模型的质量。本文描述了两种快速且简单的测试方法,用于评估蛋白质模型的α碳骨架质量,一种基于α碳-α碳距离的分布,另一种基于连续α碳原子形成的角度和二面角的二维分布。通过分析一组1343个高分辨率全原子蛋白质模型得出了预期分布。距离标准有助于区分精制模型和未精制模型,而角度/二面角标准可用于区分正常的和可能有问题的α碳模型。该方法已应用于蛋白质数据库中的一组88个仅含α碳的模型。最近已证明在该分析中作为异常值的两个模型的追踪是不正确的。还讨论了该方法的其他应用。