Tickle I J, Laskowski R A, Moss D S
Department of Crystallography, Birkbeck College, Malet Street, London WC1E 7HX, England.
Acta Crystallogr D Biol Crystallogr. 1998 Jul 1;54(Pt 4):547-57. doi: 10.1107/s0907444997013875.
The last five years have seen a large increase in the use of cross validation in the refinement of macromolecular structures using X-ray data. In this technique a test set of reflections is set aside from the working set and the progress of the refinement is monitored by the calculation of a free R factor which is based only on the excluded reflections. This paper gives estimates for the ratio of the free R factor to the R factor calculated from the working set for both unrestrained and restrained refinement. It is assumed that both the X-ray and restraint observations have been weighted correctly and that there is no correlation of errors between the test and working sets. It is also shown that the least-squares weights that minimize the variances of the refined parameters, also approximately minimize the free R factor. The estimated free R-factor ratios are compared with those reported for structures in the Protein Data Bank.
在过去五年中,在利用X射线数据优化大分子结构时,交叉验证的使用大幅增加。在这项技术中,从工作集中留出一组测试反射,并通过计算仅基于排除反射的自由R因子来监测优化的进展。本文给出了无约束和有约束优化时自由R因子与从工作集计算得到的R因子之比的估计值。假设X射线观测值和约束观测值都已正确加权,并且测试集和工作集之间不存在误差相关性。还表明,使精修参数方差最小化的最小二乘权重,也近似使自由R因子最小化。将估计的自由R因子比率与蛋白质数据库中报道的结构的比率进行了比较。