Chen Z, Blanc E, Chapman M S
Department of Chemistry and Institute of Molecular Biophysics, Florida State University, Tallahassee, FL 32306-3015, USA.
Acta Crystallogr D Biol Crystallogr. 1999 Jan;55(Pt 1):219-24. doi: 10.1107/S0907444998007331. Epub 1999 Jan 1.
Improvements in free R cross-validation are based on changed scaling procedures and the use, in map calculation, of estimates of the validation amplitudes which are independent of the actual observed values. The deleterious effects of the omitted test data are mitigated by reduction of the test-set size, which is made possible by constraining test and working sets to share the same scaling coefficients, thereby reducing the degrees of freedom and the dependence of free R on data selection. Further improvements come with use of a modified free R factor, R freeTA. Instead of omitting the validation reflections from map calculation, their amplitudes are replaced by the average of resolution peers that is (nearly) independent of the actual cross-validation amplitudes. The improvements are relevant to model building, phase refinement by density modification and especially to real-space refinement. Although for real data at about 3 A resolution, free R factors of about 0.25 are affected little, the precision of the structure is improved by about 0.1 A. Tests with simulated data show that with good agreement between observed and calculated amplitudes (as in very high resolution studies or simulated refinement tests), free R factors can be improved by factors greater than two.
自由R交叉验证的改进基于改变的缩放程序,以及在图谱计算中使用与实际观测值无关的验证振幅估计值。通过减小测试集大小减轻了省略测试数据的有害影响,这通过约束测试集和工作集共享相同的缩放系数得以实现,从而减少自由度以及自由R对数据选择的依赖性。进一步的改进来自于使用改进的自由R因子R freeTA。在图谱计算中,不是省略验证反射,而是将其振幅替换为分辨率相当的反射的平均值,该平均值(几乎)与实际交叉验证振幅无关。这些改进与模型构建、通过密度修改进行相位精修特别是与实空间精修相关。虽然对于约3 Å分辨率的实际数据,约0.25的自由R因子受影响较小,但结构精度提高了约0.1 Å。对模拟数据的测试表明,当观测振幅与计算振幅吻合良好时(如在非常高分辨率的研究或模拟精修测试中),自由R因子可提高两倍以上。