Structural Genomics Consortium, University of Oxford, Roosevelt Drive, Oxford OX3 7DQ, England.
Acta Crystallogr D Struct Biol. 2017 Mar 1;73(Pt 3):256-266. doi: 10.1107/S2059798317003412. Epub 2017 Mar 6.
Although noncovalent binding by small molecules cannot be assumed a priori to be stoichiometric in the crystal lattice, occupancy refinement of ligands is often avoided by convention. Occupancies tend to be set to unity, requiring the occupancy error to be modelled by the B factors, and residual weak density around the ligand is necessarily attributed to disorder'. Where occupancy refinement is performed, the complementary, superposed unbound state is rarely modelled. Here, it is shown that superior accuracy is achieved by modelling the ligand as partially occupied and superposed on a ligand-free ground-state' model. Explicit incorporation of this model of the crystal, obtained from a reference data set, allows constrained occupancy refinement with minimal fear of overfitting. Better representation of the crystal also leads to more meaningful refined atomic parameters such as the B factor, allowing more insight into dynamics in the crystal. An outline of an approach for algorithmically generating ensemble models of crystals is presented, assuming that data sets representing the ground state are available. The applicability of various electron-density metrics to the validation of the resulting models is assessed, and it is concluded that ensemble models consistently score better than the corresponding single-state models. Furthermore, it appears that ignoring the superposed ground state becomes the dominant source of model error, locally, once the overall model is accurate enough; modelling the local ground state properly is then more meaningful than correcting all remaining model errors globally, especially for low-occupancy ligands. Implications for the simultaneous refinement of B factors and occupancies, and for future evaluation of the limits of the approach, in particular its behaviour at lower data resolution, are discussed.
虽然小分子的非共价键结合不能先验地假定在晶格中是化学计量的,但根据惯例,配体的占有率精修通常被避免。占有率往往被设置为 1,需要用 B 因子来模拟占有率误差,并且配体周围必然存在的残留弱密度被归因于“无序”。在进行占有率精修的情况下,很少对互补的、重叠的未结合态进行建模。在这里,通过将配体建模为部分占据并重叠在无配体的“基态”模型上,证明了可以实现更高的精度。通过显式包含来自参考数据集的这种晶体模型,可以进行约束占有率精修,而不必担心过度拟合。更好地表示晶体也会导致更有意义的精细原子参数,如 B 因子,从而可以更深入地了解晶体中的动力学。提出了一种用于算法生成晶体整体模型的方法概述,假设可用代表基态的数据集。评估了各种电子密度度量标准对所得模型的验证的适用性,并得出结论,整体模型足够准确后,整体模型始终比相应的单态模型得分更好。此外,似乎一旦整体模型足够准确,忽略重叠的基态就会成为模型误差的主要来源,特别是对于低占有率的配体,正确地模拟局部基态比全局纠正所有剩余的模型误差更有意义。讨论了同时精修 B 因子和占有率的影响,以及对该方法的未来评估的影响,特别是其在较低数据分辨率下的行为。