Giordano Rita, Leal Ricardo M F, Bourenkov Gleb P, McSweeney Sean, Popov Alexander N
ESRF, 6 Rue Jules Horowitz, 38043 Grenoble, France.
Acta Crystallogr D Biol Crystallogr. 2012 Jun;68(Pt 6):649-58. doi: 10.1107/S0907444912006841. Epub 2012 May 17.
It is generally assumed that the quality of X-ray diffraction data can be improved by merging data sets from several crystals. However, this effect is only valid if the data sets used are from crystals that are structurally identical. It is found that frozen macromolecular crystals very often have relatively low structure identity (and are therefore not isomorphous); thus, to obtain a real gain from multi-crystal data sets one needs to make an appropriate selection of structurally similar crystals. The application of hierarchical cluster analysis, based on the matrix of the correlation coefficient between scaled intensities, is proposed for the identification of isomorphous data sets. Multi-crystal single-wavelength anomalous dispersion data sets from four different protein molecules have been probed to test the applicability of this method. The use of hierarchical cluster analysis permitted the selection of batches of data sets which when merged together significantly improved the crystallographic indicators of the merged data and allowed solution of the structure.
一般认为,通过合并来自多个晶体的数据集,可以提高X射线衍射数据的质量。然而,只有当所使用的数据集来自结构相同的晶体时,这种效果才有效。研究发现,冷冻的大分子晶体通常具有相对较低的结构同一性(因此不是同晶型的);因此,为了从多晶体数据集中获得实际收益,需要对结构相似的晶体进行适当选择。提出基于缩放强度之间的相关系数矩阵应用层次聚类分析来识别同晶型数据集。已对来自四个不同蛋白质分子的多晶体单波长反常色散数据集进行探测,以测试该方法的适用性。层次聚类分析的使用允许选择一批数据集,这些数据集合并在一起时可显著改善合并数据的晶体学指标并有助于解析结构。