Oide Mao, Sekiguchi Yuki, Fukuda Asahi, Okajima Koji, Oroguchi Tomotaka, Nakasako Masayoshi
Department of Physics, Faculty of Science and Technology, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Kanagawa 223-8522, Japan.
J Synchrotron Radiat. 2018 Sep 1;25(Pt 5):1379-1388. doi: 10.1107/S1600577518010342. Epub 2018 Aug 22.
In structure analyses of proteins in solution by using small-angle X-ray scattering (SAXS), the molecular models are restored by using ab initio molecular modeling algorithms. There can be variation among restored models owing to the loss of phase information in the scattering profiles, averaging with regard to the orientation of proteins against the direction of the incident X-ray beam, and also conformational fluctuations. In many cases, a representative molecular model is obtained by averaging models restored in a number of ab initio calculations, which possibly provide nonrealistic models inconsistent with the biological and structural information about the target protein. Here, a protocol for classifying predicted models by multivariate analysis to select probable and realistic models is proposed. In the protocol, each structure model is represented as a point in a hyper-dimensional space describing the shape of the model. Principal component analysis followed by the clustering method is applied to visualize the distribution of the points in the hyper-dimensional space. Then, the classification provides an opportunity to exclude nonrealistic models. The feasibility of the protocol was examined through the application to the SAXS profiles of four proteins.
在利用小角X射线散射(SAXS)对溶液中的蛋白质进行结构分析时,分子模型通过从头算分子建模算法进行重建。由于散射图谱中相位信息的丢失、蛋白质相对于入射X射线束方向的取向平均化以及构象波动,重建模型之间可能存在差异。在许多情况下,通过对多次从头算计算中重建的模型进行平均来获得代表性分子模型,这可能会提供与目标蛋白质的生物学和结构信息不一致的非现实模型。在此,提出了一种通过多变量分析对预测模型进行分类以选择可能且现实的模型的方案。在该方案中,每个结构模型都表示为描述模型形状的超维空间中的一个点。应用主成分分析随后结合聚类方法来可视化超维空间中各点的分布。然后,分类提供了排除非现实模型的机会。通过将该方案应用于四种蛋白质的SAXS图谱,检验了其可行性。