Li Yan
Department of Applied Chemistry, Ocean University of China, Qingdao 266003, P. R. China.
J Chem Inf Model. 2006 Jul-Aug;46(4):1742-50. doi: 10.1021/ci050463u.
This work describes the application of a Bayesian method for clustering protein conformations sampled during a molecular dynamics simulation of the HIV-1 integrase catalytic core. A clustering analysis is carried out under the assumption of normal distribution without fixing the number of clusters in advance. Some performance measures, such as posterior probability and class cross entropy, are used to determine the most probable set of clusters. The Bayesian clustering method results in meaningful groups identifying transitions between conformational ensembles. The dihedral angles involved in such transitions are also examined in detail. The conformations in high dimensional space are projected into 3D space employing a multidimensional scaling technique to provide a visual inspection.
这项工作描述了一种贝叶斯方法在对HIV-1整合酶催化核心进行分子动力学模拟期间对采样的蛋白质构象进行聚类的应用。在正态分布假设下进行聚类分析,且不预先确定聚类数量。使用一些性能指标,如后验概率和类交叉熵,来确定最可能的聚类集。贝叶斯聚类方法产生了有意义的组,识别出构象集合之间的转变。还详细研究了此类转变中涉及的二面角。采用多维缩放技术将高维空间中的构象投影到三维空间中,以进行可视化检查。