School of Electrical and Computer Engineering, Georgia Institute of Technology, UA Whitaker Building, 313 Ferst Drive, Atlanta, GA 30332-0535, USA.
IEEE/ACM Trans Comput Biol Bioinform. 2011 Nov-Dec;8(6):1604-19. doi: 10.1109/TCBB.2010.128.
The local conformation of RNA molecules is an important factor in determining their catalytic and binding properties. The analysis of such conformations is particularly difficult due to the large number of degrees of freedom, such as the measured torsion angles per residue and the interatomic distances among interacting residues. In this work, we use a nearest-neighbor search method based on the statistical mechanical Potts model to find clusters in the RNA conformational space. The proposed technique is mostly automatic and may be applied to problems, where there is no prior knowledge on the structure of the data space in contrast to many other clustering techniques. Results are reported for both single residue conformations, where the parameter set of the data space includes four to seven torsional angles, and base pair geometries, where the data space is reduced to two dimensions. Moreover, new results are reported for base stacking geometries. For the first two cases, i.e., single residue conformations and base pair geometries, we get a very good match between the results of the proposed clustering method and the known classifications with only few exceptions. For the case of base stacking geometries, we validate our classification with respect to geometrical constraints and describe the content, and the geometry of the new clusters.
RNA 分子的局部构象是决定其催化和结合特性的重要因素。由于自由度数量众多,例如每个残基的测量扭转角和相互作用残基之间的原子间距离,因此分析这种构象特别困难。在这项工作中,我们使用基于统计力学 Potts 模型的最近邻搜索方法在 RNA 构象空间中找到聚类。所提出的技术主要是自动的,并且可以应用于与许多其他聚类技术不同的问题,在这些问题中,没有关于数据空间结构的先验知识。报告了两种结果,一种是单个残基构象,其中数据空间的参数集包括四个到七个扭转角,另一种是碱基对几何形状,其中数据空间减少到二维。此外,还报告了碱基堆积几何形状的新结果。对于前两种情况,即单个残基构象和碱基对几何形状,我们发现所提出的聚类方法的结果与已知分类之间非常吻合,只有少数例外。对于碱基堆积几何形状的情况,我们根据几何约束验证了我们的分类,并描述了新聚类的内容和几何形状。