School of Life Sciences, Tsinghua-Peking Joint Center for Life Sciences, Beijing Advanced Innovation Center for Structural Biology, Tsinghua University, Beijing 100084, China.
Nucleic Acids Res. 2021 May 7;49(8):4294-4307. doi: 10.1093/nar/gkab250.
RNA structures play a fundamental role in nearly every aspect of cellular physiology and pathology. Gaining insights into the functions of RNA molecules requires accurate predictions of RNA secondary structures. However, the existing thermodynamic folding models remain less accurate than desired, even when chemical probing data, such as selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) reactivities, are used as restraints. Unlike most SHAPE-directed algorithms that only consider SHAPE restraints for base pairing, we extract two-dimensional structural features encoded in SHAPE data and establish robust relationships between characteristic SHAPE patterns and loop motifs of various types (hairpin, internal, and bulge) and lengths (2-11 nucleotides). Such characteristic SHAPE patterns are closely related to the sugar pucker conformations of loop residues. Based on these patterns, we propose a computational method, SHAPELoop, which refines the predicted results of the existing methods, thereby further improving their prediction accuracy. In addition, SHAPELoop can provide information about local or global structural rearrangements (including pseudoknots) and help researchers to easily test their hypothesized secondary structures.
RNA 结构在细胞生理学和病理学的几乎各个方面都起着至关重要的作用。深入了解 RNA 分子的功能需要准确预测 RNA 二级结构。然而,即使使用化学探测数据(例如选择性 2'-羟基乙酰化分析引物延伸(SHAPE)反应性)作为约束条件,现有的热力学折叠模型仍然不够准确。与大多数仅考虑碱基配对的 SHAPE 定向算法不同,我们从 SHAPE 数据中提取二维结构特征,并在各种类型(发夹、内部和凸起)和长度(2-11 个核苷酸)的环基序之间建立特征 SHAPE 模式和特征之间的稳健关系。这些特征性的 SHAPE 模式与环残基的糖 puck 构象密切相关。基于这些模式,我们提出了一种计算方法 SHAPELoop,它可以改进现有方法的预测结果,从而进一步提高其预测准确性。此外,SHAPELoop 可以提供有关局部或全局结构重排(包括假结)的信息,并帮助研究人员轻松测试其假设的二级结构。