Bhandari Yuba R, Fan Lixin, Fang Xianyang, Zaki George F, Stahlberg Eric A, Jiang Wei, Schwieters Charles D, Stagno Jason R, Wang Yun-Xing
Protein-Nucleic Acid Interaction Section, Structural Biophysics Laboratory, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, MD 21702, United States.
Leidos Biomedical Research Inc., Frederick, MD 21702, United States.
J Mol Biol. 2017 Nov 24;429(23):3635-3649. doi: 10.1016/j.jmb.2017.09.006. Epub 2017 Sep 14.
Knowledge of RNA three-dimensional topological structures provides important insight into the relationship between RNA structural components and function. It is often likely that near-complete sets of biochemical and biophysical data containing structural restraints are not available, but one still wants to obtain knowledge about approximate topological folding of RNA. In this regard, general methods for determining such topological structures with minimum readily available restraints are lacking. Naked RNAs are difficult to crystallize and NMR spectroscopy is generally limited to small RNA fragments. By nature, sequence determines structure and all interactions that drive folding are self-contained within sequence. Nevertheless, there is little apparent correlation between primary sequences and three-dimensional folding unless supplemented with experimental or phylogenetic data. Thus, there is an acute need for a robust high-throughput method that can rapidly determine topological structures of RNAs guided by some experimental data. We present here a novel method (RS3D) that can assimilate the RNA secondary structure information, small-angle X-ray scattering data, and any readily available tertiary contact information to determine the topological fold of RNA. Conformations are firstly sampled at glob level where each glob represents a nucleotide. Best-ranked glob models can be further refined against solvent accessibility data, if available, and then converted to explicit all-atom coordinates for refinement against SAXS data using the Xplor-NIH program. RS3D is widely applicable to a variety of RNA folding architectures currently present in the structure database. Furthermore, we demonstrate applicability and feasibility of the program to derive low-resolution topological structures of relatively large multi-domain RNAs.
对RNA三维拓扑结构的了解为洞察RNA结构成分与功能之间的关系提供了重要线索。通常情况下,可能无法获得包含结构限制的近乎完整的生化和生物物理数据集,但人们仍然希望获取有关RNA近似拓扑折叠的知识。在这方面,缺乏用最少的现成限制来确定此类拓扑结构的通用方法。裸露的RNA难以结晶,核磁共振光谱法通常仅限于小RNA片段。从本质上讲,序列决定结构,所有驱动折叠的相互作用都包含在序列中。然而,除非辅以实验或系统发育数据,否则一级序列与三维折叠之间几乎没有明显的相关性。因此,迫切需要一种强大的高通量方法,能够在一些实验数据的指导下快速确定RNA的拓扑结构。我们在此提出一种新方法(RS3D),它可以整合RNA二级结构信息、小角X射线散射数据以及任何现成的三级接触信息,以确定RNA的拓扑折叠。首先在球状体水平上对构象进行采样,其中每个球状体代表一个核苷酸。如果有溶剂可及性数据,排名最佳的球状体模型可以进一步根据该数据进行优化,然后使用Xplor-NIH程序转换为明确的全原子坐标,以根据小角X射线散射数据进行优化。RS3D广泛适用于目前结构数据库中存在的各种RNA折叠结构。此外,我们证明了该程序推导相对较大的多结构域RNA低分辨率拓扑结构的适用性和可行性。