Chauvot de Beauchene Isaure, de Vries Sjoerd J, Zacharias Martin
Physik-Department T38, Technische Universität München, Garching, Germany.
PLoS Comput Biol. 2016 Jan 27;12(1):e1004697. doi: 10.1371/journal.pcbi.1004697. eCollection 2016 Jan.
Protein-RNA docking is hampered by the high flexibility of RNA, and particularly single-stranded RNA (ssRNA). Yet, ssRNA regions typically carry the specificity of protein recognition. The lack of methodology for modeling such regions limits the accuracy of current protein-RNA docking methods. We developed a fragment-based approach to model protein-bound ssRNA, based on the structure of the protein and the sequence of the RNA, without any prior knowledge of the RNA binding site or the RNA structure. The conformational diversity of each fragment is sampled by an exhaustive RNA fragment library that was created from all the existing experimental structures of protein-ssRNA complexes. A systematic and detailed analysis of fragment-based ssRNA docking was performed which constitutes a proof-of-principle for the fragment-based approach. The method was tested on two 8-homo-nucleotide ssRNA-protein complexes and was able to identify the binding site on the protein within 10 Å. Moreover, a structure of each bound ssRNA could be generated in close agreement with the crystal structure with a mean deviation of ~1.5 Å except for a terminal nucleotide. This is the first time a bound ssRNA could be modeled from sequence with high precision.
蛋白质-RNA对接受到RNA高度灵活性的阻碍,尤其是单链RNA(ssRNA)。然而,ssRNA区域通常携带蛋白质识别的特异性。缺乏对这类区域进行建模的方法限制了当前蛋白质-RNA对接方法的准确性。我们基于蛋白质的结构和RNA的序列,开发了一种基于片段的方法来对与蛋白质结合的ssRNA进行建模,无需事先了解RNA结合位点或RNA结构。通过一个详尽的RNA片段文库对每个片段的构象多样性进行采样,该文库由蛋白质-ssRNA复合物的所有现有实验结构创建而成。对基于片段的ssRNA对接进行了系统而详细的分析,这构成了基于片段方法的原理验证。该方法在两个8个同核苷酸的ssRNA-蛋白质复合物上进行了测试,能够在10 Å范围内识别蛋白质上的结合位点。此外,除了一个末端核苷酸外,每个结合的ssRNA的结构都能以与晶体结构高度一致的方式生成,平均偏差约为1.5 Å。这是首次能够从序列高精度地对结合的ssRNA进行建模。