Biophysics Program, Stanford University, Stanford, CA 94305.
Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305.
Proc Natl Acad Sci U S A. 2019 Apr 23;116(17):8336-8341. doi: 10.1073/pnas.1819047116. Epub 2019 Apr 8.
Interactions between RNA and proteins are pervasive in biology, driving fundamental processes such as protein translation and participating in the regulation of gene expression. Modeling the energies of RNA-protein interactions is therefore critical for understanding and repurposing living systems but has been hindered by complexities unique to RNA-protein binding. Here, we bring together several advances to complete a calculation framework for RNA-protein binding affinities, including a unified free energy function for bound complexes, automated Rosetta modeling of mutations, and use of secondary structure-based energetic calculations to model unbound RNA states. The resulting Rosetta-Vienna RNP-ΔΔG method achieves root-mean-squared errors (RMSEs) of 1.3 kcal/mol on high-throughput MS2 coat protein-RNA measurements and 1.5 kcal/mol on an independent test set involving the signal recognition particle, human U1A, PUM1, and FOX-1. As a stringent test, the method achieves RMSE accuracy of 1.4 kcal/mol in blind predictions of hundreds of human PUM2-RNA relative binding affinities. Overall, these RMSE accuracies are significantly better than those attained by prior structure-based approaches applied to the same systems. Importantly, Rosetta-Vienna RNP-ΔΔG establishes a framework for further improvements in modeling RNA-protein binding that can be tested by prospective high-throughput measurements on new systems.
RNA 与蛋白质之间的相互作用在生物学中普遍存在,驱动着蛋白质翻译等基本过程,并参与基因表达的调控。因此,对 RNA-蛋白质相互作用的能量进行建模对于理解和重新利用生命系统至关重要,但由于 RNA-蛋白质结合的独特复杂性而受到阻碍。在这里,我们结合了几项进展,完成了一个计算 RNA-蛋白质结合亲和力的框架,包括结合复合物的统一自由能函数、突变的自动罗塞塔建模,以及使用基于二级结构的能量计算来模拟未结合的 RNA 状态。由此产生的 Rosetta-Vienna RNP-ΔΔG 方法在 MS2 外壳蛋白-RNA 的高通量测量中实现了 1.3 kcal/mol 的均方根误差 (RMSE),在涉及信号识别颗粒、人 U1A、PUM1 和 FOX-1 的独立测试集中实现了 1.5 kcal/mol 的 RMSE。作为一个严格的测试,该方法在对数百个人类 PUM2-RNA 相对结合亲和力的盲预测中达到了 1.4 kcal/mol 的 RMSE 准确性。总的来说,这些 RMSE 精度明显优于应用于相同系统的先前基于结构的方法所达到的精度。重要的是,Rosetta-Vienna RNP-ΔΔG 为进一步改进 RNA-蛋白质结合的建模建立了一个框架,可以通过对新系统进行前瞻性的高通量测量来进行测试。