Department of Pharmacology, Penn State University College of Medicine, Hershey, PA 17033, USA.
Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27514, USA.
Nucleic Acids Res. 2019 Jun 20;47(11):5563-5572. doi: 10.1093/nar/gkz427.
RNA structural complexity and flexibility present a challenge for computational modeling efforts. Experimental information and bioinformatics data can be used as restraints to improve the accuracy of RNA tertiary structure prediction. Regarding utilization of restraints, the fundamental questions are: (i) What is the limit in prediction accuracy that one can achieve with arbitrary number of restraints? (ii) Is there a strategy for selection of the minimal number of restraints that would result in the best structural model? We address the first question by testing the limits in prediction accuracy using native contacts as restraints. To address the second question, we develop an algorithm based on the distance variation allowed by secondary structure (DVASS), which ranks restraints according to their importance to RNA tertiary structure prediction. We find that due to kinetic traps, the greatest improvement in the structure prediction accuracy is achieved when we utilize only 40-60% of the total number of native contacts as restraints. When the restraints are sorted by DVASS algorithm, using only the first 20% ranked restraints can greatly improve the prediction accuracy. Our findings suggest that only a limited number of strategically selected distance restraints can significantly assist in RNA structure modeling.
RNA 的结构复杂性和灵活性给计算建模工作带来了挑战。实验信息和生物信息学数据可用作约束条件,以提高 RNA 三级结构预测的准确性。关于约束条件的利用,基本问题是:(i)使用任意数量的约束条件可以在预测准确性上达到什么极限?(ii)是否有一种选择最小数量约束条件的策略,以得到最佳的结构模型?我们通过使用天然接触作为约束条件来测试预测准确性的极限来解决第一个问题。为了解决第二个问题,我们基于二级结构允许的距离变化(DVASS)开发了一种算法,该算法根据对 RNA 三级结构预测的重要性对约束条件进行排序。我们发现,由于动力学陷阱,当我们仅将天然接触总数的 40-60%用作约束条件时,结构预测准确性的最大提高。当按照 DVASS 算法对约束条件进行排序时,仅使用前 20%排名的约束条件就可以大大提高预测准确性。我们的研究结果表明,只有有限数量的策略性选择的距离约束条件才能显著帮助 RNA 结构建模。