School of Mathematical Sciences, Nankai University, Tianjin 300071, China.
Center for Applied Mathematics, Tianjin University, Tianjin 300072, China.
Bioinformatics. 2021 May 23;37(8):1093-1098. doi: 10.1093/bioinformatics/btaa932.
Recent years have witnessed that the inter-residue contact/distance in proteins could be accurately predicted by deep neural networks, which significantly improve the accuracy of predicted protein structure models. In contrast, fewer studies have been done for the prediction of RNA inter-nucleotide 3D closeness.
We proposed a new algorithm named RNAcontact for the prediction of RNA inter-nucleotide 3D closeness. RNAcontact was built based on the deep residual neural networks. The covariance information from multiple sequence alignments and the predicted secondary structure were used as the input features of the networks. Experiments show that RNAcontact achieves the respective precisions of 0.8 and 0.6 for the top L/10 and L (where L is the length of an RNA) predictions on an independent test set, significantly higher than other evolutionary coupling methods. Analysis shows that about 1/3 of the correctly predicted 3D closenesses are not base pairings of secondary structure, which are critical to the determination of RNA structure. In addition, we demonstrated that the predicted 3D closeness could be used as distance restraints to guide RNA structure folding by the 3dRNA package. More accurate models could be built by using the predicted 3D closeness than the models without using 3D closeness.
The webserver and a standalone package are available at: http://yanglab.nankai.edu.cn/RNAcontact/.
Supplementary data are available at Bioinformatics online.
近年来,深度神经网络已经能够准确地预测蛋白质中的残基间接触/距离,这显著提高了预测蛋白质结构模型的准确性。相比之下,对于 RNA 核苷酸间三维接近度的预测,研究较少。
我们提出了一种新的算法,名为 RNAcontact,用于预测 RNA 核苷酸间三维接近度。RNAcontact 是基于深度残差神经网络构建的。网络的输入特征包括来自多序列比对的协方差信息和预测的二级结构。实验表明,在独立测试集上,RNAcontact 在 top L/10 和 L(其中 L 是 RNA 的长度)预测中分别达到了 0.8 和 0.6 的精度,显著高于其他进化耦合方法。分析表明,约 1/3 的正确预测三维接近度不是二级结构的碱基对,这对 RNA 结构的确定至关重要。此外,我们证明了预测的三维接近度可以作为距离约束,通过 3dRNA 包来指导 RNA 结构折叠。与不使用三维接近度的模型相比,使用预测的三维接近度可以构建更准确的模型。
该网络服务器和独立软件包可在:http://yanglab.nankai.edu.cn/RNAcontact/ 获得。
补充数据可在生物信息学在线获得。