Prediction of common folding structures of homologous RNAs.

Authors

Han K, Kim H J

Affiliation

Department of Computer Science, Rutgers University, Piscataway, NJ 08855.

Publication information

Nucleic Acids Res. 1993 Mar 11;21(5):1251-7. doi: 10.1093/nar/21.5.1251.

DOI:10.1093/nar/21.5.1251

PMID:7681944

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC309290/

Abstract

We have developed an algorithm and a computer program for simultaneously folding homologous RNA sequences. Given an alignment of M homologous sequences of length N, the program performs phylogenetic comparative analysis and predicts a common secondary structure conserved in the sequences. When the structure is not uniquely determined, it infers multiple structures which appear most plausible. This method is superior to energy minimization methods in the sense that it is not sensitive to point mutation of a sequence. It is also superior to usual phylogenetic comparative methods in that it does not require manual scrutiny for covariation or secondary structures. The most plausible 1-5 structures are produced in O(MN^2 + N^3) time and O(N^2) space, which are the same requirements as those of widely used dynamic programs based on energy minimization for folding a single sequence. This is the first algorithm probably practical both in terms of time and space for finding secondary structures of homologous RNA sequences. The algorithm has been implemented in C on a Sun SparcStation, and has been verified by testing on tRNAs, 5S rRNAs, 16S rRNAs, TAR RNAs of human immunodeficiency virus type 1 (HIV-1), and RRE RNAs of HIV-1. We have also applied the program to cis-acting packaging sequences of HIV-1, for which no generally accepted structures yet exist, and propose potentially stable structures. Simulation of the program with random sequences with the same base composition and the same degree of similarity as the above sequences shows that structures common to homologous sequences are very unlikely to occur by chance in random sequences.
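The abstract gives the time and space bounds but not the algorithm's details. The following is a minimal sketch, not the authors' program, of one way bounds of this shape can arise. It first counts, for every pair of alignment columns, how many sequences can form a canonical base pair there (O(MN^2) time), then runs a Nussinov-style dynamic program over those counts (O(N^3) time, O(N^2) space). The names `pair_support`, `fold_common`, `min_loop`, and `min_support` are illustrative assumptions, and the scoring rule (total support, with every sequence required to be compatible by default) is a simplification chosen for this example.

```python
# Watson-Crick pairs plus the G-U wobble pair; gaps ('-') never pair.
CANONICAL = {("A", "U"), ("U", "A"), ("G", "C"),
             ("C", "G"), ("G", "U"), ("U", "G")}


def pair_support(alignment, min_loop=3):
    """support[i][j] = number of sequences whose columns i and j can pair.

    Runs in O(M * N^2) time for M aligned sequences of length N.
    """
    n = len(alignment[0])
    support = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + min_loop + 1, n):
            support[i][j] = sum((s[i], s[j]) in CANONICAL for s in alignment)
    return support


def fold_common(alignment, min_loop=3, min_support=None):
    """Return a dot-bracket string for one maximum-support nested structure."""
    n = len(alignment[0])
    if min_support is None:
        min_support = len(alignment)  # assumption: all sequences must agree
    support = pair_support(alignment, min_loop)

    # best[i][j] = maximum total support of nested pairs within columns i..j.
    best = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            value = max(best[i + 1][j], best[i][j - 1])  # i or j unpaired
            if support[i][j] >= min_support:             # i pairs with j
                value = max(value, best[i + 1][j - 1] + support[i][j])
            for k in range(i + 1, j):                    # two adjacent substructures
                value = max(value, best[i][k] + best[k + 1][j])
            best[i][j] = value

    # Trace back one optimal structure.
    structure = ["."] * n
    stack = [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if j - i <= min_loop or best[i][j] == 0:
            continue
        if best[i][j] == best[i + 1][j]:
            stack.append((i + 1, j))
        elif best[i][j] == best[i][j - 1]:
            stack.append((i, j - 1))
        elif (support[i][j] >= min_support
              and best[i][j] == best[i + 1][j - 1] + support[i][j]):
            structure[i], structure[j] = "(", ")"
            stack.append((i + 1, j - 1))
        else:
            for k in range(i + 1, j):
                if best[i][j] == best[i][k] + best[k + 1][j]:
                    stack.append((i, k))
                    stack.append((k + 1, j))
                    break
    return "".join(structure)


# Three toy sequences whose stem columns covary (G-C, C-G, A-U).
toy_alignment = ["GGGAAAACCC", "GCGAAAACGC", "AGGAAAACCU"]
common_structure = fold_common(toy_alignment)
```

In this toy alignment the stem columns differ between sequences but stay complementary, so the sketch recovers the shared hairpin even though the individual sequences are not identical. Unlike the program described in the abstract, it returns a single structure rather than a ranked set of alternatives.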

