Suppr超能文献

同源RNA常见折叠结构的预测。

Prediction of common folding structures of homologous RNAs.

作者信息

Han K, Kim H J

机构信息

Department of Computer Science, Rutgers University, Piscataway, NJ 08855.

出版信息

Nucleic Acids Res. 1993 Mar 11;21(5):1251-7. doi: 10.1093/nar/21.5.1251.

Abstract

We have developed an algorithm and a computer program for simultaneously folding homologous RNA sequences. Given an alignment of M homologous sequences of length N, the program performs phylogenetic comparative analysis and predicts a common secondary structure conserved in the sequences. When the structure is not uniquely determined, it infers multiple structures which appear most plausible. This method is superior to energy minimization methods in the sense that it is not sensitive to point mutation of a sequence. It is also superior to usual phylogenetic comparative methods in that it does not require manual scrutiny for covariation or secondary structures. The most plausible 1-5 structures are produced in O(MN2 + N3) time and O(N2) space, which are the same requirements as those of widely used dynamic programs based on energy minimization for folding a single sequence. This is the first algorithm probably practical both in terms of time and space for finding secondary structures of homologous RNA sequences. The algorithm has been implemented in C on a Sun SparcStation, and has been verified by testing on tRNAs, 5S rRNAs, 16S rRNAs, TAR RNAs of human immunodeficiency virus type 1 (HIV-1), and RRE RNAs of HIV-1. We have also applied the program to cis-acting packaging sequences of HIV-1, for which no generally accepted structures yet exist, and propose potentially stable structures. Simulation of the program with random sequences with the same base composition and the same degree of similarity as the above sequences shows that structures common to homologous sequences are very unlikely to occur by chance in random sequences.

摘要

我们开发了一种用于同时折叠同源RNA序列的算法和计算机程序。给定长度为N的M条同源序列的比对,该程序进行系统发育比较分析,并预测序列中保守的共同二级结构。当结构不能唯一确定时,它会推断出多个最合理的结构。该方法优于能量最小化方法,因为它对序列的点突变不敏感。它也优于常规的系统发育比较方法,因为它不需要人工检查共变或二级结构。最合理的1 - 5种结构在O(MN2 + N3)时间和O(N2)空间内生成,这与基于能量最小化折叠单条序列的广泛使用的动态程序的要求相同。这可能是第一种在时间和空间方面都切实可行的用于寻找同源RNA序列二级结构的算法。该算法已用C语言在Sun SparcStation上实现,并通过对tRNA、5S rRNA、16S rRNA、人类免疫缺陷病毒1型(HIV - 1)的TAR RNA以及HIV - 1的RRE RNA进行测试得到验证。我们还将该程序应用于HIV - 1的顺式作用包装序列,目前尚无普遍接受的结构,我们提出了可能稳定的结构。用与上述序列具有相同碱基组成和相同相似程度的随机序列对该程序进行模拟表明,同源序列共有的结构在随机序列中偶然出现的可能性非常小。

相似文献

1
Prediction of common folding structures of homologous RNAs.
Nucleic Acids Res. 1993 Mar 11;21(5):1251-7. doi: 10.1093/nar/21.5.1251.
2
RNAlign program: alignment of RNA sequences using both primary and secondary structures.
Comput Appl Biosci. 1994 Jul;10(4):389-99. doi: 10.1093/bioinformatics/10.4.389.
4
Predicting common foldings of homologous RNAs.
J Biomol Struct Dyn. 1991 Apr;8(5):1027-44. doi: 10.1080/07391102.1991.10507863.
5
Cofolga: a genetic algorithm for finding the common folding of two RNAs.
Comput Biol Chem. 2005 Apr;29(2):111-9. doi: 10.1016/j.compbiolchem.2005.02.004.
6
Prediction of common secondary structures of RNAs: a genetic algorithm approach.
Nucleic Acids Res. 2000 Feb 15;28(4):991-9. doi: 10.1093/nar/28.4.991.
7
Dynalign: an algorithm for finding the secondary structure common to two RNA sequences.
J Mol Biol. 2002 Mar 22;317(2):191-203. doi: 10.1006/jmbi.2001.5351.
8
A method for predicting common structures of homologous RNAs.
Comput Biomed Res. 1995 Feb;28(1):53-66. doi: 10.1006/cbmr.1995.1005.
9
Free energy minimization to predict RNA secondary structures and computational RNA design.
Methods Mol Biol. 2015;1269:3-16. doi: 10.1007/978-1-4939-2291-8_1.
10
RNA Sampler: a new sampling based algorithm for common RNA secondary structure prediction and structural alignment.
Bioinformatics. 2007 Aug 1;23(15):1883-91. doi: 10.1093/bioinformatics/btm272. Epub 2007 May 30.

引用本文的文献

1
Review of machine learning methods for RNA secondary structure prediction.
PLoS Comput Biol. 2021 Aug 26;17(8):e1009291. doi: 10.1371/journal.pcbi.1009291. eCollection 2021 Aug.
2
Inforna 2.0: A Platform for the Sequence-Based Design of Small Molecules Targeting Structured RNAs.
ACS Chem Biol. 2016 Jun 17;11(6):1720-8. doi: 10.1021/acschembio.6b00001. Epub 2016 Apr 20.
4
6
Comparative sequence analysis and patterns of covariation in RNA secondary structures.
Genetics. 2000 Feb;154(2):909-21. doi: 10.1093/genetics/154.2.909.
7
Site-directed mutations reveal long-range compensatory interactions in the Adh gene of Drosophila melanogaster.
Proc Natl Acad Sci U S A. 1997 Feb 4;94(3):928-33. doi: 10.1073/pnas.94.3.928.
8
Chemical and computer probing of RNA structure.
Prog Nucleic Acid Res Mol Biol. 1996;53:131-96. doi: 10.1016/s0079-6603(08)60144-0.
9
Structures of small subunit ribosomal RNAs in situ from Escherichia coli and Thermomyces lanuginosus.
Mol Cell Biochem. 1995 Jul 19;148(2):165-81. doi: 10.1007/BF00928154.
10
RNA sequence analysis using covariance models.
Nucleic Acids Res. 1994 Jun 11;22(11):2079-88. doi: 10.1093/nar/22.11.2079.

本文引用的文献

1
STRUCTURE OF A RIBONUCLEIC ACID.
Science. 1965 Mar 19;147(3664):1462-5. doi: 10.1126/science.147.3664.1462.
2
Pattern recognition in several sequences: consensus and alignment.
Bull Math Biol. 1984;46(4):515-27. doi: 10.1007/BF02459500.
3
Prokaryotic and eukaryotic 5 S RNAs: primary sequences and proposed secondary structures.
Prog Nucleic Acid Res Mol Biol. 1983;28:177-209, 251-2. doi: 10.1016/s0079-6603(08)60087-2.
5
Structure of ribosomal RNA.
Annu Rev Biochem. 1984;53:119-62. doi: 10.1146/annurev.bi.53.070184.001003.
6
Secondary structure of 16S ribosomal RNA.
Science. 1981 Apr 24;212(4493):403-11. doi: 10.1126/science.6163215.
7
Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information.
Nucleic Acids Res. 1981 Jan 10;9(1):133-48. doi: 10.1093/nar/9.1.133.
8
Three-dimensional tertiary structure of yeast phenylalanine transfer RNA.
Science. 1974 Aug 2;185(4149):435-40. doi: 10.1126/science.185.4149.435.
9
Improved estimation of secondary structure in ribonucleic acids.
Nat New Biol. 1973 Nov 14;246(150):40-1. doi: 10.1038/newbio246040a0.
10
Comparative anatomy of 16-S-like ribosomal RNA.
Prog Nucleic Acid Res Mol Biol. 1985;32:155-216. doi: 10.1016/s0079-6603(08)60348-7.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验