Efficient algorithms for folding and comparing nucleic acid sequences.

Authors

Dumas J P, Ninio J

Publication information

Nucleic Acids Res. 1982 Jan 11;10(1):197-206. doi: 10.1093/nar/10.1.197.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC326126/

Abstract

Fast algorithms for analysing sequence data are presented. An algorithm for strict homologies finds all common subsequences of length greater than or equal to 6 in two given sequences. With it, nucleic acid pieces five thousand nucleotides long can be compared in five seconds on a CDC 6600. Secondary structure algorithms generate the N most stable secondary structures of an RNA molecule, taking into account all loop contributions, and the formation of all possible base-pairs in stems, including odd pairs (G·G, C·U, etc.). They allow a typical 100-nucleotide sequence to be analysed in 10 seconds. The homology and secondary structure programs are respectively illustrated with a comparison of two phage genomes, and a discussion of Drosophila melanogaster 5S RNA folding.
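
The abstract does not say how the strict-homology search is implemented, so the following is only a minimal sketch of one common way to find every exact match of length 6 or more between two sequences: index all 6-mers of one sequence in a hash table, look up each 6-mer of the other, and extend each hit to the right. The function name `common_substrings`, its parameters, and the hash-index approach are assumptions made for illustration, not the authors' method.

```python
from collections import defaultdict


def common_substrings(a, b, min_len=6):
    """Return maximal exact matches as (start_in_a, start_in_b, length).

    Illustrative sketch only: a k-mer hash index with rightward extension,
    not the algorithm of the paper (which the abstract does not describe).
    """
    # Index every min_len-mer of b by its start position.
    index = defaultdict(list)
    for j in range(len(b) - min_len + 1):
        index[b[j:j + min_len]].append(j)

    matches = []
    for i in range(len(a) - min_len + 1):
        for j in index.get(a[i:i + min_len], ()):
            # A seed whose left neighbours also match is the continuation of
            # a match that started earlier on the same diagonal; skip it.
            if i > 0 and j > 0 and a[i - 1] == b[j - 1]:
                continue
            # Extend the seed to the right while characters keep matching.
            length = min_len
            while (i + length < len(a) and j + length < len(b)
                   and a[i + length] == b[j + length]):
                length += 1
            matches.append((i, j, length))
    return matches


if __name__ == "__main__":
    # The two toy sequences share the 7-mer "ACTGATC" at offset 3 in each.
    print(common_substrings("GGGACTGATCCA", "TTTACTGATCTT"))
```

Building the index takes time proportional to the length of one sequence and each lookup is a constant-time hash probe on average, so the cost is dominated by the number of seed hits rather than by the product of the two sequence lengths.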
