Improving the efficiency of dot-matrix similarity searches through use of an oligomer table.

Author information

Fristensky B

Publication information

Nucleic Acids Res. 1986 Jan 10;14(1):597-610. doi: 10.1093/nar/14.1.597.

Abstract

Dot-matrix sequence similarity searches can be greatly accelerated by using a table that lists all locations of short oligomers in one of the sequences to find potential similarities with a second sequence. The algorithm described finds similarities between two sequences of lengths M and N, comparing L residues at a time, with an efficiency of L × M × N / S^k, where S is the alphabet size and k is the length of the oligomer. For nucleic acids, in which S = 4, use of a tetranucleotide table (k = 4) results in an efficiency of L × M × N / 256, since 4^4 = 256. The simplicity of the approach allows for a straightforward calculation of the level of similarities expected to be found for given search parameters. Furthermore, the storage required is minimal, allowing even large sequences to be compared on small microcomputers. Theoretical considerations regarding the use of this search are discussed.
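
The table-lookup idea in the abstract can be illustrated with a minimal Python sketch. This is not the paper's implementation: the function names (`build_oligomer_table`, `dot_matrix_hits`), the `min_match` threshold, and the choice to start the L-residue window at the seed position are all assumptions made for illustration. The sketch indexes every k-mer of one sequence, then scans the second sequence and runs the window comparison only where a k-mer is shared, which is where the roughly S^k-fold reduction in comparisons would come from.

```python
from collections import defaultdict


def build_oligomer_table(seq, k):
    """Map every k-mer in seq to the list of positions where it starts."""
    table = defaultdict(list)
    for i in range(len(seq) - k + 1):
        table[seq[i:i + k]].append(i)
    return table


def dot_matrix_hits(seq_a, seq_b, k=4, window=8, min_match=6):
    """Return sorted (i, j) dots seeded by a shared k-mer.

    A dot is reported when the window-length stretches starting at
    seq_a[i] and seq_b[j] share at least min_match identical residues.
    The threshold and window placement are illustrative assumptions.
    """
    table = build_oligomer_table(seq_a, k)
    hits = set()
    for j in range(len(seq_b) - k + 1):
        # Only positions in seq_a that share this k-mer are examined.
        for i in table.get(seq_b[j:j + k], ()):
            a_win = seq_a[i:i + window]
            b_win = seq_b[j:j + window]
            if len(a_win) < window or len(b_win) < window:
                continue  # window runs off the end of a sequence
            matches = sum(x == y for x, y in zip(a_win, b_win))
            if matches >= min_match:
                hits.add((i, j))
    return sorted(hits)


if __name__ == "__main__":
    print(dot_matrix_hits("ACGTACGTTT", "GGACGTACGA", k=4, window=6, min_match=5))
```

For two uniformly random nucleotide sequences, a given pair of positions shares a tetranucleotide with probability 1/256, so only about that fraction of the M × N cells would trigger a window comparison in this sketch.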
