Fast computer search for similar DNA sequences.

Authors

Bishop M, Thompson E

Publication

Nucleic Acids Res. 1984 Jul 11;12(13):5471-4. doi: 10.1093/nar/12.13.5471.

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC318933/

Abstract

An extremely fast method of searching a nucleic acid sequence database against a probe sequence is described. The method is based on the detection of deviation from expected number and deviation from random spatial distribution of sub-sequences which are unique within a sequence, and shared between that sequence and the probe. On an IBM 3081 computer, total search of an encoded form of the EMBL nucleic acid sequence database with a 1 kbase probe sequence is completed in a few seconds. Previous best methods for a similar task required a few minutes.
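
The abstract gives only the outline of the method, so the following Python sketch is an illustration under stated assumptions rather than the authors' implementation. It assumes fixed-length sub-sequences (k-mers), treats "unique within a sequence" as "occurring exactly once in the database sequence", and uses a crude uniform-random-base model for the expected count. The function names, the choice of k, and the null model are all hypothetical.

```python
from collections import Counter


def unique_kmers(seq: str, k: int) -> dict[str, int]:
    """Map each k-mer that occurs exactly once in seq to its start position."""
    windows = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    counts = Counter(windows)
    return {kmer: i for i, kmer in enumerate(windows) if counts[kmer] == 1}


def shared_unique_positions(target: str, probe: str, k: int) -> list[int]:
    """Positions in target of k-mers that are unique in target and occur in probe."""
    probe_kmers = {probe[i:i + k] for i in range(len(probe) - k + 1)}
    hits = unique_kmers(target, k)
    return sorted(pos for kmer, pos in hits.items() if kmer in probe_kmers)


def expected_shared(target_len: int, probe_len: int, k: int) -> float:
    """Rough expected number of shared k-mers for unrelated sequences.

    Assumption (not from the paper): bases are independent and uniform over
    A, C, G, T, and the uniqueness filter is ignored, so this is only an
    approximate upper bound on the chance-level count.
    """
    n_target = max(target_len - k + 1, 0)
    n_probe = max(probe_len - k + 1, 0)
    p_hit = 1.0 - (1.0 - 4.0 ** -k) ** n_probe
    return n_target * p_hit


def mean_gap(positions: list[int]) -> float:
    """Mean distance between consecutive hits; small values suggest clustering."""
    if len(positions) < 2:
        return float("inf")
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return sum(gaps) / len(gaps)
```

In this sketch, a database sequence would be flagged when the number of positions returned by `shared_unique_positions` is well above `expected_shared`, or when `mean_gap` is much smaller than the spacing expected if the hits were scattered uniformly along the sequence. How the original method quantifies either deviation is not stated in the abstract.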

Similar articles

1

Fast computer search for similar DNA sequences.

Nucleic Acids Res. 1984 Jul 11;12(13):5471-4. doi: 10.1093/nar/12.13.5471.

2

A comprehensive sequence analysis program for the IBM personal computer.

Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):581-99. doi: 10.1093/nar/12.1part2.581.

3

Statistical significance of symmetrical and repetitive segments in DNA.

Nucleic Acids Res. 1982 Dec 20;10(24):8323-39. doi: 10.1093/nar/10.24.8323.

4

A DNA sequence analysis package for the IBM personal computer.

Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):605-14. doi: 10.1093/nar/12.1part2.605.

5

Computer programs used to aid in the selection of DNA hybridization probes.

Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):833-6. doi: 10.1093/nar/12.1part2.833.

6

Microcomputer programs for back translation of protein to DNA sequences and analysis of ambiguous DNA sequences.

Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):819-23. doi: 10.1093/nar/12.1part2.819.

7

[A rapid method of searching for homology of nucleic acid sequences].

Biofizika. 1988 Mar-Apr;33(2):229-32.

8

A common philosophy and FORTRAN 77 software package for implementing and searching sequence databases.

Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):397-407. doi: 10.1093/nar/12.1part1.397.

9

Graphic methods to determine the function of nucleic acid sequences.

Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):521-38. doi: 10.1093/nar/12.1part2.521.

10

The diagonal-traverse homology search algorithm for locating similarities between two sequences.

Nucleic Acids Res. 1984 Jan 11;12(1 Pt 2):751-66. doi: 10.1093/nar/12.1part2.751.

Cited by

1

Heuristic informational analysis of sequences.

Nucleic Acids Res. 1986 Jan 10;14(1):179-96. doi: 10.1093/nar/14.1.179.

2

A flexible method to align large numbers of biological sequences.

J Mol Evol. 1988;28(1-2):161-9. doi: 10.1007/BF02143508.

3

Isolation of a human gene with protein sequence similarity to human and murine int-1 and the Drosophila segment polarity mutant wingless.

EMBO J. 1988 Jun;7(6):1743-8. doi: 10.1002/j.1460-2075.1988.tb03003.x.

4

Overexpression and site-directed mutagenesis of the succinyl-CoA synthetase of Escherichia coli and nucleotide sequence of a gene (g30) that is adjacent to the suc operon.

Biochem J. 1989 Jun 15;260(3):737-47. doi: 10.1042/bj2600737.

References

1

An interactive graphics program for comparing and aligning nucleic acid and amino acid sequences.

Nucleic Acids Res. 1982 May 11;10(9):2951-61. doi: 10.1093/nar/10.9.2951.

2

New approaches for computer analysis of nucleic acid sequences.

Proc Natl Acad Sci U S A. 1983 Sep;80(18):5660-4. doi: 10.1073/pnas.80.18.5660.

3

Rapid similarity searches of nucleic acid and protein data banks.

Proc Natl Acad Sci U S A. 1983 Feb;80(3):726-30. doi: 10.1073/pnas.80.3.726.

4

Efficient algorithms for folding and comparing nucleic acid sequences.

Nucleic Acids Res. 1982 Jan 11;10(1):197-206. doi: 10.1093/nar/10.1.197.
