• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

具有有限存储的生物序列模式匹配

Pattern matching of biological sequences with limited storage.

作者信息

Gotoh O

机构信息

Department of Biochemistry, Saitama Cancer Center Research Institute, Japan.

出版信息

Comput Appl Biosci. 1987 Mar;3(1):17-20. doi: 10.1093/bioinformatics/3.1.17.

DOI:10.1093/bioinformatics/3.1.17
PMID:3453210
Abstract

Existing methods for getting the locally best matched alignments between a pair of biological sequences require O(N2) computational steps and O(N2) storage, where N is the average sequence length. An improved method is presented with which the storage requirement is greatly reduced, while the computational steps remain O(N2). Only a small number of additional steps are required to display any common sub-sequences with similarity scores greater than a given threshold. The aligments found by the algorithm are optimal in the sense that their scores are locally maximal, where each score is a sum of weights given to individual matches/replacements, insertions and deletions involved in the alignment. The algorithm was implemented in C programming language on a personal computer. Data area of 64 kbytes on random access memory and a few hundred kbytes on a disk is sufficient for comparing two protein or nucleic acid sequences of 2500 residues. The programs are particularly valuable when used in combination with fast sequence search programs.

摘要

现有用于获取一对生物序列之间局部最佳匹配比对的方法需要O(N2)的计算步骤和O(N2)的存储空间,其中N是平均序列长度。本文提出了一种改进方法,该方法在计算步骤仍为O(N2)的情况下,大大降低了存储需求。只需少量额外步骤就能显示任何相似度得分大于给定阈值的公共子序列。该算法找到的比对在其得分局部最大的意义上是最优的,其中每个得分是赋予比对中各个匹配/替换、插入和删除的权重之和。该算法用C编程语言在个人计算机上实现。对于比较两个长度为2500个残基的蛋白质或核酸序列,随机存取存储器中64千字节的数据区域和磁盘上几百千字节的空间就足够了。当与快速序列搜索程序结合使用时,这些程序特别有价值。

相似文献

1
Pattern matching of biological sequences with limited storage.具有有限存储的生物序列模式匹配
Comput Appl Biosci. 1987 Mar;3(1):17-20. doi: 10.1093/bioinformatics/3.1.17.
2
Alignment of three biological sequences with an efficient traceback procedure.使用高效回溯程序对三个生物序列进行比对。
J Theor Biol. 1986 Aug 7;121(3):327-37. doi: 10.1016/s0022-5193(86)80112-6.
3
A search for common patterns in many sequences.
Comput Appl Biosci. 1992 Feb;8(1):57-64. doi: 10.1093/bioinformatics/8.1.57.
4
ProClust: improved clustering of protein sequences with an extended graph-based approach.ProClust:基于扩展的图形方法改进蛋白质序列聚类
Bioinformatics. 2002;18 Suppl 2:S182-91. doi: 10.1093/bioinformatics/18.suppl_2.s182.
5
Dynamic programming algorithms for biological sequence comparison.用于生物序列比较的动态规划算法。
Methods Enzymol. 1992;210:575-601. doi: 10.1016/0076-6879(92)10029-d.
6
A method for detecting distant evolutionary relationships between protein or nucleic acid sequences in the presence of deletions or insertions.一种在存在缺失或插入的情况下检测蛋白质或核酸序列之间远距离进化关系的方法。
J Mol Evol. 1978 Jun 20;11(2):143-61. doi: 10.1007/BF01733890.
7
SCARNA: fast and accurate structural alignment of RNA sequences by matching fixed-length stem fragments.SCARNA:通过匹配固定长度的茎片段实现RNA序列的快速准确结构比对。
Bioinformatics. 2006 Jul 15;22(14):1723-9. doi: 10.1093/bioinformatics/btl177. Epub 2006 May 11.
8
Aligning two sequences within a specified diagonal band.
Comput Appl Biosci. 1992 Oct;8(5):481-7. doi: 10.1093/bioinformatics/8.5.481.
9
Heuristic reusable dynamic programming: efficient updates of local sequence alignment.启发式可重用动态规划:局部序列比对的高效更新。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):570-82. doi: 10.1109/TCBB.2009.30.
10
CLUSTAL: a package for performing multiple sequence alignment on a microcomputer.CLUSTAL:一个用于在微型计算机上进行多序列比对的程序包。
Gene. 1988 Dec 15;73(1):237-44. doi: 10.1016/0378-1119(88)90330-7.

引用本文的文献

1
Cgaln: fast and space-efficient whole-genome alignment.CGALN:快速且节省空间的全基因组比对。
BMC Bioinformatics. 2010 Apr 30;11:224. doi: 10.1186/1471-2105-11-224.
2
Evolutionary relationships of the classes of major histocompatibility complex genes.主要组织相容性复合体基因类别的进化关系。
Immunogenetics. 1993;37(5):337-46. doi: 10.1007/BF00216798.
3
Vaccinia virus-encoded ribonucleotide reductase: sequence conservation of the gene for the small subunit and its amplification in hydroxyurea-resistant mutants.痘苗病毒编码的核糖核苷酸还原酶:小亚基基因的序列保守性及其在羟基脲抗性突变体中的扩增
J Virol. 1988 Feb;62(2):519-27. doi: 10.1128/JVI.62.2.519-527.1988.
4
Optimal sequence alignment allowing for long gaps.允许长间隙的最优序列比对。
Bull Math Biol. 1990;52(3):359-73. doi: 10.1007/BF02458577.