• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于查找限制酶切位点及其他模式匹配应用的有限状态机算法。

A finite state machine algorithm for finding restriction sites and other pattern matching applications.

作者信息

Smith R

机构信息

Public Health Research Institute, New York, NY 10016.

出版信息

Comput Appl Biosci. 1988 Nov;4(4):459-65. doi: 10.1093/bioinformatics/4.4.459.

DOI:10.1093/bioinformatics/4.4.459
PMID:3208180
Abstract

Existing algorithms for finding restriction endonuclease recognition sites use brute-force algorithms which run in time 0(NM) where N is the number of nucleotides in the sequence under analysis and M is the total number of nucleotides in all the different sites being searched for. This paper presents a deterministic finite state machine algorithm which runs in time 0(N). Memory use can be as high as 0(M4) but a slight modification to the basic algorithm can impose a theoretical upper bound of 0(M) at the cost of some added complexity in the execution of the state machine. The algorithm can operate with a single pass through the sequence under analysis, with no need to back up or (for non-circular sequences) store more than a single input character at a time. This type of algorithm can be adapted to many pattern-matching tasks and is simple enough to implement in hardware that it could, for example, be built into a disk controller as part of a specialized database machine.

摘要

现有的用于寻找限制性内切酶识别位点的算法使用暴力算法,其运行时间为O(NM),其中N是被分析序列中的核苷酸数量,M是正在搜索的所有不同位点中的核苷酸总数。本文提出了一种确定性有限状态机算法,其运行时间为O(N)。内存使用量可能高达O(M4),但对基本算法进行轻微修改可以将理论上限设为O(M),代价是状态机执行时会增加一些复杂度。该算法可以对被分析序列进行一次遍历操作,无需回溯(对于非环状序列),并且每次只需要存储一个输入字符。这种类型的算法可以适用于许多模式匹配任务,并且足够简单,可以在硬件中实现,例如,可以作为专用数据库机器的一部分内置到磁盘控制器中。

相似文献

1
A finite state machine algorithm for finding restriction sites and other pattern matching applications.一种用于查找限制酶切位点及其他模式匹配应用的有限状态机算法。
Comput Appl Biosci. 1988 Nov;4(4):459-65. doi: 10.1093/bioinformatics/4.4.459.
2
A generic algorithm for finding restriction sites within DNA sequences.一种用于在DNA序列中寻找限制性酶切位点的通用算法。
Comput Appl Biosci. 1991 Apr;7(2):249-56. doi: 10.1093/bioinformatics/7.2.249.
3
A Simple, Fast, Filter-Based Algorithm for Approximate Circular Pattern Matching.
IEEE Trans Nanobioscience. 2016 Mar;15(2):93-100. doi: 10.1109/TNB.2016.2542062. Epub 2016 Mar 14.
4
High performance pattern matching on heterogeneous platform.异构平台上的高性能模式匹配
J Integr Bioinform. 2014 Oct 23;11(3):253. doi: 10.2390/biecoll-jib-2014-253.
5
ICRPfinder: a fast pattern design algorithm for coding sequences and its application in finding potential restriction enzyme recognition sites.ICRPfinder:一种快速的编码序列模式设计算法及其在寻找潜在限制酶识别位点中的应用。
BMC Bioinformatics. 2009 Sep 11;10:286. doi: 10.1186/1471-2105-10-286.
6
An algorithm for finding signals of unknown length in DNA sequences.一种用于在DNA序列中寻找未知长度信号的算法。
Bioinformatics. 2001;17 Suppl 1:S207-14. doi: 10.1093/bioinformatics/17.suppl_1.s207.
7
Pattern matching in indeterminate and Arc-annotated sequences.不确定序列和带弧注释序列中的模式匹配
Recent Pat DNA Gene Seq. 2013 Aug;7(2):96-104. doi: 10.2174/1872215611307020003.
8
Detection of possible restriction sites for type II restriction enzymes in DNA sequences.检测DNA序列中II型限制性内切酶的可能限制位点。
Rom J Intern Med. 2011;49(2):121-8.
9
Finding significant matches of position weight matrices in linear time.在线性时间内找到位置权重矩阵的显著匹配。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Jan-Mar;8(1):69-79. doi: 10.1109/TCBB.2009.35.
10
A fast, sensitive pattern-matching approach for protein sequences.一种用于蛋白质序列的快速、灵敏的模式匹配方法。
Comput Appl Biosci. 1993 Apr;9(2):183-9. doi: 10.1093/bioinformatics/9.2.183.

引用本文的文献

1
PatMaN: rapid alignment of short sequences to large databases.PatMaN:短序列与大型数据库的快速比对
Bioinformatics. 2008 Jul 1;24(13):1530-1. doi: 10.1093/bioinformatics/btn223. Epub 2008 May 8.
2
Identification of a DNA sequence motif required for expression of iron-regulated genes in pseudomonads.鉴定假单胞菌中铁调节基因表达所需的DNA序列基序。
Mol Gen Genet. 1995 Feb 20;246(4):519-28. doi: 10.1007/BF00290456.