• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于估计任意两个给定DNA序列之间比对的模式匹配方法。

A pattern matching approach for the estimation of alignment between any two given DNA sequences.

作者信息

Basu K, Sriraam N, Richard R J A

机构信息

Faculty of Information Technology, Multimedia University, 63100 Cyberjaya, Malaysia.

出版信息

J Med Syst. 2007 Aug;31(4):247-53. doi: 10.1007/s10916-007-9062-3.

DOI:10.1007/s10916-007-9062-3
PMID:17685148
Abstract

For a given DNA sequence, it is well known that pair wise alignment schemes are used to determine the similarity with the DNA sequences available in the databanks. The efficiency of the alignment decides the type of amino acids and its corresponding proteins. In order to evaluate the given DNA sequence for its proteomic identity, a pattern matching approach is proposed in this paper. A block based semi-global alignment scheme is introduced to determine the similarity between the DNA sequences (known and given). The two DNA sequences are divided into blocks of equal length and alignment is performed which minimizes the computational complexity. The efficiency of the alignment scheme is evaluated using the parameter, percentage of similarity (POS). Four essential DNA version of the amino acids that emphasize the importance of proteomic functionalities are chosen as patterns and matching is performed with the known and given DNA sequences to determine the similarity between them. The ratio of amino acid counts between the two sequences is estimated and the results are compared with that of the POS value. It is found from the experimental results that higher the POS value and the pattern matching higher are the similarity between the two DNA sequences. The optimal block is also identified based on the POS value and amino acids count.

摘要

对于给定的DNA序列,众所周知,成对排列方案用于确定与数据库中现有DNA序列的相似性。排列的效率决定了氨基酸及其相应蛋白质的类型。为了评估给定DNA序列的蛋白质组特性,本文提出了一种模式匹配方法。引入了一种基于块的半全局排列方案来确定DNA序列(已知序列和给定序列)之间的相似性。将两个DNA序列分成等长的块并进行排列,以最小化计算复杂度。使用相似性百分比(POS)参数评估排列方案的效率。选择四种强调蛋白质组功能重要性的氨基酸的基本DNA版本作为模式,并与已知DNA序列和给定DNA序列进行匹配,以确定它们之间的相似性。估计两个序列之间氨基酸计数的比率,并将结果与POS值进行比较。从实验结果发现,POS值越高且模式匹配度越高,两个DNA序列之间的相似性就越高。还根据POS值和氨基酸计数确定了最佳块。

相似文献

1
A pattern matching approach for the estimation of alignment between any two given DNA sequences.一种用于估计任意两个给定DNA序列之间比对的模式匹配方法。
J Med Syst. 2007 Aug;31(4):247-53. doi: 10.1007/s10916-007-9062-3.
2
Sequence alignment by cross-correlation.通过互相关进行序列比对。
J Biomol Tech. 2005 Dec;16(4):453-8.
3
ProClust: improved clustering of protein sequences with an extended graph-based approach.ProClust:基于扩展的图形方法改进蛋白质序列聚类
Bioinformatics. 2002;18 Suppl 2:S182-91. doi: 10.1093/bioinformatics/18.suppl_2.s182.
4
Compression of Multiple DNA Sequences Using Intra-Sequence and Inter-Sequence Similarities.利用序列内和序列间相似性对多个DNA序列进行压缩
IEEE/ACM Trans Comput Biol Bioinform. 2015 Nov-Dec;12(6):1322-32. doi: 10.1109/TCBB.2015.2403370.
5
Compressed pattern matching in DNA sequences.DNA序列中的压缩模式匹配
Proc IEEE Comput Syst Bioinform Conf. 2004:62-8. doi: 10.1109/csb.2004.1332418.
6
ABS: Sequence alignment by scanning.ABS:通过扫描进行序列比对。
Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:928-31. doi: 10.1109/IEMBS.2011.6090209.
7
High similarity sequence comparison in clustering large sequence databases.在大型序列数据库聚类中的高相似性序列比较。
Proc IEEE Comput Soc Bioinform Conf. 2002;1:228-36.
8
Optimization and Performance Analysis of CAT Method for DNA Sequence Similarity Searching and Alignment.CAT 方法在 DNA 序列相似性搜索和比对中的优化与性能分析。
Genes (Basel). 2024 Mar 7;15(3):341. doi: 10.3390/genes15030341.
9
A new similarity measure among protein sequences.一种蛋白质序列间新的相似性度量方法。
Proc IEEE Comput Soc Bioinform Conf. 2003;2:347-52.
10
Learning scoring schemes for sequence alignment from partial examples.从部分示例中学习序列比对的评分方案。
IEEE/ACM Trans Comput Biol Bioinform. 2008 Oct-Dec;5(4):546-56. doi: 10.1109/TCBB.2008.57.

引用本文的文献

1
Non-coding RNA annotation of the genome of Trichoplax adhaerens.黏菌盘基网柄菌基因组的非编码RNA注释
Nucleic Acids Res. 2009 Apr;37(5):1602-15. doi: 10.1093/nar/gkn1084. Epub 2009 Jan 16.

本文引用的文献

1
A generalized global alignment algorithm.一种广义全局比对算法。
Bioinformatics. 2003 Jan 22;19(2):228-33. doi: 10.1093/bioinformatics/19.2.228.
2
[Getting the appropriate set of sequences for your research: how to use sequence retrieval system on DNA Data Bank of Japan (DDBJ)].
Tanpakushitsu Kakusan Koso. 2001 May;46(6):764-70.
3
A new approach to sequence comparison: normalized sequence alignment.一种序列比较的新方法:标准化序列比对。
Bioinformatics. 2001 Apr;17(4):327-37. doi: 10.1093/bioinformatics/17.4.327.
4
Improved tools for biological sequence comparison.用于生物序列比较的改进工具。
Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444-8. doi: 10.1073/pnas.85.8.2444.
5
Basic local alignment search tool.基本局部比对搜索工具
J Mol Biol. 1990 Oct 5;215(3):403-10. doi: 10.1016/S0022-2836(05)80360-2.