• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

BLAST中改进的空位比对。

Improved gapped alignment in BLAST.

作者信息

Cameron Michael, Williams Hugh E, Cannane Adam

机构信息

School of Computer Science and Information Technology, RMIT University, GPO Box 2476V, Melbourne, Australia.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2004 Jul-Sep;1(3):116-29. doi: 10.1109/TCBB.2004.32.

DOI:10.1109/TCBB.2004.32
PMID:17048387
Abstract

Homology search is a key tool for understanding the role, structure, and biochemical function of genomic sequences. The most popular technique for rapid homology search is BLAST, which has been in widespread use within universities, research centers, and commercial enterprises since the early 1990s. In this paper, we propose a new step in the BLAST algorithm to reduce the computational cost of searching with negligible effect on accuracy. This new step-semigapped alignment-compromises between the efficiency of ungapped alignment and the accuracy of gapped alignment, allowing BLAST to accurately filter sequences with lower computational cost. In addition, we propose a heuristic-restricted insertion alignment-that avoids unlikely evolutionary paths with the aim of reducing gapped alignment cost with negligible effect on accuracy. Together, after including an optimization of the local alignment recursion, our two techniques more than double the speed of the gapped alignment stages in BLAST. We conclude that our techniques are an important improvement to the BLAST algorithm. Source code for the alignment algorithms is available for download at http://www.bsg.rmit.edu.au/iga/.

摘要

同源性搜索是理解基因组序列的作用、结构和生化功能的关键工具。快速进行同源性搜索最常用的技术是BLAST,自20世纪90年代初以来,它就在大学、研究中心和商业企业中广泛使用。在本文中,我们在BLAST算法中提出了一个新步骤,以降低搜索的计算成本,同时对准确性的影响可以忽略不计。这个新步骤——半间隙比对——在无间隙比对的效率和有间隙比对的准确性之间进行了折衷,使BLAST能够以更低的计算成本准确地筛选序列。此外,我们还提出了一种启发式方法——受限插入比对——以避免不太可能的进化路径,目的是以对准确性影响可忽略不计的方式降低有间隙比对的成本。综合起来,在对局部比对递归进行优化之后,我们的两项技术使BLAST中有间隙比对阶段的速度提高了一倍多。我们得出结论,我们的技术是对BLAST算法的一项重要改进。比对算法的源代码可从http://www.bsg.rmit.edu.au/iga/下载。

相似文献

1
Improved gapped alignment in BLAST.BLAST中改进的空位比对。
IEEE/ACM Trans Comput Biol Bioinform. 2004 Jul-Sep;1(3):116-29. doi: 10.1109/TCBB.2004.32.
2
An efficient algorithm after ungapped analysis in BLAST.一种在BLAST中进行无间隙分析后的高效算法。
DNA Seq. 2006 Aug;17(4):247-53. doi: 10.1080/10425170600805128.
3
Comparing compressed sequences for faster nucleotide BLAST searches.比较压缩序列以进行更快的核苷酸BLAST搜索。
IEEE/ACM Trans Comput Biol Bioinform. 2007 Jul-Sep;4(3):349-64. doi: 10.1109/TCBB.2007.1029.
4
PROMALS web server for accurate multiple protein sequence alignments.用于精确多蛋白序列比对的PROMALS网络服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W649-52. doi: 10.1093/nar/gkm227. Epub 2007 Apr 22.
5
OXBench: a benchmark for evaluation of protein multiple sequence alignment accuracy.OXBench:一种用于评估蛋白质多序列比对准确性的基准。
BMC Bioinformatics. 2003 Oct 10;4:47. doi: 10.1186/1471-2105-4-47.
6
A deterministic finite automaton for faster protein hit detection in BLAST.一种用于在BLAST中更快检测蛋白质匹配的确定性有限自动机。
J Comput Biol. 2006 May;13(4):965-78. doi: 10.1089/cmb.2006.13.965.
7
ParAlign: a parallel sequence alignment algorithm for rapid and sensitive database searches.ParAlign:一种用于快速且灵敏的数据库搜索的并行序列比对算法。
Nucleic Acids Res. 2001 Apr 1;29(7):1647-52. doi: 10.1093/nar/29.7.1647.
8
BLAST and FASTA similarity searching for multiple sequence alignment.用于多序列比对的BLAST和FASTA相似性搜索。
Methods Mol Biol. 2014;1079:75-101. doi: 10.1007/978-1-62703-646-7_5.
9
T-Coffee: A novel method for fast and accurate multiple sequence alignment.T-Coffee:一种用于快速准确的多序列比对的新方法。
J Mol Biol. 2000 Sep 8;302(1):205-17. doi: 10.1006/jmbi.2000.4042.
10
SALSA: improved protein database searching by a new algorithm for assembly of sequence fragments into gapped alignments.SALSA:通过一种将序列片段组装成带空位比对的新算法改进蛋白质数据库搜索。
Bioinformatics. 1998;14(10):839-45. doi: 10.1093/bioinformatics/14.10.839.

引用本文的文献

1
Comparative Transcriptomic and Metabolomic Analyses of Differences in Trunk Spiral Grain in .树干螺旋纹理差异的比较转录组学和代谢组学分析。
Int J Mol Sci. 2023 Sep 28;24(19):14658. doi: 10.3390/ijms241914658.
2
Complete Genome Sequences of Two Apple Stem Grooving Viruses in Cnidium officinale in Korea.韩国药用植物蛇床子中两种苹果茎沟病毒的全基因组序列
Microbiol Resour Announc. 2023 Jan 24;12(1):e0090222. doi: 10.1128/mra.00902-22. Epub 2022 Dec 7.
3
An Eruption of LTR Retrotransposons in the Autopolyploid Genomes of (Asteraceae).
菊科植物同源多倍体基因组中LTR反转录转座子的爆发
Plants (Basel). 2022 Jan 25;11(3):315. doi: 10.3390/plants11030315.
4
Transcriptome Analysis of Jojoba () during Seed Development and Liquid Wax Ester Biosynthesis.霍霍巴种子发育及液态蜡酯生物合成过程中的转录组分析
Plants (Basel). 2020 May 4;9(5):588. doi: 10.3390/plants9050588.
5
How sequence alignment scores correspond to probability models.序列比对分数如何对应概率模型。
Bioinformatics. 2020 Jan 15;36(2):408-415. doi: 10.1093/bioinformatics/btz576.
6
Hypothalamus-pituitary-gonad axis transcriptome profiling for sex differentiation in Acipenser sinensis.鲟鱼下丘脑-垂体-性腺轴转录组分析用于性别分化。
Sci Data. 2019 Jun 13;6(1):87. doi: 10.1038/s41597-019-0099-1.
7
Identification of favorable SNP alleles and candidate genes responsible for inflorescence-related traits via GWAS in chrysanthemum.通过 GWAS 鉴定菊花花序相关性状的有利 SNP 等位基因和候选基因。
Plant Mol Biol. 2019 Mar;99(4-5):407-420. doi: 10.1007/s11103-019-00826-w. Epub 2019 Jan 30.
8
The Population Genetics of in Four Regions of China as Determined by Microsatellite Markers Obtained by Transcriptome Sequencing.利用转录组测序获得的微卫星标记确定中国四个地区的 群体遗传学。 (原文中“of”后面缺少具体内容)
Front Microbiol. 2018 Dec 3;9:2904. doi: 10.3389/fmicb.2018.02904. eCollection 2018.
9
Transcriptome sequencing of Pinus kesiya var. langbianensis and comparative analysis in the Pinus phylogeny.梁王杉转录组测序及松属系统发育中的比较分析。
BMC Genomics. 2018 Oct 3;19(1):725. doi: 10.1186/s12864-018-5127-6.
10
Mining and Development of Novel SSR Markers Using Next Generation Sequencing (NGS) Data in Plants.利用新一代测序(NGS)数据在植物中挖掘和开发新型 SSR 标记。
Molecules. 2018 Feb 13;23(2):399. doi: 10.3390/molecules23020399.