• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

FAME:一种快速且内存高效的多序列比对工具,通过兼容的根链实现。

FAME: fast and memory efficient multiple sequences alignment tool through compatible chain of roots.

机构信息

Department of Computer Engineering, Shiraz Branch, Islamic Azad University, Shiraz, Iran.

Department of Computer Engineering, Sharif University of Technology, Tehran, Iran.

出版信息

Bioinformatics. 2020 Jun 1;36(12):3662-3668. doi: 10.1093/bioinformatics/btaa175.

DOI:10.1093/bioinformatics/btaa175
PMID:32170927
Abstract

MOTIVATION

Multiple sequence alignment (MSA) is important and challenging problem of computational biology. Most of the existing methods can only provide a short length multiple alignments in an acceptable time. Nevertheless, when the researchers confront the genome size in the multiple alignments, the process has required a huge processing space/time. Accordingly, using the method that can align genome size rapidly and precisely has a great effect, especially on the analysis of the very long alignments. Herein, we have proposed an efficient method, called FAME, which vertically divides sequences from the places that they have common areas; then they are arranged in consecutive order. Then these common areas are shifted and placed under each other, and the subsequences between them are aligned using any existing MSA tool.

RESULTS

The results demonstrate that the combination of FAME and the MSA methods and deploying minimizer are capable to be executed on personal computer and finely align long length sequences with much higher sum-of-pair (SP) score compared to the standalone MSA tools. As we select genomic datasets with longer length, the SP score of the combinatorial methods is gradually improved. The calculated computational complexity of methods supports the results in a way that combining FAME and the MSA tools leads to at least four times faster execution on the datasets.

AVAILABILITY AND IMPLEMENTATION

The source code and all datasets and run-parameters are accessible free on http://github.com/naznoosh/msa.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

多序列比对(MSA)是计算生物学中的一个重要且具有挑战性的问题。大多数现有的方法只能在可接受的时间内提供短长度的多重比对。然而,当研究人员在多重比对中面对基因组大小时,该过程需要巨大的处理空间/时间。因此,使用能够快速、准确地进行基因组大小比对的方法具有重要意义,尤其是在对非常长的比对进行分析时。在这里,我们提出了一种有效的方法,称为 FAME,它从具有公共区域的位置垂直划分序列;然后按连续顺序排列。然后,这些公共区域被移动并放置在彼此之下,它们之间的子序列使用任何现有的 MSA 工具进行对齐。

结果

结果表明,FAME 与 MSA 方法的组合以及使用 minimizer 能够在个人计算机上执行,并与独立的 MSA 工具相比,以更高的总和对(SP)分数精细对齐长序列。当我们选择具有更长长度的基因组数据集时,组合方法的 SP 分数逐渐提高。方法的计算复杂度计算以这样的方式支持结果,即 FAME 与 MSA 工具的组合导致在数据集上的执行速度至少快四倍。

可用性和实现

源代码以及所有数据集和运行参数都可在 http://github.com/naznoosh/msa 上免费获取。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
FAME: fast and memory efficient multiple sequences alignment tool through compatible chain of roots.FAME:一种快速且内存高效的多序列比对工具,通过兼容的根链实现。
Bioinformatics. 2020 Jun 1;36(12):3662-3668. doi: 10.1093/bioinformatics/btaa175.
2
ViralMSA: massively scalable reference-guided multiple sequence alignment of viral genomes.病毒 MSA:大规模可扩展的基于参考的病毒基因组多重序列比对。
Bioinformatics. 2021 May 5;37(5):714-716. doi: 10.1093/bioinformatics/btaa743.
3
M2Align: parallel multiple sequence alignment with a multi-objective metaheuristic.M2Align:一种使用多目标元启发式算法的并行多序列比对方法。
Bioinformatics. 2017 Oct 1;33(19):3011-3017. doi: 10.1093/bioinformatics/btx338.
4
Protein multiple sequence alignment benchmarking through secondary structure prediction.通过二级结构预测进行蛋白质多序列比对基准测试。
Bioinformatics. 2017 May 1;33(9):1331-1337. doi: 10.1093/bioinformatics/btw840.
5
TPMA: A two pointers meta-alignment tool to ensemble different multiple nucleic acid sequence alignments.TPMA:一种双指针元比对工具,用于集成不同的多个核酸序列比对。
PLoS Comput Biol. 2024 Apr 1;20(4):e1011988. doi: 10.1371/journal.pcbi.1011988. eCollection 2024 Apr.
6
FMAlign2: a novel fast multiple nucleotide sequence alignment method for ultralong datasets.FMAlign2:一种新颖的快速多核苷酸序列比对方法,适用于超大数据集。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae014.
7
A local multiple alignment method for detection of non-coding RNA sequences.一种用于检测非编码RNA序列的局部多重比对方法。
Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.
8
UPP2: fast and accurate alignment of datasets with fragmentary sequences.UPP2:快速准确地对齐具有片段序列的数据集。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad007.
9
A novel fast multiple nucleotide sequence alignment method based on FM-index.基于 FM-index 的新型快速多核苷酸序列比对方法。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab519.
10
Combining many multiple alignments in one improved alignment.将多个多序列比对合并为一个改进的比对。
Bioinformatics. 1999 Feb;15(2):122-30. doi: 10.1093/bioinformatics/15.2.122.

引用本文的文献

1
Genetic surveillance reveals low but sustained malaria transmission with clonal replacement in Sao Tome and Principe.基因监测显示,圣多美和普林西比存在低水平但持续的疟疾传播,且有克隆取代现象。
Commun Med (Lond). 2025 May 27;5(1):199. doi: 10.1038/s43856-025-00905-8.
2
Genetic surveillance reveals low, sustained malaria transmission with clonal replacement in Sao Tome and Principe.基因监测显示,圣多美和普林西比存在低水平且持续的疟疾传播,并伴有克隆取代现象。
medRxiv. 2024 Jul 16:2024.07.15.24309968. doi: 10.1101/2024.07.15.24309968.
3
FMAlign2: a novel fast multiple nucleotide sequence alignment method for ultralong datasets.
FMAlign2:一种新颖的快速多核苷酸序列比对方法,适用于超大数据集。
Bioinformatics. 2024 Jan 2;40(1). doi: 10.1093/bioinformatics/btae014.
4
Creating and Using Minimizer Sketches in Computational Genomics.在计算基因组学中创建和使用最小草图。
J Comput Biol. 2023 Dec;30(12):1251-1276. doi: 10.1089/cmb.2023.0094. Epub 2023 Aug 30.
5
Developments in Algorithms for Sequence Alignment: A Review.序列比对算法的发展:综述。
Biomolecules. 2022 Apr 6;12(4):546. doi: 10.3390/biom12040546.