MAFFT：一种基于快速傅里叶变换的快速多序列比对新方法。

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

作者信息

Katoh Kazutaka, Misawa Kazuharu, Kuma Kei-ichi, Miyata Takashi

机构信息

Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan.

出版信息

Nucleic Acids Res. 2002 Jul 15;30(14):3059-66. doi: 10.1093/nar/gkf436.

DOI:10.1093/nar/gkf436

PMID:12136088

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC135756/

Abstract

A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homo logous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.

摘要

已经开发了一种多序列比对程序MAFFT。与现有方法相比，其CPU时间大幅减少。MAFFT包含两种新技术。（i）通过快速傅里叶变换（FFT）快速识别同源区域，其中氨基酸序列被转换为由每个氨基酸残基的体积和极性值组成的序列。（ii）我们提出了一种简化的评分系统，该系统即使对于具有大插入或延伸的序列以及长度相似的远缘相关序列，在减少CPU时间和提高比对准确性方面也表现良好。MAFFT实现了两种不同的启发式方法，渐进方法（FFT-NS-2）和迭代细化方法（FFT-NS-i）。通过计算机模拟和基准测试将FFT-NS-2和FFT-NS-i的性能与其他方法进行了比较；与CLUSTALW相比，FFT-NS-2的CPU时间大幅减少，且准确性相当。当输入序列数量超过60时，FFT-NS-i比T-COFFEE快100多倍，且不牺牲准确性。

相似文献

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Nucleic Acids Res. 2002 Jul 15;30(14):3059-66. doi: 10.1093/nar/gkf436.

A Modified Multiple Alignment Fast Fourier Transform with Higher Efficiency.

IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):634-645. doi: 10.1109/TCBB.2016.2530064. Epub 2016 Feb 15.

Improvement in the accuracy of multiple sequence alignment program MAFFT.

Genome Inform. 2005;16(1):22-33.

Mind the gaps: evidence of bias in estimates of multiple sequence alignments.

Mol Biol Evol. 2007 Nov;24(11):2433-42. doi: 10.1093/molbev/msm176. Epub 2007 Aug 20.

MAFFT version 5: improvement in accuracy of multiple sequence alignment.

Nucleic Acids Res. 2005 Jan 20;33(2):511-8. doi: 10.1093/nar/gki198. Print 2005.

MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Nucleic Acids Res. 2004 Mar 19;32(5):1792-7. doi: 10.1093/nar/gkh340. Print 2004.

MAFFT: iterative refinement and additional methods.

Methods Mol Biol. 2014;1079:131-46. doi: 10.1007/978-1-62703-646-7_8.

Parallelization of the MAFFT multiple sequence alignment program.

Bioinformatics. 2010 Aug 1;26(15):1899-900. doi: 10.1093/bioinformatics/btq224. Epub 2010 Apr 28.

Assessing the efficiency of multiple sequence alignment programs.

Algorithms Mol Biol. 2014 Mar 6;9(1):4. doi: 10.1186/1748-7188-9-4.

The accuracy of several multiple sequence alignment programs for proteins.

BMC Bioinformatics. 2006 Oct 24;7:471. doi: 10.1186/1471-2105-7-471.

引用本文的文献

The complex pangenome is small and shaped by sub-lineage-specific regions of difference.

Elife. 2025 Sep 5;13:RP97870. doi: 10.7554/eLife.97870.

Microbiome dynamics associated with the infection of grey field slugs by the biocontrol nematode .

Front Microbiol. 2025 Aug 20;16:1619231. doi: 10.3389/fmicb.2025.1619231. eCollection 2025.

Characterization of mitogenome revealed a remarkable evolution in genome size and composition of protein-coding genes.

Front Plant Sci. 2025 Aug 20;16:1532782. doi: 10.3389/fpls.2025.1532782. eCollection 2025.

Carotenoids bind rhodopsins and act as photocycle-accelerating pigments in marine Bacteroidota.

Nat Microbiol. 2025 Sep 4. doi: 10.1038/s41564-025-02109-1.

Gut microbiome communities demonstrate fine-scale spatial variation in a closed, island bird population.

ISME Commun. 2025 Aug 11;5(1):ycaf138. doi: 10.1093/ismeco/ycaf138. eCollection 2025 Jan.

Chromosome-scale genome assembly and gene annotation of the hydrothermal vent annelid Alvinella pompejana yield insight into animal evolution in extreme environments.

BMC Biol. 2025 Sep 2;23(1):274. doi: 10.1186/s12915-025-02369-7.

Functional diversity of phage sponge proteins that sequester host immune signals.

bioRxiv. 2025 Aug 24:2025.08.24.671296. doi: 10.1101/2025.08.24.671296.

Comparison of iSeq and Miseq in 16S rRNA sequencing-based human gut microbiome analysis.

bioRxiv. 2025 Aug 22:2025.08.22.671784. doi: 10.1101/2025.08.22.671784.

Identification of severity related mutation hotspots in SARS-CoV-2 using a density-based clustering approach.

BioData Min. 2025 Sep 1;18(1):61. doi: 10.1186/s13040-025-00476-3.

Genomic resistance in historical clinical isolates increased in frequency and mobility after the age of antibiotics.

Microb Genom. 2025 Sep;11(9). doi: 10.1099/mgen.0.001474.

本文引用的文献

The efficient computation of position-specific match scores with the fast fourier transform.

J Comput Biol. 2002;9(1):23-33. doi: 10.1089/10665270252833172.

Evaluation of protein multiple alignments by SAM-T99 using the BAliBASE multiple alignment test set.

Bioinformatics. 2001 Aug;17(8):713-20. doi: 10.1093/bioinformatics/17.8.713.

Multiple alignment of complete sequences (MACS) in the post-genomic era.

Gene. 2001 May 30;270(1-2):17-30. doi: 10.1016/s0378-1119(01)00461-9.

The RDP-II (Ribosomal Database Project).

Nucleic Acids Res. 2001 Jan 1;29(1):173-4. doi: 10.1093/nar/29.1.173.

T-Coffee: A novel method for fast and accurate multiple sequence alignment.

J Mol Biol. 2000 Sep 8;302(1):205-17. doi: 10.1006/jmbi.2000.4042.

A comprehensive comparison of multiple sequence alignment programs.

Nucleic Acids Res. 1999 Jul 1;27(13):2682-90. doi: 10.1093/nar/27.13.2682.

Alignment of whole genomes.

Nucleic Acids Res. 1999 Jun 1;27(11):2369-76. doi: 10.1093/nar/27.11.2369.

DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment.

Bioinformatics. 1999 Mar;15(3):211-8. doi: 10.1093/bioinformatics/15.3.211.

BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs.

Bioinformatics. 1999 Jan;15(1):87-8. doi: 10.1093/bioinformatics/15.1.87.

Evolutionary motif and its biological and structural significance.

J Mol Evol. 1997;44 Suppl 1:S38-43. doi: 10.1007/pl00000056.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

MAFFT：一种基于快速傅里叶变换的快速多序列比对新方法。

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

作者信息

Katoh Kazutaka, Misawa Kazuharu, Kuma Kei-ichi, Miyata Takashi

机构信息

Department of Biophysics, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan.

出版信息

Nucleic Acids Res. 2002 Jul 15;30(14):3059-66. doi: 10.1093/nar/gkf436.

DOI:10.1093/nar/gkf436

PMID:12136088

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC135756/

Abstract

摘要

MAFFT：一种基于快速傅里叶变换的快速多序列比对新方法。

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

MAFFT：一种基于快速傅里叶变换的快速多序列比对新方法。

MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

作者信息

机构信息

出版信息