Suppr超能文献

并行化 MAFFT 进行大规模多序列比对。

Parallelization of MAFFT for large-scale multiple sequence alignments.

机构信息

Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, Chiba, Japan.

Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan.

出版信息

Bioinformatics. 2018 Jul 15;34(14):2490-2492. doi: 10.1093/bioinformatics/bty121.

Abstract

SUMMARY

We report an update for the MAFFT multiple sequence alignment program to enable parallel calculation of large numbers of sequences. The G-INS-1 option of MAFFT was recently reported to have higher accuracy than other methods for large data, but this method has been impractical for most large-scale analyses, due to the requirement of large computational resources. We introduce a scalable variant, G-large-INS-1, which has equivalent accuracy to G-INS-1 and is applicable to 50 000 or more sequences.

AVAILABILITY AND IMPLEMENTATION

This feature is available in MAFFT versions 7.355 or later at https://mafft.cbrc.jp/alignment/software/mpi.html.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

摘要

我们报告了 MAFFT 多序列对齐程序的更新,以实现大量序列的并行计算。最近有报道称,MAFFT 的 G-INS-1 选项在处理大数据时比其他方法具有更高的准确性,但由于需要大量计算资源,该方法对于大多数大规模分析来说并不实用。我们引入了一种可扩展的变体 G-large-INS-1,它与 G-INS-1 具有相同的准确性,适用于 50000 个或更多的序列。

可用性和实现

此功能在 MAFFT 版本 7.355 或更高版本中可用,网址为 https://mafft.cbrc.jp/alignment/software/mpi.html。

补充信息

补充数据可在 Bioinformatics 在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fd82/6041967/9ee5c839250c/bty121f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验