Suppr超能文献

Swarm v3:迈向万亿级扩增子聚类。

Swarm v3: towards tera-scale amplicon clustering.

机构信息

UMR PHIM, CIRAD, Montpellier, France.

PHIM Plant Health Institute, Univ Montpellier, CIRAD, INRAE, Institut Agro, IRD, Montpellier, France.

出版信息

Bioinformatics. 2021 Dec 22;38(1):267-269. doi: 10.1093/bioinformatics/btab493.

Abstract

MOTIVATION

Previously we presented swarm, an open-source amplicon clustering programme that produces fine-scale molecular operational taxonomic units (OTUs) that are free of arbitrary global clustering thresholds. Here, we present swarm v3 to address issues of contemporary datasets that are growing towards tera-byte sizes.

RESULTS

When compared with previous swarm versions, swarm v3 has modernized C++ source code, reduced memory footprint by up to 50%, optimized CPU-usage and multithreading (more than 7 times faster with default parameters), and it has been extensively tested for its robustness and logic.

AVAILABILITY AND IMPLEMENTATION

Source code and binaries are available at https://github.com/torognes/swarm.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

此前,我们介绍了 swarm,这是一个开源的扩增子聚类程序,可生成无任意全局聚类阈值的精细分子操作分类单元 (OTUs)。在这里,我们介绍 swarm v3,以解决当前数据集不断增长到太字节大小的问题。

结果

与以前的 swarm 版本相比,swarm v3 对 C++ 源代码进行了现代化改造,内存占用减少了 50%,优化了 CPU 使用率和多线程(使用默认参数时速度提高了 7 倍以上),并且已经对其鲁棒性和逻辑进行了广泛测试。

可用性和实现

源代码和二进制文件可在 https://github.com/torognes/swarm 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Swarm v3: towards tera-scale amplicon clustering.Swarm v3:迈向万亿级扩增子聚类。
Bioinformatics. 2021 Dec 22;38(1):267-269. doi: 10.1093/bioinformatics/btab493.
2
FROGS: Find, Rapidly, OTUs with Galaxy Solution.FROGS:使用 Galaxy 解决方案快速找到 OTUs。
Bioinformatics. 2018 Apr 15;34(8):1287-1294. doi: 10.1093/bioinformatics/btx791.
7
Open-Source Sequence Clustering Methods Improve the State Of the Art.开源序列聚类方法提升了现有技术水平。
mSystems. 2016 Feb 9;1(1). doi: 10.1128/mSystems.00003-15. eCollection 2016 Jan-Feb.
9
Updating the 97% identity threshold for 16S ribosomal RNA OTUs.更新 16S 核糖体 RNA OTUs 的 97%同一性阈值。
Bioinformatics. 2018 Jul 15;34(14):2371-2375. doi: 10.1093/bioinformatics/bty113.

引用本文的文献

9
Female reproductive tract microbiota varies with MHC profile.女性生殖道微生物群与 MHC 谱有关。
Proc Biol Sci. 2024 Oct;291(2033):20241334. doi: 10.1098/rspb.2024.1334. Epub 2024 Oct 30.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验