• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Swarm v3:迈向万亿级扩增子聚类。

Swarm v3: towards tera-scale amplicon clustering.

机构信息

UMR PHIM, CIRAD, Montpellier, France.

PHIM Plant Health Institute, Univ Montpellier, CIRAD, INRAE, Institut Agro, IRD, Montpellier, France.

出版信息

Bioinformatics. 2021 Dec 22;38(1):267-269. doi: 10.1093/bioinformatics/btab493.

DOI:10.1093/bioinformatics/btab493
PMID:34244702
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8696092/
Abstract

MOTIVATION

Previously we presented swarm, an open-source amplicon clustering programme that produces fine-scale molecular operational taxonomic units (OTUs) that are free of arbitrary global clustering thresholds. Here, we present swarm v3 to address issues of contemporary datasets that are growing towards tera-byte sizes.

RESULTS

When compared with previous swarm versions, swarm v3 has modernized C++ source code, reduced memory footprint by up to 50%, optimized CPU-usage and multithreading (more than 7 times faster with default parameters), and it has been extensively tested for its robustness and logic.

AVAILABILITY AND IMPLEMENTATION

Source code and binaries are available at https://github.com/torognes/swarm.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

此前,我们介绍了 swarm,这是一个开源的扩增子聚类程序,可生成无任意全局聚类阈值的精细分子操作分类单元 (OTUs)。在这里,我们介绍 swarm v3,以解决当前数据集不断增长到太字节大小的问题。

结果

与以前的 swarm 版本相比,swarm v3 对 C++ 源代码进行了现代化改造,内存占用减少了 50%,优化了 CPU 使用率和多线程(使用默认参数时速度提高了 7 倍以上),并且已经对其鲁棒性和逻辑进行了广泛测试。

可用性和实现

源代码和二进制文件可在 https://github.com/torognes/swarm 上获得。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
Swarm v3: towards tera-scale amplicon clustering.Swarm v3:迈向万亿级扩增子聚类。
Bioinformatics. 2021 Dec 22;38(1):267-269. doi: 10.1093/bioinformatics/btab493.
2
FROGS: Find, Rapidly, OTUs with Galaxy Solution.FROGS:使用 Galaxy 解决方案快速找到 OTUs。
Bioinformatics. 2018 Apr 15;34(8):1287-1294. doi: 10.1093/bioinformatics/btx791.
3
Swarm v2: highly-scalable and high-resolution amplicon clustering.Swarm v2:高度可扩展且高分辨率的扩增子聚类
PeerJ. 2015 Dec 10;3:e1420. doi: 10.7717/peerj.1420. eCollection 2015.
4
AmpliCI: a high-resolution model-based approach for denoising Illumina amplicon data.AmpliCI:一种基于高分辨率模型的 Illumina 扩增子数据去噪方法。
Bioinformatics. 2021 Jan 29;36(21):5151-5158. doi: 10.1093/bioinformatics/btaa648.
5
Swarm: robust and fast clustering method for amplicon-based studies.Swarm:一种基于扩增子的快速稳健聚类方法。
PeerJ. 2014 Sep 25;2:e593. doi: 10.7717/peerj.593. eCollection 2014.
6
MAPseq: highly efficient k-mer search with confidence estimates, for rRNA sequence analysis.MAPseq:用于 rRNA 序列分析的高效 k-mer 搜索方法及其置信度估计。
Bioinformatics. 2017 Dec 1;33(23):3808-3810. doi: 10.1093/bioinformatics/btx517.
7
Open-Source Sequence Clustering Methods Improve the State Of the Art.开源序列聚类方法提升了现有技术水平。
mSystems. 2016 Feb 9;1(1). doi: 10.1128/mSystems.00003-15. eCollection 2016 Jan-Feb.
8
AncestralClust: clustering of divergent nucleotide sequences by ancestral sequence reconstruction using phylogenetic trees.AncestralClust:基于系统发生树的祖先序列重建对分歧核苷酸序列进行聚类。
Bioinformatics. 2022 Jan 12;38(3):663-670. doi: 10.1093/bioinformatics/btab723.
9
Updating the 97% identity threshold for 16S ribosomal RNA OTUs.更新 16S 核糖体 RNA OTUs 的 97%同一性阈值。
Bioinformatics. 2018 Jul 15;34(14):2371-2375. doi: 10.1093/bioinformatics/bty113.
10
GMHCC: high-throughput analysis of biomolecular data using graph-based multiple hierarchical consensus clustering.GMHCC:基于图的多重层次共识聚类的生物分子数据的高通量分析。
Bioinformatics. 2022 May 26;38(11):3020-3028. doi: 10.1093/bioinformatics/btac290.

引用本文的文献

1
Varying Responses to Heat Stress and Salinization Between Benthic and Pelagic Riverine Microbial Communities.底栖和浮游河流微生物群落对热应激和盐渍化的不同响应
Environ Microbiol. 2025 Sep;27(9):e70173. doi: 10.1111/1462-2920.70173.
2
Predicting coarse-grained representations of biogeochemical cycles from metabarcoding data.从宏条形码数据预测生物地球化学循环的粗粒度表示。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i49-i57. doi: 10.1093/bioinformatics/btaf230.
3
Reverse microdialysis of sucrose stimulates soil fungal and bacterial growth at the microscale.蔗糖的反向微透析在微观尺度上刺激土壤真菌和细菌生长。
BMC Microbiol. 2025 Jul 14;25(1):436. doi: 10.1186/s12866-025-04082-5.
4
Biomonitoring 2.0 Refined: observing local change through metaphylogeography using a community-based eDNA metabarcoding monitoring network.生物监测2.0优化版:利用基于群落的环境DNA宏条形码监测网络,通过元系统发育地理学观察局部变化。
BMC Biol. 2025 Jul 1;23(1):187. doi: 10.1186/s12915-025-02284-x.
5
The impact of elevated temperature and salinity on microbial communities and food selectivity in heterotrophic nanoflagellates in the Boye River.高温和盐度对博耶河中异养型纳米鞭毛虫的微生物群落及食物选择性的影响。
ISME Commun. 2025 Mar 21;5(1):ycaf049. doi: 10.1093/ismeco/ycaf049. eCollection 2025 Jan.
6
Benthic Feeding and Diet Partitioning in Red Sea Mesopelagic Fish Resolved Through DNA Metabarcoding and ROV Footage.通过DNA宏条形码技术和遥控潜水器影像解析红海中层鱼类的底栖摄食与食性划分
Ecol Evol. 2025 Mar 6;15(3):e71091. doi: 10.1002/ece3.71091. eCollection 2025 Mar.
7
Comprehensive survey of kombucha microbial communities of diverse origins and fermentation practices.对不同来源和发酵方式的康普茶微生物群落进行全面调查。
FEMS Yeast Res. 2025 Jan 30;25. doi: 10.1093/femsyr/foaf005.
8
Enrichment of putative plant growth promoting microorganisms in biodynamic compared with organic agriculture soils.与有机农业土壤相比,生物动力农业土壤中假定的促进植物生长微生物的富集情况。
ISME Commun. 2024 Feb 5;4(1):ycae021. doi: 10.1093/ismeco/ycae021. eCollection 2024 Jan.
9
Female reproductive tract microbiota varies with MHC profile.女性生殖道微生物群与 MHC 谱有关。
Proc Biol Sci. 2024 Oct;291(2033):20241334. doi: 10.1098/rspb.2024.1334. Epub 2024 Oct 30.
10
Sediment Core DNA-Metabarcoding and Chitinous Remain Identification: Integrating Complementary Methods to Characterise Chironomidae Biodiversity in Lake Sediment Archives.沉积物岩芯DNA代谢条形码分析与几丁质残骸鉴定:整合互补方法以表征湖泊沉积物档案中的摇蚊科生物多样性
Mol Ecol Resour. 2025 Jan;25(1):e14035. doi: 10.1111/1755-0998.14035. Epub 2024 Oct 21.