• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

弗洛里亚:宏基因组中快速准确的菌株单倍型分型。

Floria: fast and accurate strain haplotyping in metagenomes.

机构信息

Department of Mathematics, University of Toronto, Toronto, Ontario, M5S 2E4, Canada.

Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), 60 Biopolis Street, Singapore, 138672, Republic of Singapore.

出版信息

Bioinformatics. 2024 Jun 28;40(Suppl 1):i30-i38. doi: 10.1093/bioinformatics/btae252.

DOI:10.1093/bioinformatics/btae252
PMID:38940183
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11211831/
Abstract

SUMMARY

Shotgun metagenomics allows for direct analysis of microbial community genetics, but scalable computational methods for the recovery of bacterial strain genomes from microbiomes remains a key challenge. We introduce Floria, a novel method designed for rapid and accurate recovery of strain haplotypes from short and long-read metagenome sequencing data, based on minimum error correction (MEC) read clustering and a strain-preserving network flow model. Floria can function as a standalone haplotyping method, outputting alleles and reads that co-occur on the same strain, as well as an end-to-end read-to-assembly pipeline (Floria-PL) for strain-level assembly. Benchmarking evaluations on synthetic metagenomes show that Floria is  > 3× faster and recovers 21% more strain content than base-level assembly methods (Strainberry) while being over an order of magnitude faster when only phasing is required. Applying Floria to a set of 109 deeply sequenced nanopore metagenomes took <20 min on average per sample and identified several species that have consistent strain heterogeneity. Applying Floria's short-read haplotyping to a longitudinal gut metagenomics dataset revealed a dynamic multi-strain Anaerostipes hadrus community with frequent strain loss and emergence events over 636 days. With Floria, accurate haplotyping of metagenomic datasets takes mere minutes on standard workstations, paving the way for extensive strain-level metagenomic analyses.

AVAILABILITY AND IMPLEMENTATION

Floria is available at https://github.com/bluenote-1577/floria, and the Floria-PL pipeline is available at https://github.com/jsgounot/Floria_analysis_workflow along with code for reproducing the benchmarks.

摘要

摘要

shotgun 宏基因组学允许直接分析微生物群落遗传学,但从微生物组中恢复细菌菌株基因组的可扩展计算方法仍然是一个关键挑战。我们介绍了 Floria,这是一种基于最小错误纠正(MEC)读聚类和菌株保留网络流模型的从短读和长读宏基因组测序数据中快速准确恢复菌株单倍型的新方法。Floria 可以作为一种独立的单倍型分析方法,输出在同一菌株上共同出现的等位基因和读取,也可以作为一个端到端的从读取到组装的流水线(Floria-PL)用于菌株水平的组装。在合成宏基因组上的基准评估表明,Floria 比基线组装方法(Strainberry)快 3 倍以上,恢复了 21%的菌株内容,而在仅需要相位时速度快一个数量级以上。将 Floria 应用于 109 个深度测序的纳米孔宏基因组,平均每个样本用时不到 20 分钟,并鉴定出了一些具有一致菌株异质性的物种。将 Floria 的短读单倍型分析应用于纵向肠道宏基因组数据集,揭示了一个动态的多菌株 Anaerostipes hadrus 群落,在 636 天内频繁出现菌株丢失和出现事件。使用 Floria,标准工作站上只需几分钟即可准确进行宏基因组数据集的单倍型分析,为广泛的菌株水平宏基因组分析铺平了道路。

可用性和实现

Floria 可在 https://github.com/bluenote-1577/floria 上获得,Floria-PL 流水线可在 https://github.com/jsgounot/Floria_analysis_workflow 上获得,以及用于重现基准的代码。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/ed7dc28db574/btae252f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/a0f46779583b/btae252f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/ae0b3bac9ca8/btae252f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/5e2d3d1e2f45/btae252f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/ed7dc28db574/btae252f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/a0f46779583b/btae252f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/ae0b3bac9ca8/btae252f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/5e2d3d1e2f45/btae252f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1762/11211831/ed7dc28db574/btae252f4.jpg

相似文献

1
Floria: fast and accurate strain haplotyping in metagenomes.弗洛里亚:宏基因组中快速准确的菌株单倍型分型。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i30-i38. doi: 10.1093/bioinformatics/btae252.
2
Strainberry: automated strain separation in low-complexity metagenomes using long reads.Strainberry:使用长读长进行低复杂度宏基因组中菌株的自动分离。
Nat Commun. 2021 Jul 23;12(1):4485. doi: 10.1038/s41467-021-24515-9.
3
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
4
Hybrid metagenomic assembly enables high-resolution analysis of resistance determinants and mobile elements in human microbiomes.混合宏基因组组装可实现人类微生物组中抗性决定因子和移动元件的高分辨率分析。
Nat Biotechnol. 2019 Aug;37(8):937-944. doi: 10.1038/s41587-019-0191-2. Epub 2019 Jul 29.
5
ViraPipe: scalable parallel pipeline for viral metagenome analysis from next generation sequencing reads.ViraPipe:用于从下一代测序读取中进行病毒宏基因组分析的可扩展并行管道。
Bioinformatics. 2018 Mar 15;34(6):928-935. doi: 10.1093/bioinformatics/btx702.
6
Advancing metagenome-assembled genome-based pathogen identification: unraveling the power of long-read assembly algorithms in Oxford Nanopore sequencing.推进宏基因组组装基因组为基础的病原体鉴定:揭示长读长组装算法在牛津纳米孔测序中的强大功能。
Microbiol Spectr. 2024 Jun 4;12(6):e0011724. doi: 10.1128/spectrum.00117-24. Epub 2024 Apr 30.
7
Benchmarking short-, long- and hybrid-read assemblers for metagenome sequencing of complex microbial communities.对用于复杂微生物群落宏基因组测序的短读长、长读长和混合读长组装器进行基准测试。
Microbiology (Reading). 2024 Jun;170(6). doi: 10.1099/mic.0.001469.
8
A comprehensive investigation of metagenome assembly by linked-read sequencing.基于链接读取测序的宏基因组组装综合研究。
Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.
9
Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.评估时间序列饮用水宏基因组的组装和分类策略。
Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.
10
Benchmarking genome assembly methods on metagenomic sequencing data.基于宏基因组测序数据对基因组组装方法进行基准测试。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad087.

引用本文的文献

1
Overcoming challenges in metagenomic AMR surveillance with nanopore sequencing: a case study on fluoroquinolone resistance.利用纳米孔测序克服宏基因组抗菌药物耐药性监测中的挑战:氟喹诺酮耐药性案例研究
Front Microbiol. 2025 Jul 23;16:1614301. doi: 10.3389/fmicb.2025.1614301. eCollection 2025.

本文引用的文献

1
High-quality metagenome assembly from long accurate reads with metaMDBG.使用 metaMDBG 从长而准确的读取中进行高质量的宏基因组组装。
Nat Biotechnol. 2024 Sep;42(9):1378-1383. doi: 10.1038/s41587-023-01983-6. Epub 2024 Jan 2.
2
Fast and robust metagenomic sequence comparison through sparse chaining with skani.通过使用 skani 进行稀疏链接实现快速稳健的宏基因组序列比较。
Nat Methods. 2023 Nov;20(11):1661-1665. doi: 10.1038/s41592-023-02018-3. Epub 2023 Sep 21.
3
Metabolic independence drives gut microbial colonization and resilience in health and disease.
代谢独立性驱动肠道微生物定植和在健康与疾病中的恢复力。
Genome Biol. 2023 Apr 17;24(1):78. doi: 10.1186/s13059-023-02924-x.
4
Comparison of transformations for single-cell RNA-seq data.单细胞 RNA-seq 数据转换方法比较。
Nat Methods. 2023 May;20(5):665-672. doi: 10.1038/s41592-023-01814-1. Epub 2023 Apr 10.
5
Genome-centric analysis of short and long read metagenomes reveals uncharacterized microbiome diversity in Southeast Asians.基于基因组的短读长宏基因组分析揭示了东南亚人群中未被描述的微生物组多样性。
Nat Commun. 2022 Oct 13;13(1):6044. doi: 10.1038/s41467-022-33782-z.
6
Targeted suppression of human IBD-associated gut microbiota commensals by phage consortia for treatment of intestinal inflammation.靶向抑制人类 IBD 相关肠道微生物共生体的噬菌体组合治疗肠道炎症。
Cell. 2022 Aug 4;185(16):2879-2898.e24. doi: 10.1016/j.cell.2022.07.003.
7
Oxford Nanopore R10.4 long-read sequencing enables the generation of near-finished bacterial genomes from pure cultures and metagenomes without short-read or reference polishing.牛津纳米孔 R10.4 长读测序能够从纯培养物和宏基因组中生成近乎完成的细菌基因组,而无需进行短读测序或参考序列优化。
Nat Methods. 2022 Jul;19(7):823-826. doi: 10.1038/s41592-022-01539-7. Epub 2022 Jul 4.
8
StrainXpress: strain aware metagenome assembly from short reads.StrainXpress:基于短读长的菌株感知宏基因组组装。
Nucleic Acids Res. 2022 Sep 23;50(17):e101. doi: 10.1093/nar/gkac543.
9
Metagenome assembly of high-fidelity long reads with hifiasm-meta.利用 hifiasm-meta 进行高保真长读长的宏基因组组装。
Nat Methods. 2022 Jun;19(6):671-674. doi: 10.1038/s41592-022-01478-3. Epub 2022 May 9.
10
StrainGE: a toolkit to track and characterize low-abundance strains in complex microbial communities.StrainGE:一个用于追踪和描述复杂微生物群落中低丰度菌株的工具包。
Genome Biol. 2022 Mar 7;23(1):74. doi: 10.1186/s13059-022-02630-0.