• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用组装图通过重叠分箱改进宏基因组分箱结果。

Improving metagenomic binning results with overlapped bins using assembly graphs.

作者信息

Mallawaarachchi Vijini G, Wickramarachchi Anuradha S, Lin Yu

机构信息

School of Computing, College of Engineering and Computer Science, Australian National University, Canberra, Australia.

出版信息

Algorithms Mol Biol. 2021 May 4;16(1):3. doi: 10.1186/s13015-021-00185-6.

DOI:10.1186/s13015-021-00185-6
PMID:33947431
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8097841/
Abstract

BACKGROUND

Metagenomic sequencing allows us to study the structure, diversity and ecology in microbial communities without the necessity of obtaining pure cultures. In many metagenomics studies, the reads obtained from metagenomics sequencing are first assembled into longer contigs and these contigs are then binned into clusters of contigs where contigs in a cluster are expected to come from the same species. As different species may share common sequences in their genomes, one assembled contig may belong to multiple species. However, existing tools for binning contigs only support non-overlapped binning, i.e., each contig is assigned to at most one bin (species).

RESULTS

In this paper, we introduce GraphBin2 which refines the binning results obtained from existing tools and, more importantly, is able to assign contigs to multiple bins. GraphBin2 uses the connectivity and coverage information from assembly graphs to adjust existing binning results on contigs and to infer contigs shared by multiple species. Experimental results on both simulated and real datasets demonstrate that GraphBin2 not only improves binning results of existing tools but also supports to assign contigs to multiple bins.

CONCLUSION

GraphBin2 incorporates the coverage information into the assembly graph to refine the binning results obtained from existing binning tools. GraphBin2 also enables the detection of contigs that may belong to multiple species. We show that GraphBin2 outperforms its predecessor GraphBin on both simulated and real datasets. GraphBin2 is freely available at https://github.com/Vini2/GraphBin2 .

摘要

背景

宏基因组测序使我们能够研究微生物群落的结构、多样性和生态,而无需获得纯培养物。在许多宏基因组学研究中,从宏基因组测序获得的读段首先被组装成更长的重叠群,然后这些重叠群被分类到重叠群簇中,其中一个簇中的重叠群预计来自同一物种。由于不同物种在其基因组中可能共享共同序列,一个组装的重叠群可能属于多个物种。然而,现有的重叠群分类工具仅支持非重叠分类,即每个重叠群最多被分配到一个分类(物种)。

结果

在本文中,我们介绍了GraphBin2,它改进了从现有工具获得的分类结果,更重要的是,能够将重叠群分配到多个分类中。GraphBin2利用组装图中的连通性和覆盖信息来调整现有的重叠群分类结果,并推断多个物种共享的重叠群。在模拟和真实数据集上的实验结果表明,GraphBin2不仅改进了现有工具的分类结果,还支持将重叠群分配到多个分类中。

结论

GraphBin2将覆盖信息纳入组装图,以改进从现有分类工具获得的分类结果。GraphBin2还能够检测可能属于多个物种的重叠群。我们表明,GraphBin2在模拟和真实数据集上均优于其前身GraphBin。GraphBin2可在https://github.com/Vini2/GraphBin2上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/00f1c3d9df83/13015_2021_185_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/46f6f4a8ec89/13015_2021_185_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/ba620faf6ed2/13015_2021_185_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/97f7532b88b8/13015_2021_185_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/4245d6cb1a19/13015_2021_185_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/8b0657b9183c/13015_2021_185_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/acc5da7a95c2/13015_2021_185_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/04191890da71/13015_2021_185_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/0f5d451edb0e/13015_2021_185_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/d60ce2d4b421/13015_2021_185_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/9224ac2e89ae/13015_2021_185_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/a42603f687d1/13015_2021_185_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/00f1c3d9df83/13015_2021_185_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/46f6f4a8ec89/13015_2021_185_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/ba620faf6ed2/13015_2021_185_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/97f7532b88b8/13015_2021_185_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/4245d6cb1a19/13015_2021_185_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/8b0657b9183c/13015_2021_185_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/acc5da7a95c2/13015_2021_185_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/04191890da71/13015_2021_185_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/0f5d451edb0e/13015_2021_185_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/d60ce2d4b421/13015_2021_185_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/9224ac2e89ae/13015_2021_185_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/a42603f687d1/13015_2021_185_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/822d/8097841/00f1c3d9df83/13015_2021_185_Fig12_HTML.jpg

相似文献

1
Improving metagenomic binning results with overlapped bins using assembly graphs.利用组装图通过重叠分箱改进宏基因组分箱结果。
Algorithms Mol Biol. 2021 May 4;16(1):3. doi: 10.1186/s13015-021-00185-6.
2
GraphBin: refined binning of metagenomic contigs using assembly graphs.GraphBin:使用组装图对宏基因组序列进行精细化分箱。
Bioinformatics. 2020 Jun 1;36(11):3307-3313. doi: 10.1093/bioinformatics/btaa180.
3
Accurate Binning of Metagenomic Contigs Using Composition, Coverage, and Assembly Graphs.基于组成、覆盖度和组装图对宏基因组序列进行精确分箱。
J Comput Biol. 2022 Dec;29(12):1357-1376. doi: 10.1089/cmb.2022.0262. Epub 2022 Nov 11.
4
METAMVGL: a multi-view graph-based metagenomic contig binning algorithm by integrating assembly and paired-end graphs.METAMVGL:一种基于多视图图的宏基因组序列拼接 bin 算法,通过整合组装图和配对末端图。
BMC Bioinformatics. 2021 Jul 22;22(Suppl 10):378. doi: 10.1186/s12859-021-04284-4.
5
Improving contig binning of metagenomic data using [Formula: see text] oligonucleotide frequency dissimilarity.使用[公式:见正文]寡核苷酸频率差异改进宏基因组数据的重叠群分箱
BMC Bioinformatics. 2017 Sep 20;18(1):425. doi: 10.1186/s12859-017-1835-1.
6
Binning long reads in metagenomics datasets using composition and coverage information.利用组成和覆盖信息对宏基因组学数据集中的长读段进行分箱。
Algorithms Mol Biol. 2022 Jul 11;17(1):14. doi: 10.1186/s13015-022-00221-z.
7
Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins.罗盘箱:利用支架提高宏基因组分箱的连续性和质量
Front Microbiol. 2021 Feb 24;12:638561. doi: 10.3389/fmicb.2021.638561. eCollection 2021.
8
HiFine: integrating Hi-C-based and shotgun-based methods to refine binning of metagenomic contigs.HiFine:整合基于 Hi-C 和 shotgun 的方法来优化宏基因组 contigs 的 bin 划分。
Bioinformatics. 2022 May 26;38(11):2973-2979. doi: 10.1093/bioinformatics/btac295.
9
CoMet: a workflow using contig coverage and composition for binning a metagenomic sample with high precision.CoMet:一种使用 contig 覆盖度和组成进行宏基因组样本高精度分箱的工作流程。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):571. doi: 10.1186/s12859-017-1967-3.
10
MetaCluster-TA: taxonomic annotation for metagenomic data based on assembly-assisted binning.MetaCluster-TA:基于组装辅助分箱的宏基因组数据分类注释。
BMC Genomics. 2014;15 Suppl 1(Suppl 1):S12. doi: 10.1186/1471-2164-15-S1-S12. Epub 2014 Jan 24.

引用本文的文献

1
Landscape of mobile genetic elements and their functional cargo across the gastrointestinal tract microbiomes in ruminants.反刍动物胃肠道微生物群中可移动遗传元件及其功能性负载的概况。
Microbiome. 2025 Jul 12;13(1):162. doi: 10.1186/s40168-025-02139-1.
2
Impact of microbiological molecular methodologies on adaptive sampling using nanopore sequencing in metagenomic studies.微生物分子方法对宏基因组研究中使用纳米孔测序进行适应性采样的影响。
Environ Microbiome. 2025 May 5;20(1):47. doi: 10.1186/s40793-025-00704-7.
3
Solving genomic puzzles: computational methods for metagenomic binning.

本文引用的文献

1
metaFlye: scalable long-read metagenome assembly using repeat graphs.metaFlye:使用重复图进行可扩展的长读长宏基因组组装。
Nat Methods. 2020 Nov;17(11):1103-1110. doi: 10.1038/s41592-020-00971-x. Epub 2020 Oct 5.
2
MetaBCC-LR: metagenomics binning by coverage and composition for long reads.MetaBCC-LR:基于覆盖度和组成的长读长宏基因组 bin 划分。
Bioinformatics. 2020 Jul 1;36(Suppl_1):i3-i11. doi: 10.1093/bioinformatics/btaa441.
3
GraphBin: refined binning of metagenomic contigs using assembly graphs.GraphBin:使用组装图对宏基因组序列进行精细化分箱。
解决基因组难题:宏基因组 binning 的计算方法。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae372.
4
PlasmidEC and gplas2: an optimized short-read approach to predict and reconstruct antibiotic resistance plasmids in .质粒EC和gplas2:一种优化的短读长方法,用于预测和重建……中的抗生素抗性质粒
Microb Genom. 2024 Feb;10(2). doi: 10.1099/mgen.0.001193.
5
Phables: from fragmented assemblies to high-quality bacteriophage genomes.噬菌体:从碎片化组装到高质量噬菌体基因组。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad586.
6
Host interactions of novel species belonging to multiple families infecting bacterial host, WH2.感染细菌宿主 WH2 的多种科属新型物种的宿主相互作用。
Microb Genom. 2023 Sep;9(9). doi: 10.1099/mgen.0.001100.
7
Phables: from fragmented assemblies to high-quality bacteriophage genomes.噬菌体组装拼接软件(Phables):从片段化组装到高质量噬菌体基因组
bioRxiv. 2023 Sep 11:2023.04.04.535632. doi: 10.1101/2023.04.04.535632.
8
Unitig level assembly graph based metagenome-assembled genome refiner (UGMAGrefiner): A tool to increase completeness and resolution of metagenome-assembled genomes.基于单条重叠群水平组装图的宏基因组组装基因组优化器(UGMAGrefiner):一种提高宏基因组组装基因组完整性和分辨率的工具。
Comput Struct Biotechnol J. 2023 Mar 21;21:2394-2404. doi: 10.1016/j.csbj.2023.03.030. eCollection 2023.
9
Massively parallel single-cell genomics of microbiomes in rice paddies.稻田微生物群落的大规模平行单细胞基因组学
Front Microbiol. 2022 Nov 3;13:1024640. doi: 10.3389/fmicb.2022.1024640. eCollection 2022.
10
Constructing metagenome-assembled genomes for almost all components in a real bacterial consortium for binning benchmarking.为真实细菌群落中的几乎所有组件构建宏基因组组装基因组,用于分箱基准测试。
BMC Genomics. 2022 Nov 10;23(1):746. doi: 10.1186/s12864-022-08967-x.
Bioinformatics. 2020 Jun 1;36(11):3307-3313. doi: 10.1093/bioinformatics/btaa180.
4
MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies.MetaBAT 2:一种用于从宏基因组组装中进行稳健且高效的基因组重建的自适应分箱算法。
PeerJ. 2019 Jul 26;7:e7359. doi: 10.7717/peerj.7359. eCollection 2019.
5
SolidBin: improving metagenome binning with semi-supervised normalized cut.SolidBin:利用半监督归一化割提高宏基因组 bin 划分。
Bioinformatics. 2019 Nov 1;35(21):4229-4238. doi: 10.1093/bioinformatics/btz253.
6
Assembly of long, error-prone reads using repeat graphs.使用重复图组装长的、易错的读取。
Nat Biotechnol. 2019 May;37(5):540-546. doi: 10.1038/s41587-019-0072-8. Epub 2019 Apr 1.
7
Hidden in plain sight-highly abundant and diverse planktonic freshwater Chloroflexi.隐藏在众目睽睽之下——高度丰富多样的浮游淡水绿屈挠菌。
Microbiome. 2018 Oct 2;6(1):176. doi: 10.1186/s40168-018-0563-8.
8
Simulating Illumina metagenomic data with InSilicoSeq.用 InSilicoSeq 模拟 Illumina 宏基因组数据。
Bioinformatics. 2019 Feb 1;35(3):521-522. doi: 10.1093/bioinformatics/bty630.
9
Metagenomic binning through low-density hashing.基于低密度哈希的宏基因组 bin 划分。
Bioinformatics. 2019 Jan 15;35(2):219-226. doi: 10.1093/bioinformatics/bty611.
10
BMC3C: binning metagenomic contigs using codon usage, sequence composition and read coverage.BMC3C:基于密码子使用、序列组成和读段覆盖度对宏基因组 contigs 进行分箱。
Bioinformatics. 2018 Dec 15;34(24):4172-4179. doi: 10.1093/bioinformatics/bty519.