• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

优化和评估宏基因组组装微生物基因组的重建。

Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.

机构信息

Bioinformatics and Medical Informatics, San Diego State University, San Diego, California, USA.

National Center for Genome Analysis Support, Indiana University, Bloomington, Indiana, USA.

出版信息

BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.

DOI:10.1186/s12864-017-4294-1
PMID:29183281
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5706307/
Abstract

BACKGROUND

Microbiome/host interactions describe characteristics that affect the host's health. Shotgun metagenomics includes sequencing a random subset of the microbiome to analyze its taxonomic and metabolic potential. Reconstruction of DNA fragments into genomes from metagenomes (called metagenome-assembled genomes) assigns unknown fragments to taxa/function and facilitates discovery of novel organisms. Genome reconstruction incorporates sequence assembly and sorting of assembled sequences into bins, characteristic of a genome. However, the microbial community composition, including taxonomic and phylogenetic diversity may influence genome reconstruction. We determine the optimal reconstruction method for four microbiome projects that had variable sequencing platforms (IonTorrent and Illumina), diversity (high or low), and environment (coral reefs and kelp forests), using a set of parameters to select for optimal assembly and binning tools.

METHODS

We tested the effects of the assembly and binning processes on population genome reconstruction using 105 marine metagenomes from 4 projects. Reconstructed genomes were obtained from each project using 3 assemblers (IDBA, MetaVelvet, and SPAdes) and 2 binning tools (GroopM and MetaBat). We assessed the efficiency of assemblers using statistics that including contig continuity and contig chimerism and the effectiveness of binning tools using genome completeness and taxonomic identification.

RESULTS

We concluded that SPAdes, assembled more contigs (143,718 ± 124 contigs) of longer length (N50 = 1632 ± 108 bp), and incorporated the most sequences (sequences-assembled = 19.65%). The microbial richness and evenness were maintained across the assembly, suggesting low contig chimeras. SPAdes assembly was responsive to the biological and technological variations within the project, compared with other assemblers. Among binning tools, we conclude that MetaBat produced bins with less variation in GC content (average standard deviation: 1.49), low species richness (4.91 ± 0.66), and higher genome completeness (40.92 ± 1.75) across all projects. MetaBat extracted 115 bins from the 4 projects of which 66 bins were identified as reconstructed metagenome-assembled genomes with sequences belonging to a specific genus. We identified 13 novel genomes, some of which were 100% complete, but show low similarity to genomes within databases.

CONCLUSIONS

In conclusion, we present a set of biologically relevant parameters for evaluation to select for optimal assembly and binning tools. For the tools we tested, SPAdes assembler and MetaBat binning tools reconstructed quality metagenome-assembled genomes for the four projects. We also conclude that metagenomes from microbial communities that have high coverage of phylogenetically distinct, and low taxonomic diversity results in highest quality metagenome-assembled genomes.

摘要

背景

微生物组/宿主相互作用描述了影响宿主健康的特征。 shotgun 宏基因组学包括对微生物组的随机子集进行测序,以分析其分类和代谢潜力。从宏基因组中重建 DNA 片段成基因组(称为宏基因组组装基因组)将未知片段分配给分类/功能,并有助于发现新的生物体。基因组重建包括序列组装和将组装的序列分类到基因组特征的bins 中。然而,微生物群落组成,包括分类和系统发育多样性,可能会影响基因组重建。我们使用一组参数来选择最佳的组装和 binning 工具,针对具有不同测序平台(IonTorrent 和 Illumina)、多样性(高或低)和环境(珊瑚礁和海带林)的四个微生物组项目,确定了最佳的重建方法。

方法

我们使用来自 4 个项目的 105 个海洋宏基因组,测试了组装和 binning 过程对种群基因组重建的影响。使用 3 个组装器(IDBA、MetaVelvet 和 SPAdes)和 2 个 binning 工具(GroopM 和 MetaBat)从每个项目中获得重建的基因组。我们使用包括连续体和连续体嵌合体的统计数据评估组装器的效率,以及使用基因组完整性和分类鉴定评估 binning 工具的有效性。

结果

我们得出结论,SPAdes 组装了更多的 contigs(143718 ± 124 contigs),长度更长(N50 = 1632 ± 108 bp),并整合了更多的序列(sequences-assembled = 19.65%)。微生物丰富度和均匀度在整个组装过程中保持不变,表明嵌合体数量较少。与其他组装器相比,SPAdes 组装器对项目内的生物学和技术变化具有响应性。在 binning 工具方面,我们得出结论,MetaBat 生成的 bins 在 GC 含量(平均标准偏差:1.49)、物种丰富度(4.91 ± 0.66)和基因组完整性(40.92 ± 1.75)方面变化较小。MetaBat 从 4 个项目中提取了 115 个 bins,其中 66 个被确定为具有特定属序列的重建宏基因组组装基因组。我们鉴定了 13 个新的基因组,其中一些是 100%完整的,但与数据库中的基因组相似度较低。

结论

总之,我们提出了一组具有生物学意义的参数来评估和选择最佳的组装和 binning 工具。对于我们测试的工具,SPAdes 组装器和 MetaBat binning 工具为这四个项目重建了高质量的宏基因组组装基因组。我们还得出结论,来自具有高覆盖率的分类学上不同的和低分类多样性的微生物群落的宏基因组,产生了最高质量的宏基因组组装基因组。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/bab3c1a72da9/12864_2017_4294_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/cb2342ec37a2/12864_2017_4294_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/98bd968d7297/12864_2017_4294_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/d3712e85dda3/12864_2017_4294_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/7c7f8e37e3cf/12864_2017_4294_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/bab3c1a72da9/12864_2017_4294_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/cb2342ec37a2/12864_2017_4294_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/98bd968d7297/12864_2017_4294_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/d3712e85dda3/12864_2017_4294_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/7c7f8e37e3cf/12864_2017_4294_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e019/5706307/bab3c1a72da9/12864_2017_4294_Fig5_HTML.jpg

相似文献

1
Optimizing and evaluating the reconstruction of Metagenome-assembled microbial genomes.优化和评估宏基因组组装微生物基因组的重建。
BMC Genomics. 2017 Nov 28;18(1):915. doi: 10.1186/s12864-017-4294-1.
2
Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.评估宏基因组工具在真实宏基因组数据集和 CAMI 数据集上的基因组 binning 效果。
BMC Bioinformatics. 2020 Jul 28;21(1):334. doi: 10.1186/s12859-020-03667-3.
3
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.病毒宏基因组组装中的碎片化和覆盖度变化,及其对多样性计算的影响。
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
4
Accurate binning of metagenomic contigs via automated clustering sequences using information of genomic signatures and marker genes.通过利用基因组特征和标记基因信息对序列进行自动聚类,实现宏基因组重叠群的精确分类。
Sci Rep. 2016 Apr 12;6:24175. doi: 10.1038/srep24175.
5
ICoVeR - an interactive visualization tool for verification and refinement of metagenomic bins.ICoVeR - 一种用于宏基因组分箱验证与优化的交互式可视化工具。
BMC Bioinformatics. 2017 May 2;18(1):233. doi: 10.1186/s12859-017-1653-5.
6
MetaBAT, an efficient tool for accurately reconstructing single genomes from complex microbial communities.MetaBAT是一种从复杂微生物群落中准确重建单个基因组的高效工具。
PeerJ. 2015 Aug 27;3:e1165. doi: 10.7717/peerj.1165. eCollection 2015.
7
Evaluating Assembly and Binning Strategies for Time Series Drinking Water Metagenomes.评估时间序列饮用水宏基因组的组装和分类策略。
Microbiol Spectr. 2021 Dec 22;9(3):e0143421. doi: 10.1128/Spectrum.01434-21. Epub 2021 Nov 3.
8
Assessment of urban microbiome assemblies with the help of targeted in silico gold standards.借助靶向计算金标准评估城市微生物组组装。
Biol Direct. 2018 Oct 12;13(1):22. doi: 10.1186/s13062-018-0225-6.
9
Evaluation of viral genome assembly and diversity estimation in deep metagenomes.深度宏基因组中病毒基因组组装及多样性估计的评估
BMC Genomics. 2014 Nov 18;15(1):989. doi: 10.1186/1471-2164-15-989.
10
Theoretical and Simulation-Based Investigation of the Relationship between Sequencing Effort, Microbial Community Richness, and Diversity in Binning Metagenome-Assembled Genomes.基于理论和模拟的分箱宏基因组组装基因组中测序工作量、微生物群落丰富度和多样性之间关系的研究
mSystems. 2019 Sep 17;4(5):e00384-19. doi: 10.1128/mSystems.00384-19.

引用本文的文献

1
A holistic genome dataset of bacteria and archaea of mangrove sediments.红树林沉积物中细菌和古菌的全基因组数据集。
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf081.
2
A Biosynthetic and Taxonomic Atlas of the Global Lichen Holobiont.全球地衣共生体的生物合成与分类图谱
Environ Microbiol. 2025 Jun;27(6):e70112. doi: 10.1111/1462-2920.70112.
3
Simulation of 69 microbial communities indicates sequencing depth and false positives are major drivers of bias in prokaryotic metagenome-assembled genome recovery.模拟 69 个微生物群落表明,测序深度和假阳性是影响原核微生物宏基因组组装基因组回收率偏差的主要因素。

本文引用的文献

1
Critical Assessment of Metagenome Interpretation-a benchmark of metagenomics software.宏基因组解读的批判性评估——宏基因组学软件的一项基准测试
Nat Methods. 2017 Nov;14(11):1063-1071. doi: 10.1038/nmeth.4458. Epub 2017 Oct 2.
2
Aura-biomes are present in the water layer above coral reef benthic macro-organisms.先兆生物群落存在于珊瑚礁底栖大型生物上方的水层中。
PeerJ. 2017 Aug 15;5:e3666. doi: 10.7717/peerj.3666. eCollection 2017.
3
Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea.
PLoS Comput Biol. 2024 Oct 22;20(10):e1012530. doi: 10.1371/journal.pcbi.1012530. eCollection 2024 Oct.
4
Biomimetic CO Capture Unlocked through Enzyme Mining: Discovery of a Highly Thermo- and Alkali-Stable Carbonic Anhydrase.通过酶挖掘解锁仿生 CO 捕获:发现一种高度热稳定和耐碱的碳酸酐酶。
Environ Sci Technol. 2024 Oct 8;58(40):17732-17742. doi: 10.1021/acs.est.4c04291. Epub 2024 Sep 23.
5
Solving genomic puzzles: computational methods for metagenomic binning.解决基因组难题:宏基因组 binning 的计算方法。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae372.
6
From germline genome to highly fragmented somatic genome: genome-wide DNA rearrangement during the sexual process in ciliated protists.从种系基因组到高度碎片化的体细胞基因组:纤毛虫有性生殖过程中的全基因组DNA重排
Mar Life Sci Technol. 2024 Feb 12;6(1):31-49. doi: 10.1007/s42995-023-00213-x. eCollection 2024 Feb.
7
MAGICIAN: MAG simulation for investigating criteria for bioinformatic analysis.魔术师:用于研究生物信息学分析标准的 MAG 模拟。
BMC Genomics. 2024 Jan 12;25(1):55. doi: 10.1186/s12864-023-09912-2.
8
Baseline metagenome-assembled genome (MAG) data of Sikkim hot springs from Indian Himalayan geothermal belt (IHGB) showcasing its potential CAZymes, and sulfur-nitrogen metabolic activity.印度喜马拉雅地热带(IHGB)中来自锡金温泉的宏基因组组装基因组(MAG)的基线数据,展示了其潜在的碳水化合物活性酶(CAZymes)和硫-氮代谢活性。
World J Microbiol Biotechnol. 2023 May 3;39(7):179. doi: 10.1007/s11274-023-03631-2.
9
Stingray epidermal microbiomes are species-specific with local adaptations.黄貂鱼的表皮微生物群具有物种特异性并呈现局部适应性。
Front Microbiol. 2023 Mar 2;14:1031711. doi: 10.3389/fmicb.2023.1031711. eCollection 2023.
10
Assembly of novel microbial genomes from gut metagenomes of rhesus macaque ().从恒河猴肠道宏基因组中组装新型微生物基因组()。
Gut Microbes. 2023 Jan-Dec;15(1):2188848. doi: 10.1080/19490976.2023.2188848.
细菌和古菌单扩增基因组(MISAG)及宏基因组组装基因组(MIMAG)的最低信息要求
Nat Biotechnol. 2017 Aug 8;35(8):725-731. doi: 10.1038/nbt.3893.
4
The skin microbiome of the common thresher shark (Alopias vulpinus) has low taxonomic and gene function β-diversity.常见长尾鲨(Alopias vulpinus)的皮肤微生物组具有较低的分类和基因功能β多样性。
Environ Microbiol Rep. 2017 Aug;9(4):357-373. doi: 10.1111/1758-2229.12537. Epub 2017 Jun 27.
5
Comparing and Evaluating Metagenome Assembly Tools from a Microbiologist's Perspective - Not Only Size Matters!从微生物学家的视角比较和评估宏基因组组装工具——重要的不只是大小!
PLoS One. 2017 Jan 18;12(1):e0169662. doi: 10.1371/journal.pone.0169662. eCollection 2017.
6
Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center.全细菌生物信息学数据库与分析资源中心PATRIC的改进。
Nucleic Acids Res. 2017 Jan 4;45(D1):D535-D542. doi: 10.1093/nar/gkw1017. Epub 2016 Nov 29.
7
Nearshore Pelagic Microbial Community Abundance Affects Recruitment Success of Giant Kelp, .近岸浮游微生物群落丰度影响巨型海带的补充成功率。
Front Microbiol. 2016 Nov 14;7:1800. doi: 10.3389/fmicb.2016.01800. eCollection 2016.
8
Reconstruction of Bacterial and Viral Genomes from Multiple Metagenomes.从多个宏基因组重建细菌和病毒基因组
Front Microbiol. 2016 Apr 12;7:469. doi: 10.3389/fmicb.2016.00469. eCollection 2016.
9
Scaling laws predict global microbial diversity.标度律预测全球微生物多样性。
Proc Natl Acad Sci U S A. 2016 May 24;113(21):5970-5. doi: 10.1073/pnas.1521291113. Epub 2016 May 2.
10
Recovering complete and draft population genomes from metagenome datasets.从宏基因组数据集中恢复完整和草图的种群基因组。
Microbiome. 2016 Mar 8;4:8. doi: 10.1186/s40168-016-0154-5.