The Reliability of Metagenome-Assembled Genomes (MAGs) in Representing Natural Populations: Insights from Comparing MAGs against Isolate Genomes Derived from the Same Fecal Sample.

Affiliations

School of Civil and Environmental Engineering, Georgia Institute of Technology, Atlanta, Georgia, USA.

Department of Microbiology, University of Innsbruck, Innsbruck, Tyrol, Austria.

Publication information

Appl Environ Microbiol. 2021 Feb 26;87(6). doi: 10.1128/AEM.02593-20.

DOI:10.1128/AEM.02593-20
PMID:33452027
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8105024/
Abstract

The recovery of metagenome-assembled genomes (MAGs) from metagenomic data has recently become a common task for microbial studies. The strengths and limitations of the underlying bioinformatics algorithms are well appreciated by now based on performance tests with mock data sets of known composition. However, these mock data sets do not capture the complexity and diversity often observed within natural populations, since their construction typically relies on only a single genome of a given organism. Further, it remains unclear if MAGs can recover population-variable genes (those shared by >10% but <90% of the members of the population) as efficiently as core genes (those shared by >90% of the members). To address these issues, we compared the gene variabilities of pathogenic isolates from eight diarrheal samples, for which the isolate was the causative agent, against their corresponding MAGs recovered from the companion metagenomic data set. Our analysis revealed that MAGs with completeness estimates near 95% captured only 77% of the population core genes and 50% of the variable genes, on average. Further, about 5% of the genes of these MAGs were conservatively identified as missing in the isolate and were of different (non-) taxonomic origin, suggesting errors at the genome-binning step, even though contamination estimates based on commonly used pipelines were only 1.5%. Therefore, the quality of MAGs may often be worse than estimated, and we offer examples of how to recognize and improve such MAGs to sufficient quality by (for instance) employing only contigs longer than 1,000 bp for binning.

Metagenome assembly and the recovery of metagenome-assembled genomes (MAGs) have recently become common tasks for microbiome studies across environmental and clinical settings. However, the extent to which MAGs can capture the genes of the population they represent remains speculative. Current approaches to evaluating MAG quality are limited to the recovery and copy number of universal housekeeping genes, which represent a small fraction of the total genome, leaving the majority of the genome essentially inaccessible. If MAG quality in reality is lower than these approaches would estimate, this could have dramatic consequences for all downstream analyses and interpretations. In this study, we evaluated this issue using an approach that employed comparisons of the gene contents of MAGs to the gene contents of isolate genomes derived from the same sample. Further, our samples originated from a diarrhea case-control study, and thus, our results are relevant for recovering the virulence factors of pathogens from metagenomic data sets.
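
The core/variable definitions and the recovery percentages in the abstract can be illustrated with a small Python sketch. It is not the authors' pipeline, which the abstract does not describe. It assumes each population member is already reduced to a set of gene-family identifiers (the strings below are hypothetical), and the function names `classify_genes`, `recovered_fraction`, and `long_contigs` are introduced here purely for illustration.

```python
from collections import Counter


def classify_genes(member_gene_sets, core_min=0.9, variable_min=0.1):
    """Split gene families into core (>90% of members) and
    variable (>10% but <90% of members) sets, per the abstract's thresholds.
    Families at exactly 10% or 90% prevalence fall in neither set."""
    n = len(member_gene_sets)
    counts = Counter(g for genes in member_gene_sets for g in set(genes))
    core, variable = set(), set()
    for gene, count in counts.items():
        prevalence = count / n
        if prevalence > core_min:
            core.add(gene)
        elif variable_min < prevalence < core_min:
            variable.add(gene)
    return core, variable


def recovered_fraction(mag_genes, reference_genes):
    """Fraction of the reference gene families that the MAG contains."""
    if not reference_genes:
        return float("nan")
    return len(set(mag_genes) & set(reference_genes)) / len(reference_genes)


def long_contigs(contigs, min_length=1000):
    """Keep only contigs longer than min_length bp before binning.
    `contigs` maps a (hypothetical) contig name to its sequence."""
    return {name: seq for name, seq in contigs.items() if len(seq) > min_length}


# Toy population of four members (hypothetical gene-family IDs).
members = [{"a", "b", "c"}, {"a", "b", "d"}, {"a", "b", "c"}, {"a", "c"}]
core, variable = classify_genes(members)

# Toy MAG: recovers the core gene, one variable gene, and one foreign gene.
mag = {"a", "b", "x"}
core_recovery = recovered_fraction(mag, core)
variable_recovery = recovered_fraction(mag, variable)
foreign = mag - set().union(*members)  # genes absent from every member

kept = long_contigs({"c1": "A" * 1500, "c2": "A" * 800})
```

In this toy case the MAG recovers the single core gene but only one of three variable genes, and the gene `x` found in no member plays the role of a possible binning error. Real analyses would derive gene families from annotated genomes and would need a sequence-similarity criterion for deciding when two genes match, which this sketch leaves out.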

