Individual genome assembly from complex community short-read metagenomic datasets.

Affiliation

Center for Bioinformatics and Computational Genomics and School of Biology, Georgia Institute of Technology, Atlanta, GA 30332-0512, USA.

Publication information

ISME J. 2012 Apr;6(4):898-901. doi: 10.1038/ismej.2011.147. Epub 2011 Oct 27.

DOI:10.1038/ismej.2011.147
PMID:22030673
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3309356/

Abstract

Assembling individual genomes from complex community metagenomic data remains a challenging issue for environmental studies. We evaluated the quality of genome assemblies from community short read data (Illumina 100 bp pair-ended sequences) using datasets recovered from freshwater and soil microbial communities as well as in silico simulations. Our analyses revealed that the genome of a single genotype (or species) can be accurately assembled from a complex metagenome when it shows at least about 20 × coverage. At lower coverage, however, the derived assemblies contained a substantial fraction of non-target sequences (chimeras), which explains, at least in part, the higher number of hypothetical genes recovered in metagenomic relative to genomic projects. We also provide examples of how to detect intrapopulation structure in metagenomic datasets and estimate the type and frequency of errors in assembled genes and contigs from datasets of varied species complexity.
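
The roughly 20× threshold reported in the abstract can be turned into a back-of-the-envelope sequencing estimate. The sketch below is not taken from the paper: it assumes the standard mean-coverage relation (coverage = reads × read length / genome size), and the function names, the 5 Mbp genome size, and the 1% read fraction are hypothetical values chosen only for illustration.

```python
import math


def mean_coverage(num_reads: int, read_length: int, genome_size: int) -> float:
    """Average depth of coverage: total sequenced bases divided by genome size."""
    return num_reads * read_length / genome_size


def reads_needed(target_coverage: float, read_length: int,
                 genome_size: int, read_fraction: float) -> int:
    """Total community reads needed so that the target genome reaches
    `target_coverage`, assuming `read_fraction` of all reads come from it
    (an illustrative simplification, not a quantity defined in the paper)."""
    if not 0 < read_fraction <= 1:
        raise ValueError("read_fraction must be in (0, 1]")
    target_reads = target_coverage * genome_size / read_length
    return math.ceil(target_reads / read_fraction)


if __name__ == "__main__":
    # Hypothetical 5 Mbp genome, 100 bp reads, target genome = 1% of reads.
    total = reads_needed(target_coverage=20, read_length=100,
                         genome_size=5_000_000, read_fraction=0.01)
    print(f"Total reads required for ~20x coverage: {total:,}")
```

The estimate ignores uneven coverage, sequencing error, and closely related co-occurring genotypes, all of which the abstract identifies as sources of assembly error, so it should be read as a lower bound rather than a guarantee of assembly quality.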

Similar articles

1
Individual genome assembly from complex community short-read metagenomic datasets.
ISME J. 2012 Apr;6(4):898-901. doi: 10.1038/ismej.2011.147. Epub 2011 Oct 27.
2
Fragmentation and Coverage Variation in Viral Metagenome Assemblies, and Their Effect in Diversity Calculations.
Front Bioeng Biotechnol. 2015 Sep 17;3:141. doi: 10.3389/fbioe.2015.00141. eCollection 2015.
3
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.
4
Benchmarking genome assembly methods on metagenomic sequencing data.
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad087.
5
Intestinal microbiota domination under extreme selective pressures characterized by metagenomic read cloud sequencing and assembly.
BMC Bioinformatics. 2019 Dec 2;20(Suppl 16):585. doi: 10.1186/s12859-019-3073-1.
6
A comprehensive investigation of metagenome assembly by linked-read sequencing.
Microbiome. 2020 Nov 11;8(1):156. doi: 10.1186/s40168-020-00929-3.
7
Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.
Brief Bioinform. 2019 Jul 19;20(4):1140-1150. doi: 10.1093/bib/bbx098.
8
Direct comparisons of Illumina vs. Roche 454 sequencing technologies on the same microbial community DNA sample.
PLoS One. 2012;7(2):e30087. doi: 10.1371/journal.pone.0030087. Epub 2012 Feb 10.
9
Nonpareil: a redundancy-based approach to assess the level of coverage in metagenomic datasets.
Bioinformatics. 2014 Mar 1;30(5):629-35. doi: 10.1093/bioinformatics/btt584. Epub 2013 Oct 11.
10
Performance Characteristics of Next-Generation Sequencing for the Detection of Antimicrobial Resistance Determinants in Escherichia coli Genomes and Metagenomes.
mSystems. 2022 Jun 28;7(3):e0002222. doi: 10.1128/msystems.00022-22. Epub 2022 Jun 1.

Cited by

1
Genome Mining Reveals Rifamycin Biosynthesis in a Taklamakan Desert Actinomycete.
Microorganisms. 2025 May 3;13(5):1068. doi: 10.3390/microorganisms13051068.
2
Decomposing a San Francisco estuary microbiome using long-read metagenomics reveals species- and strain-level dominance from picoeukaryotes to viruses.
mSystems. 2024 Sep 17;9(9):e0024224. doi: 10.1128/msystems.00242-24. Epub 2024 Aug 19.
3
Evaluating and improving the representation of bacterial contents in long-read metagenome assemblies.
Genome Biol. 2024 Apr 11;25(1):92. doi: 10.1186/s13059-024-03234-6.
4
Comparison of metagenomic and traditional methods for diagnosis of enteric infections.
mBio. 2024 Apr 10;15(4):e0342223. doi: 10.1128/mbio.03422-23. Epub 2024 Mar 15.
5
Seasonal microbial dynamics in the ocean inferred from assembled and unassembled data: a view on the unknown biosphere.
ISME Commun. 2022 Sep 21;2(1):87. doi: 10.1038/s43705-022-00167-8.
6
A metagenomic catalog for exploring the plastizymes landscape covering taxa, genes, and proteins.
Sci Rep. 2023 Sep 25;13(1):16029. doi: 10.1038/s41598-023-43042-9.
7
Recovery of metagenome-assembled genomes from the phyllosphere of 110 rice genotypes.
Sci Data. 2022 Jun 1;9(1):254. doi: 10.1038/s41597-022-01320-7.
8
Metagenomic tracking of antibiotic resistance genes through a pre-harvest vegetable production system: an integrated lab-, microcosm- and greenhouse-scale analysis.
Environ Microbiol. 2022 Aug;24(8):3705-3721. doi: 10.1111/1462-2920.16022. Epub 2022 May 18.
9
Considerations for constructing a protein sequence database for metaproteomics.
Comput Struct Biotechnol J. 2022 Jan 21;20:937-952. doi: 10.1016/j.csbj.2022.01.018. eCollection 2022.
10
Genome-resolved metagenomics using environmental and clinical samples.
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab030.
