Suppr超能文献

稀有生物界的褶皱:焦磷酸测序错误可能导致多样性估计的人为膨胀。

Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates.

机构信息

Microbial Ecology Program, DOE Joint Genome Institute, Walnut Creek, CA 94598, USA.

出版信息

Environ Microbiol. 2010 Jan;12(1):118-23. doi: 10.1111/j.1462-2920.2009.02051.x. Epub 2009 Aug 27.

Abstract

Massively parallel pyrosequencing of the small subunit (16S) ribosomal RNA gene has revealed that the extent of rare microbial populations in several environments, the 'rare biosphere', is orders of magnitude higher than previously thought. One important caveat with this method is that sequencing error could artificially inflate diversity estimates. Although the per-base error of 16S rDNA amplicon pyrosequencing has been shown to be as good as or lower than Sanger sequencing, no direct assessments of pyrosequencing errors on diversity estimates have been reported. Using only Escherichia coli MG1655 as a reference template, we find that 16S rDNA diversity is grossly overestimated unless relatively stringent read quality filtering and low clustering thresholds are applied. In particular, the common practice of removing reads with unresolved bases and anomalous read lengths is insufficient to ensure accurate estimates of microbial diversity. Furthermore, common and reproducible homopolymer length errors can result in relatively abundant spurious phylotypes further confounding data interpretation. We suggest that stringent quality-based trimming of 16S pyrotags and clustering thresholds no greater than 97% identity should be used to avoid overestimates of the rare biosphere.

摘要

大规模平行焦磷酸测序的小亚基(16S)核糖体 RNA 基因已经表明,在几个环境中的稀有微生物种群的程度,“稀有生物圈”,是数量级高于以前的想法。这种方法的一个重要警告是测序错误可能会人为地夸大多样性估计。虽然已经表明 16S rDNA 扩增子焦磷酸测序的每个碱基错误与 Sanger 测序一样好或更低,但尚未报告对多样性估计的焦磷酸测序错误的直接评估。仅使用大肠杆菌 MG1655 作为参考模板,我们发现 16S rDNA 多样性被严重高估,除非应用相对严格的读取质量过滤和低聚类阈值。特别是,去除未解决碱基和异常读取长度的读取的常见做法不足以确保微生物多样性的准确估计。此外,常见和可重复的长同源序列错误可能导致相对丰富的虚假分类群,进一步混淆数据解释。我们建议使用严格的基于质量的修剪 16S 焦磷酸标签和聚类阈值不大于 97%的身份,以避免对稀有生物圈的高估。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验