• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

减少 PCR 扩增和测序伪影对基于 16S rRNA 的研究的影响。

Reducing the effects of PCR amplification and sequencing artifacts on 16S rRNA-based studies.

机构信息

Department of Microbiology & Immunology, University of Michigan, Ann Arbor, Michigan, United States of America.

出版信息

PLoS One. 2011;6(12):e27310. doi: 10.1371/journal.pone.0027310. Epub 2011 Dec 14.

DOI:10.1371/journal.pone.0027310
PMID:22194782
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3237409/
Abstract

The advent of next generation sequencing has coincided with a growth in interest in using these approaches to better understand the role of the structure and function of the microbial communities in human, animal, and environmental health. Yet, use of next generation sequencing to perform 16S rRNA gene sequence surveys has resulted in considerable controversy surrounding the effects of sequencing errors on downstream analyses. We analyzed 2.7×10(6) reads distributed among 90 identical mock community samples, which were collections of genomic DNA from 21 different species with known 16S rRNA gene sequences; we observed an average error rate of 0.0060. To improve this error rate, we evaluated numerous methods of identifying bad sequence reads, identifying regions within reads of poor quality, and correcting base calls and were able to reduce the overall error rate to 0.0002. Implementation of the PyroNoise algorithm provided the best combination of error rate, sequence length, and number of sequences. Perhaps more problematic than sequencing errors was the presence of chimeras generated during PCR. Because we knew the true sequences within the mock community and the chimeras they could form, we identified 8% of the raw sequence reads as chimeric. After quality filtering the raw sequences and using the Uchime chimera detection program, the overall chimera rate decreased to 1%. The chimeras that could not be detected were largely responsible for the identification of spurious operational taxonomic units (OTUs) and genus-level phylotypes. The number of spurious OTUs and phylotypes increased with sequencing effort indicating that comparison of communities should be made using an equal number of sequences. Finally, we applied our improved quality-filtering pipeline to several benchmarking studies and observed that even with our stringent data curation pipeline, biases in the data generation pipeline and batch effects were observed that could potentially confound the interpretation of microbial community data.

摘要

下一代测序的出现恰逢人们对利用这些方法来更好地了解微生物群落的结构和功能在人类、动物和环境健康中的作用产生了浓厚的兴趣。然而,使用下一代测序来进行 16S rRNA 基因序列调查导致了围绕测序错误对下游分析影响的相当大的争议。我们分析了分布在 90 个相同模拟群落样本中的 2.7×10(6)个读取,这些样本是来自 21 个具有已知 16S rRNA 基因序列的物种的基因组 DNA 的集合;我们观察到平均错误率为 0.0060。为了提高这个错误率,我们评估了许多识别不良序列读取、识别读取中质量较差区域以及纠正碱基调用的方法,并且能够将总错误率降低到 0.0002。实施 PyroNoise 算法提供了错误率、序列长度和序列数量的最佳组合。比测序错误更成问题的是在 PCR 过程中产生的嵌合体。因为我们知道模拟群落中的真实序列和它们可以形成的嵌合体,所以我们将 8%的原始序列读取识别为嵌合体。在对原始序列进行质量过滤并使用 Uchime 嵌合体检测程序后,总体嵌合体率下降到 1%。无法检测到的嵌合体主要负责识别虚假的操作分类单元 (OTU) 和属水平的分类群。随着测序工作量的增加,虚假 OTU 和分类群的数量增加,表明应该使用相同数量的序列来比较群落。最后,我们将改进的质量过滤管道应用于几个基准研究,并观察到即使使用我们严格的数据管理管道,也会观察到数据生成管道和批次效应中的偏差,这些偏差可能会混淆对微生物群落数据的解释。

相似文献

1
Reducing the effects of PCR amplification and sequencing artifacts on 16S rRNA-based studies.减少 PCR 扩增和测序伪影对基于 16S rRNA 的研究的影响。
PLoS One. 2011;6(12):e27310. doi: 10.1371/journal.pone.0027310. Epub 2011 Dec 14.
2
Impact of DNA Sequencing and Analysis Methods on 16S rRNA Gene Bacterial Community Analysis of Dairy Products.DNA 测序和分析方法对乳制品 16S rRNA 基因细菌群落分析的影响。
mSphere. 2018 Oct 17;3(5):e00410-18. doi: 10.1128/mSphere.00410-18.
3
Evaluation of PacBio sequencing for full-length bacterial 16S rRNA gene classification.用于全长细菌16S rRNA基因分类的PacBio测序评估
BMC Microbiol. 2016 Nov 14;16(1):274. doi: 10.1186/s12866-016-0891-4.
4
PCR biases distort bacterial and archaeal community structure in pyrosequencing datasets.PCR 偏倚会扭曲焦磷酸测序数据集的细菌和古菌群落结构。
PLoS One. 2012;7(8):e43093. doi: 10.1371/journal.pone.0043093. Epub 2012 Aug 15.
5
Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.下一代测序的实地检验-从 PCR 扩增子焦磷酸测序估计微生物生态学中的偏差和误差。
PLoS One. 2012;7(9):e44224. doi: 10.1371/journal.pone.0044224. Epub 2012 Sep 6.
6
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons.Sanger 和 454 焦磷酸测序 PCR 扩增子中的嵌合 16S rRNA 序列的形成与检测。
Genome Res. 2011 Mar;21(3):494-504. doi: 10.1101/gr.112730.110. Epub 2011 Jan 6.
7
bioOTU: An Improved Method for Simultaneous Taxonomic Assignments and Operational Taxonomic Units Clustering of 16s rRNA Gene Sequences.生物OTU:一种用于16S rRNA基因序列分类分配和操作分类单元聚类的改进方法。
J Comput Biol. 2016 Apr;23(4):229-38. doi: 10.1089/cmb.2015.0214. Epub 2016 Mar 7.
8
Distribution-based clustering: using ecology to refine the operational taxonomic unit.基于分布的聚类:利用生态学细化操作分类单元。
Appl Environ Microbiol. 2013 Nov;79(21):6593-603. doi: 10.1128/AEM.00342-13. Epub 2013 Aug 23.
9
Removing noise from pyrosequenced amplicons.从焦磷酸测序扩增子中去除噪声。
BMC Bioinformatics. 2011 Jan 28;12:38. doi: 10.1186/1471-2105-12-38.
10
MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.环境宏基因组的MinION™纳米孔测序:一种合成方法。
Gigascience. 2017 Mar 1;6(3):1-10. doi: 10.1093/gigascience/gix007.

引用本文的文献

1
Seasonal blooms of in a temperate semi-enclosed bay: linking community succession to thermal and nutrient regimes.温带半封闭海湾中[某种生物]的季节性大量繁殖:将群落演替与热量和营养状况联系起来。 需注意,原文中“of”后面缺少具体内容,这里用“[某种生物]”来表示原文不完整的部分。
Front Microbiol. 2025 Aug 5;16:1650890. doi: 10.3389/fmicb.2025.1650890. eCollection 2025.
2
Rapid and reliable species-level identification from clinical samples using 16 S rRNA gene nanopore sequencing analysis.使用16S rRNA基因纳米孔测序分析从临床样本中进行快速可靠的物种水平鉴定。
Sci Rep. 2025 Aug 5;15(1):28606. doi: 10.1038/s41598-025-14071-3.
3
Diet and environment drive the convergence of gut microbiome in wild-released giant pandas and forest musk deer.

本文引用的文献

1
UCHIME improves sensitivity and speed of chimera detection.UCHIME 提高了嵌合体检测的灵敏度和速度。
Bioinformatics. 2011 Aug 15;27(16):2194-200. doi: 10.1093/bioinformatics/btr381. Epub 2011 Jun 23.
2
Assessing and improving methods used in operational taxonomic unit-based approaches for 16S rRNA gene sequence analysis.评估和改进基于操作分类单元的 16S rRNA 基因序列分析方法。
Appl Environ Microbiol. 2011 May;77(10):3219-26. doi: 10.1128/AEM.02810-10. Epub 2011 Mar 18.
3
Removing noise from pyrosequenced amplicons.从焦磷酸测序扩增子中去除噪声。
饮食和环境促使野生放归大熊猫和林麝的肠道微生物群趋同。
iScience. 2025 Jun 7;28(7):112837. doi: 10.1016/j.isci.2025.112837. eCollection 2025 Jul 18.
4
Impact of DNA Extraction and 16S rRNA Gene Amplification Strategy on Microbiota Profiling of Faecal Samples.DNA提取和16S rRNA基因扩增策略对粪便样本微生物群分析的影响
Int J Mol Sci. 2025 May 29;26(11):5226. doi: 10.3390/ijms26115226.
5
Analyzing human gut microbiome data from global populations: challenges and resources.分析来自全球人群的人类肠道微生物组数据:挑战与资源
Trends Microbiol. 2025 Jun 6. doi: 10.1016/j.tim.2025.05.008.
6
Integrating traditional omics and machine learning approaches to identify microbial biomarkers and therapeutic targets in pediatric inflammatory bowel disease.整合传统组学和机器学习方法以识别儿童炎症性肠病中的微生物生物标志物和治疗靶点。
Front Pharmacol. 2025 Apr 14;16:1545392. doi: 10.3389/fphar.2025.1545392. eCollection 2025.
7
Protein Coding Low-Copy and Regions Are Viable Fungal Metabarcoding DNA Markers Which Can Supplement ITS for Better Accuracy.蛋白质编码低拷贝区域是可行的真菌代谢条形码DNA标记,可补充ITS以提高准确性。
Ecol Evol. 2025 Apr 21;15(4):e71352. doi: 10.1002/ece3.71352. eCollection 2025 Apr.
8
DNA Metabarcoding Illuminates Seasonal Dietary Pattern and Niche Partitioning by Three Sympatric Herbivores.DNA 代谢条形码技术揭示了三种同域食草动物的季节性饮食模式和生态位划分。
Ecol Evol. 2025 Apr 21;15(4):e71321. doi: 10.1002/ece3.71321. eCollection 2025 Apr.
9
Refining microbiome diversity analysis by concatenating and integrating dual 16S rRNA amplicon reads.通过拼接和整合双16S rRNA扩增子读数来优化微生物组多样性分析。
NPJ Biofilms Microbiomes. 2025 Apr 12;11(1):57. doi: 10.1038/s41522-025-00686-x.
10
Comparison of Gut Microbiota in Two Different Maternal Exposure Models of Autism Spectrum Disorder in Mice.两种不同的小鼠自闭症谱系障碍母体暴露模型中肠道微生物群的比较
Alpha Psychiatry. 2025 Feb 28;26(1):38790. doi: 10.31083/AP38790. eCollection 2025 Feb.
BMC Bioinformatics. 2011 Jan 28;12:38. doi: 10.1186/1471-2105-12-38.
4
Chimeric 16S rRNA sequence formation and detection in Sanger and 454-pyrosequenced PCR amplicons.Sanger 和 454 焦磷酸测序 PCR 扩增子中的嵌合 16S rRNA 序列的形成与检测。
Genome Res. 2011 Mar;21(3):494-504. doi: 10.1101/gr.112730.110. Epub 2011 Jan 6.
5
Tackling the widespread and critical impact of batch effects in high-throughput data.解决高通量数据中广泛存在且极具影响力的批次效应问题。
Nat Rev Genet. 2010 Oct;11(10):733-9. doi: 10.1038/nrg2825. Epub 2010 Sep 14.
6
Rapidly denoising pyrosequencing amplicon reads by exploiting rank-abundance distributions.利用秩丰度分布快速去噪焦磷酸测序扩增子reads。
Nat Methods. 2010 Sep;7(9):668-9. doi: 10.1038/nmeth0910-668b.
7
The effects of alignment quality, distance calculation method, sequence filtering, and region on the analysis of 16S rRNA gene-based studies.基于 16S rRNA 基因的研究中,序列比对质量、距离计算方法、序列过滤和区域对分析的影响。
PLoS Comput Biol. 2010 Jul 8;6(7):e1000844. doi: 10.1371/journal.pcbi.1000844.
8
Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample.每个样本深度达到数百万条序列的 16S rRNA 多样性的全球模式。
Proc Natl Acad Sci U S A. 2011 Mar 15;108 Suppl 1(Suppl 1):4516-22. doi: 10.1073/pnas.1000080107. Epub 2010 Jun 3.
9
Replicate or lie.复制或说谎。
Environ Microbiol. 2010 Jul;12(7):1806-10. doi: 10.1111/j.1462-2920.2010.02201.x. Epub 2010 Apr 28.
10
Ironing out the wrinkles in the rare biosphere through improved OTU clustering.通过改进的 OTU 聚类来消除稀有生物群落中的褶皱。
Environ Microbiol. 2010 Jul;12(7):1889-98. doi: 10.1111/j.1462-2920.2010.02193.x. Epub 2010 Mar 11.