Epstein Hannah E, Hernandez-Agreda Alejandra, Starko Samuel, Baum Julia K, Vega Thurber Rebecca
Department of Microbiology, Oregon State University, Corvallis, OR, United States.
California Academy of Sciences, San Francisco, CA, United States.
Front Microbiol. 2021 Nov 25;12:740932. doi: 10.3389/fmicb.2021.740932. eCollection 2021.
16S rRNA gene profiling (amplicon sequencing) is a popular technique for understanding host-associated and environmental microbial communities. Most protocols for sequencing amplicon libraries follow a standardized pipeline that can differ slightly depending on laboratory facility and user. Given that the same variable region of the 16S gene is targeted, it is generally accepted that sequencing output from differing protocols are comparable and this assumption underlies our ability to identify universal patterns in microbial dynamics through meta-analyses. However, discrepant results from a combined 16S rRNA gene dataset prepared by two labs whose protocols differed only in DNA polymerase and sequencing platform led us to scrutinize the outputs and challenge the idea of confidently combining them for standard microbiome analysis. Using technical replicates of reef-building coral samples from two species, and , we evaluated the consistency of alpha and beta diversity metrics between data resulting from these highly similar protocols. While we found minimal variation in alpha diversity between platform, significant differences were revealed with most beta diversity metrics, dependent on host species. These inconsistencies persisted following removal of low abundance taxa and when comparing across higher taxonomic levels, suggesting that bacterial community differences associated with sequencing protocol are likely to be context dependent and difficult to correct without extensive validation work. The results of this study encourage caution in the statistical comparison and interpretation of studies that combine rRNA gene sequence data from distinct protocols and point to a need for further work identifying mechanistic causes of these observed differences.
16S rRNA基因分析(扩增子测序)是一种用于了解宿主相关和环境微生物群落的常用技术。大多数扩增子文库测序方案都遵循标准化流程,该流程可能会因实验室设施和用户的不同而略有差异。鉴于16S基因的同一可变区是目标区域,人们普遍认为不同方案的测序结果具有可比性,这一假设是我们通过荟萃分析识别微生物动态普遍模式的能力的基础。然而,两个实验室制备的16S rRNA基因数据集结果存在差异,这两个实验室的方案仅在DNA聚合酶和测序平台上有所不同,这促使我们仔细审查这些结果,并对将它们可靠地合并用于标准微生物组分析的想法提出质疑。我们使用来自两种造礁珊瑚物种( 和 )的技术重复样本,评估了这些高度相似方案产生的数据之间的α多样性和β多样性指标的一致性。虽然我们发现不同平台之间的α多样性变化最小,但大多数β多样性指标显示出显著差异,这取决于宿主物种。在去除低丰度分类群后以及在更高分类水平上进行比较时,这些不一致性仍然存在,这表明与测序方案相关的细菌群落差异可能取决于具体情况,并且在没有广泛验证工作的情况下很难纠正。这项研究的结果提醒人们在对来自不同方案的rRNA基因序列数据进行统计比较和解释时要谨慎,并指出需要进一步开展工作来确定这些观察到的差异的机制原因。