Universidade Federal do Pampa, Campus São Gabriel, Av. Antônio Trilha, 1847, São Gabriel, RS, Brazil.
J Microbiol Methods. 2011 Jul;86(1):42-51. doi: 10.1016/j.mimet.2011.03.014. Epub 2011 Mar 30.
The analysis of amplified and sequenced 16S rRNA genes has become the most important single approach for microbial diversity studies. The new sequencing technologies allow for sequencing thousands of reads in a single run and a cost-effective option is split into a single run across many samples. However for this type of investigation the key question that needs to be answered is how many samples can be sequenced without biasing the results due to lack of sequence representativeness? In this work we demonstrated that the level of sequencing effort used for analyzing soil microbial communities biases the results and determines the most effective type of analysis for small and large datasets. Many simulations were performed with four independent pyrosequencing-generated 16S rRNA gene libraries from different environments. The analysis performed here illustrates the lack of resolution of OTU-based approaches for datasets with low sequence coverage. This analysis should be performed with at least 90% of sequence coverage. Diversity index values increase with sample size making normalization of the number of sequences in all samples crucial. An important finding of this study was the advantage of phylogenetic approaches for examining microbial communities with low sequence coverage. However, if the environments being compared were closely related, a deeper sequencing would be necessary to detect the variation in the microbial composition.
对扩增和测序的 16S rRNA 基因的分析已成为微生物多样性研究的最重要的单一方法。新的测序技术允许在单次运行中对数千个读取进行测序,并且一种具有成本效益的选择是将多个样本分配到单个运行中。然而,对于这种类型的调查,需要回答的关键问题是,在由于缺乏序列代表性而导致结果产生偏差的情况下,可以对多少个样本进行测序?在这项工作中,我们证明了用于分析土壤微生物群落的测序工作水平会产生偏差,并且确定了小数据集和大数据集的最有效分析类型。使用来自不同环境的四个独立的焦磷酸测序生成的 16S rRNA 基因文库进行了许多模拟。此处的分析说明了基于 OTU 的方法对于具有低序列覆盖率的数据集的分辨率不足。这种分析至少需要 90%的序列覆盖率。随着样本量的增加,多样性指数值增加,因此对所有样本中序列数量进行归一化至关重要。本研究的一个重要发现是,对于具有低序列覆盖率的微生物群落,系统发育方法具有优势。但是,如果正在比较的环境密切相关,则需要更深入的测序才能检测到微生物组成的变化。