Department of Animal Sciences, The Ohio State University, 2029 Fyffe Road, Columbus, OH 43210, USA.
J Microbiol Methods. 2011 Jan;84(1):81-7. doi: 10.1016/j.mimet.2010.10.020. Epub 2010 Oct 31.
Operational taxonomic units (OTUs) are conventionally defined at fixed phylogenetic distance cutoffs (0.03 for species, 0.05 for genus, 0.10 for family) based on full-length 16S rRNA gene sequences. However, most studies have used partial sequences (700 bp or shorter). This discrepancy may affect analyses of diversity and species richness because sequence divergence is not evenly distributed along the 16S rRNA gene. In this study, we compared one set each of nearly full-length bacterial and archaeal 16S rRNA gene sequences with multiple sets of partial 16S rRNA gene sequences derived from them (approximately 440-700 bp), at conventional and alternative distance levels. Our objective was to identify the partial sequence region(s) and distance level(s) that allow more accurate phylogenetic analysis of partial 16S rRNA genes. Our results showed that no partial sequence region could estimate OTU richness or define OTUs as reliably as nearly full-length genes. However, the V1-V4 regions can provide more accurate estimates than the other partial regions. For analysis of archaea, we recommend the V1-V3 and the V4-V7 regions, with species-level OTUs clustered at distances of 0.03 and 0.02, respectively. For analysis of bacteria, the V1-V3 and the V1-V4 regions should be targeted, with species-level OTUs clustered at a distance of 0.04 in both cases.
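To make the idea of clustering OTUs at a distance cutoff concrete, the following is a minimal sketch in Python. The abstract does not state which distance measure or clustering algorithm was used, so the uncorrected p-distance and the complete-linkage (furthest-neighbor) rule below are assumptions chosen for illustration. The names `RECOMMENDED_CUTOFF`, `p_distance`, and `cluster_otus` are hypothetical, and the toy sequences are not real 16S rRNA data. Only the cutoff values in the lookup table come from the abstract.

```python
from itertools import combinations

# Species-level cutoffs recommended in the abstract, keyed by (domain, region).
RECOMMENDED_CUTOFF = {
    ("archaea", "V1-V3"): 0.03,
    ("archaea", "V4-V7"): 0.02,
    ("bacteria", "V1-V3"): 0.04,
    ("bacteria", "V1-V4"): 0.04,
}


def p_distance(a, b):
    """Uncorrected distance: fraction of mismatches between two aligned
    sequences, skipping columns where either sequence has a gap ('-')."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    columns = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not columns:
        raise ValueError("no comparable columns")
    return sum(x != y for x, y in columns) / len(columns)


def cluster_otus(seqs, cutoff):
    """Complete-linkage clustering (an assumption, not the paper's stated
    method): keep merging the two closest clusters while the largest
    pairwise distance between their members is <= cutoff."""
    names = list(seqs)
    dist = {
        frozenset((i, j)): p_distance(seqs[i], seqs[j])
        for i, j in combinations(names, 2)
    }

    def linkage(c1, c2):
        return max(dist[frozenset((i, j))] for i in c1 for j in c2)

    clusters = [[n] for n in names]
    while len(clusters) > 1:
        d, x, y = min(
            (linkage(clusters[x], clusters[y]), x, y)
            for x, y in combinations(range(len(clusters)), 2)
        )
        if d > cutoff:
            break
        clusters[x] = clusters[x] + clusters[y]  # x < y, so deleting y is safe
        del clusters[y]
    return clusters


# Toy aligned sequences (illustrative only): seq_a and seq_b differ at 3 of 100 sites.
toy = {
    "seq_a": "A" * 100,
    "seq_b": "A" * 97 + "CCC",
    "seq_c": "C" * 100,
}
cutoff = RECOMMENDED_CUTOFF[("bacteria", "V1-V3")]
print(len(cluster_otus(toy, cutoff)))  # 2 OTUs at 0.04
print(len(cluster_otus(toy, 0.02)))    # 3 OTUs at 0.02
```

With these toy sequences, the pair that differs by 3% falls into one OTU at the 0.04 cutoff but splits into two at 0.02, which shows how the choice of cutoff changes the richness estimate for the same data.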