Acinas Silvia G, Marcelino Luisa A, Klepac-Ceraj Vanja, Polz Martin F
Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA.
J Bacteriol. 2004 May;186(9):2629-35. doi: 10.1128/JB.186.9.2629-2635.2004.
The level of sequence heterogeneity among rrn operons within genomes determines the accuracy of diversity estimation by 16S rRNA-based methods. Furthermore, the occurrence of widespread horizontal gene transfer (HGT) between distantly related rrn operons casts doubt on reconstructions of phylogenetic relationships. For this study, patterns of distribution of rrn copy numbers, interoperonic divergence, and redundancy of 16S rRNA sequences were evaluated. Bacterial genomes display up to 15 operons and operon numbers up to 7 are commonly found, but approximately 40% of the organisms analyzed have either one or two operons. Among the Archaea, a single operon appears to dominate and the highest number of operons is five. About 40% of sequences among 380 operons in 76 bacterial genomes with multiple operons were identical to at least one other 16S rRNA sequence in the same genome, and in 38% of the genomes all 16S rRNAs were invariant. For Archaea, the number of identical operons was only 25%, but only five genomes with 21 operons are currently available. These considerations suggest an upper bound of roughly threefold overestimation of bacterial diversity resulting from cloning and sequencing of 16S rRNA genes from the environment; however, the inclusion of genomes with a single rrn operon may lower this correction factor to approximately 2.5. Divergence among operons appears to be small overall for both Bacteria and Archaea, with the vast majority of 16S rRNA sequences showing <1% nucleotide differences. Only five genomes with operons with a higher level of nucleotide divergence were detected, and Thermoanaerobacter tengcongensis exhibited the highest level of divergence (11.6%) noted to date. Overall, four of the five extreme cases of operon differences occurred among thermophilic bacteria, suggesting a much higher incidence of HGT in these bacteria than in other groups.
基因组内rrn操纵子之间的序列异质性水平决定了基于16S rRNA方法进行多样性估计的准确性。此外,远缘rrn操纵子之间广泛发生的水平基因转移(HGT)对系统发育关系的重建提出了质疑。在本研究中,评估了rrn拷贝数的分布模式、操纵子间差异以及16S rRNA序列的冗余性。细菌基因组最多可显示15个操纵子,通常可发现操纵子数量多达7个,但分析的生物体中约40%只有一个或两个操纵子。在古菌中,单个操纵子似乎占主导地位,操纵子的最高数量为5个。在76个具有多个操纵子的细菌基因组的380个操纵子中,约40%的序列与同一基因组中的至少一个其他16S rRNA序列相同,并且在38%的基因组中所有16S rRNA都是不变的。对于古菌,相同操纵子的数量仅为25%,但目前仅有5个具有21个操纵子的基因组。这些考虑表明,从环境中克隆和测序16S rRNA基因导致细菌多样性估计大约高估了三倍;然而,纳入具有单个rrn操纵子的基因组可能会将这个校正因子降低到约2.5。对于细菌和古菌来说,操纵子之间的差异总体上似乎很小,绝大多数16S rRNA序列显示出<1%的核苷酸差异。仅检测到5个具有较高核苷酸差异水平操纵子的基因组,嗜热栖热菌表现出迄今为止所记录的最高差异水平(11.6%)。总体而言,操纵子差异的5个极端案例中有4个发生在嗜热细菌中,这表明这些细菌中HGT的发生率比其他群体高得多。