Rocha Eduardo P C, Danchin Antoine
Unité Génétique des Génomes Bactériens, Institut Pasteur, 28, rue du Dr Roux, 75724 Paris Cedex 15, France.
Nucleic Acids Res. 2003 Nov 15;31(22):6570-7. doi: 10.1093/nar/gkg859.
In Escherichia coli and Bacillus subtilis, essentiality, not expressivity, drives the distribution of genes between the two replicating strands. Although essential genes tend to be coded in the leading replicating strand, the underlying selective constraints and the evolutionary extent of these findings have still not been subject to comparative studies. Here, we extend our previous analysis to the genomes of low G + C firmicutes and gamma-proteobacteria, and in a second step to all sequenced bacterial genomes. The inference of essentiality by homology allows us to show that essential genes are much more frequent in the leading strand than other genes, even when compared with non-essential highly expressed genes. Smaller biases were found in the genomes of obligatory intracellular bacteria, for which the assignment of essentiality by homology from fast-growing free-living bacteria is most problematic. Cross-comparisons used to assess potential errors in the assignment of essentiality by homology revealed that, in most cases, variations in the assignment criteria have little influence on the overall results. Essential genes tend to be more conserved in the leading strand than average genes, which is consistent with selection for this positioning and may impose a strong constraint on chromosomal rearrangements. These results indicate that essentiality plays a fundamental role in the distribution of genes in most bacterial genomes.
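The comparison described in the abstract (the share of essential versus other genes coded in the leading strand) can be illustrated with a minimal sketch. It is not the authors' pipeline. It assumes a circular chromosome with known origin and terminus coordinates, treats a gene as leading-strand when it is transcribed in the same direction as the replication fork that passes over it, and uses an invented toy gene list. The names `Gene`, `on_leading_strand`, and `leading_fraction` are hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Gene:
    name: str
    start: int       # coordinate on the circular chromosome
    strand: int      # +1 = forward strand, -1 = reverse strand
    essential: bool  # assumed to come from experiment or homology inference


def on_leading_strand(gene: Gene, ori: int, ter: int, genome_len: int) -> bool:
    """Return True if the gene is co-oriented with its replication fork.

    Assumption: replication is bidirectional from `ori` to `ter`. On the
    replichore running from ori to ter in increasing coordinates, forward
    genes are leading; on the other replichore, reverse genes are leading.
    """
    offset = (gene.start - ori) % genome_len
    ter_offset = (ter - ori) % genome_len
    clockwise_replichore = offset < ter_offset
    return gene.strand == 1 if clockwise_replichore else gene.strand == -1


def leading_fraction(genes: Iterable[Gene], ori: int, ter: int,
                     genome_len: int) -> float:
    """Fraction of the given genes that lie on the leading strand."""
    genes = list(genes)
    if not genes:
        return float("nan")
    hits = sum(on_leading_strand(g, ori, ter, genome_len) for g in genes)
    return hits / len(genes)


# Toy data, invented purely for illustration (not from the paper).
GENOME_LEN, ORI, TER = 1000, 0, 500
toy_genes = [
    Gene("a", 100, +1, True),
    Gene("b", 200, -1, False),
    Gene("c", 700, -1, True),
    Gene("d", 800, +1, False),
    Gene("e", 300, +1, False),
    Gene("f", 900, +1, True),
]

essential = [g for g in toy_genes if g.essential]
other = [g for g in toy_genes if not g.essential]
frac_essential = leading_fraction(essential, ORI, TER, GENOME_LEN)
frac_other = leading_fraction(other, ORI, TER, GENOME_LEN)
print(f"essential: {frac_essential:.2f}, other: {frac_other:.2f}")
```

On a real genome the two fractions would be compared with a statistical test, and the essentiality labels would carry the homology-inference uncertainty that the abstract discusses.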