National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bellary Road, Bangalore 560065, India.
Genome Biol Evol. 2022 Jul 2;14(7). doi: 10.1093/gbe/evac102.
Bacterial genome organization is primarily driven by chromosomal replication from a single origin of replication. However, chromosomal rearrangements, which can disrupt such organization, are inevitable in nature. Long DNA repeats are major players mediating rearrangements, large and small, via homologous recombination. Since changes to genome organization affect bacterial fitness-and more so in fast-growing than slow-growing bacteria-and are under selection, it is reasonable to expect that genomic positioning of long DNA repeats is also under selection. To test this, we identified identical DNA repeats of at least 100 base pairs across ∼6,000 bacterial genomes and compared their distribution in fast- and slow-growing bacteria. We found that long identical DNA repeats are distributed in a non-random manner across bacterial genomes. Their distribution differs in the overall number, orientation, and proximity to the origin of replication, between fast- and slow-growing bacteria. We show that their positioning-which might arise from a combination of the processes that produce repeats and selection on rearrangements that recombination between repeat elements might cause-permits less disruption to the replication-dependent genome organization of bacteria compared with random suggesting it as a major constraint to positioning of long DNA repeats.
细菌基因组的组织主要是由单个复制起点的染色体复制所驱动的。然而,染色体重排,这种可以破坏这种组织的结构,在自然界中是不可避免的。长 DNA 重复序列是通过同源重组介导大、小染色体重排的主要参与者。由于基因组组织的改变会影响细菌的适应性——在快速生长的细菌中比在缓慢生长的细菌中更为明显——并且受到选择的影响,因此有理由认为长 DNA 重复序列的基因组定位也受到选择的影响。为了验证这一点,我们在大约 6000 个细菌基因组中识别了至少 100 个碱基对的相同 DNA 重复序列,并比较了它们在快速生长和缓慢生长细菌中的分布。我们发现长的相同 DNA 重复序列在细菌基因组中呈非随机分布。它们在快速生长和缓慢生长细菌之间的整体数量、方向和与复制起点的接近程度上存在差异。我们表明,它们的定位——可能是由产生重复的过程和对重组可能引起的重排的选择共同作用的结果——与随机定位相比,对细菌复制依赖性基因组组织的破坏程度较小,这表明它是长 DNA 重复序列定位的主要限制因素。