Tanaka Mark M, Rosenberg Noah A, Small Peter M
School of Biotechnology & Biomolecular Sciences, University of New South Wales, Australia.
Mol Biol Evol. 2004 Dec;21(12):2195-201. doi: 10.1093/molbev/msh234. Epub 2004 Aug 18.
Insertion sequence (IS) elements are bacterial genes that are able to transpose to different locations in the genome. These elements are often used in molecular epidemiology as genetic markers that track the spread of pathogens. Transposable elements have frequently been described as "selfish DNA" because they facilitate their own transposition, causing damage when they insert into coding regions, while contributing little if anything to the bacterial host. According to this hypothesis, the expansion of copy number of insertion sequences is opposed by negative selection against high copy numbers. From an alternative point of view, we might expect IS elements to intrinsically regulate transposition within cells, thereby limiting damage to their bacterial host. Here, we report evidence that the copy number of IS6110 in Mycobacterium tuberculosis is controlled by selection against the element. We first construct 12 different models of marker change resulting from a combination of possible transposition functions and selective regimes. We then compute the Akaike Information Criterion for each model to identify the models that best explain data consisting of serial isolates of M. tuberculosis genotyped with IS6110. We find that the best performing models all include selection against the accumulation of copies. Specifically, our analysis points to the interaction of separate copies of the element causing lethal effects. We discuss the implications of these findings for genome evolution and molecular epidemiology.
插入序列(IS)元件是能够转座到基因组不同位置的细菌基因。这些元件在分子流行病学中常被用作追踪病原体传播的遗传标记。转座元件经常被描述为“自私DNA”,因为它们促进自身转座,插入编码区时会造成损害,而对细菌宿主几乎没有贡献。根据这一假设,插入序列拷贝数的增加受到针对高拷贝数的负选择的抑制。从另一个角度来看,我们可能期望IS元件在细胞内对转座进行内在调节,从而限制对其细菌宿主的损害。在此,我们报告证据表明结核分枝杆菌中IS6110的拷贝数受针对该元件的选择控制。我们首先构建了12种不同的由可能的转座功能和选择机制组合产生的标记变化模型。然后,我们计算每个模型的赤池信息准则,以识别最能解释由用IS6110进行基因分型的结核分枝杆菌系列分离株组成的数据的模型。我们发现表现最佳的模型都包括针对拷贝积累的选择。具体而言,我们的分析指出该元件的单独拷贝之间的相互作用会导致致死效应。我们讨论了这些发现对基因组进化和分子流行病学的意义。