Nelson K E, Richardson D L, Dougherty B A
Institute for Genomic Research, Rockville, Maryland, USA.
Microb Comp Genomics. 1997;2(4):313-21. doi: 10.1089/omi.1.1997.2.313.
The availability of completely sequenced genomes has created an opportunity for high throughput mutational studies. Using the conjugative transposon Tn916, a pilot project was initiated to determine the efficiency of gene disruption in the first completely sequenced bacterium, Haemophilus influenzae Rd strain KW20. DNA was isolated from Tn916-mutagenized cells, and the point of transposon insertion was determined by inverse PCR, DNA sequencing, and mapping to the wild-type genome sequence. Analysis of the insertion sites at the nucleotide level demonstrated a biased pattern of insertion into regions rich in stretches of A's and T's. Although Tn916 integrated at multiple dispersed positions throughout the chromosome, 9 of 10 insertion events occurred in noncoding, intergenic DNA. It was determined that the intergenic DNA was over 5% more A + T-rich than that of protein coding sequences. This suggests that A + T-rich sequences similar to the Tn916 insertion site would be more likely to reside in the intergenic DNA. In an effort to identify other likely sites for transposon integration, a hidden Markov model of the consensus target insertion site was derived from the Tn916-H. influenzae junction fragments and searched against the entire genome. Eighty percent of the 30 highest-scoring predicted Tn916 target sites were from intergenic, nonprotein-coding regions of the genome. These data support the hypothesis that Tn916 has a marked preference for insertion into noncoding DNA for H. influenzae, suggesting that this mobile element has evolved to minimize disruption of host cell function on integration.
完全测序基因组的可得性为高通量突变研究创造了机会。利用接合转座子Tn916,启动了一个试点项目,以确定在首个完全测序的细菌——流感嗜血杆菌Rd菌株KW20中进行基因破坏的效率。从经Tn916诱变的细胞中分离DNA,并通过反向PCR、DNA测序以及与野生型基因组序列比对来确定转座子插入位点。在核苷酸水平对插入位点的分析表明,插入富含A和T序列区域的模式存在偏差。尽管Tn916整合在整个染色体的多个分散位置,但10个插入事件中有9个发生在非编码的基因间DNA中。已确定基因间DNA的A+T含量比蛋白质编码序列高5%以上。这表明类似于Tn916插入位点的富含A+T的序列更有可能存在于基因间DNA中。为了识别转座子整合的其他可能位点,从Tn916-流感嗜血杆菌连接片段推导了共有靶标插入位点的隐马尔可夫模型,并在整个基因组中进行搜索。预测得分最高的30个Tn916靶标位点中,80%来自基因组的基因间非蛋白质编码区域。这些数据支持了这样的假设,即Tn916对流感嗜血杆菌非编码DNA的插入具有明显偏好,这表明这种移动元件在进化过程中已尽量减少整合时对宿主细胞功能的破坏。