Department of Zoology, Life Sciences Institute, University of British Columbia, Vancouver, BC V6T1Z3, Canada.
Nucleic Acids Res. 2012 Sep 1;40(17):8536-49. doi: 10.1093/nar/gks640. Epub 2012 Jun 29.
Some naturally competent bacteria exhibit both a strong preference for DNA fragments containing specific 'uptake sequences' and dramatic overrepresentation of these sequences in their genomes. Uptake sequences are often assumed to directly reflect the specificity of the DNA uptake machinery, but the actual specificity has not been well characterized for any bacterium. We produced a detailed analysis of Haemophilus influenzae's uptake specificity, using Illumina sequencing of degenerate uptake sequences in fragments recovered from competent cells. This identified an uptake motif with the same consensus as the motif overrepresented in the genome, with a 9 bp core (AAGTGCGGT) and two short flanking T-rich tracts. Only four core bases (GCGG) were critical for uptake, suggesting that these make strong specific contacts with the uptake machinery. Other core bases had weaker roles when considered individually, as did the T-tracts, but interaction effects between these were also determinants of uptake. The properties of genomic uptake signal sequences (USSs) are also constrained by mutational biases and by selective forces acting on USSs with coding and termination functions. Our findings define constraints on gene transfer by natural transformation and suggest how the DNA uptake machinery overcomes the physical constraints imposed by stiff, highly charged DNA molecules.
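As an illustration of the motif described above, the following minimal Python sketch scans a DNA string for exact matches to the 9 bp core (AAGTGCGGT) on either strand and checks whether the four critical bases (GCGG) are intact in a candidate site. It is not the authors' analysis, which relied on sequencing degenerate uptake sequences; it ignores the flanking T-rich tracts and any interaction effects, and the function names (`find_core_sites`, `critical_bases_intact`) are hypothetical helpers introduced only for this example.

```python
# Illustrative sketch only: exact-match scanning for the core consensus
# reported in the abstract. Names here are hypothetical, not from the paper.

CORE = "AAGTGCGGT"   # 9 bp core consensus stated in the abstract
CRITICAL = "GCGG"    # the four core bases reported as critical for uptake

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an A/C/G/T sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_core_sites(seq: str, core: str = CORE) -> list[tuple[int, str]]:
    """Return (0-based start, strand) for each exact core match.

    '+' means the core appears as written; '-' means its reverse
    complement appears, i.e. the core lies on the opposite strand.
    """
    seq = seq.upper()
    rc_core = reverse_complement(core)
    hits = []
    for i in range(len(seq) - len(core) + 1):
        window = seq[i:i + len(core)]
        if window == core:
            hits.append((i, "+"))
        elif window == rc_core:
            hits.append((i, "-"))
    return hits


def critical_bases_intact(site: str, core: str = CORE,
                          critical: str = CRITICAL) -> bool:
    """Check whether a 9 bp candidate site (given in core orientation)
    carries the critical bases at the same offset as in the consensus."""
    offset = core.index(critical)
    return site.upper()[offset:offset + len(critical)] == critical


if __name__ == "__main__":
    example = "TTTAAGTGCGGTCCCACCGCACTTGG"  # made-up test sequence
    print(find_core_sites(example))
    print(critical_bases_intact("TTCAGCGGA"))
```

Exact matching is the simplest possible choice here. Because the abstract reports that bases outside GCGG contribute more weakly and interact with one another, a position weight matrix or similar scoring scheme would be a more faithful model, but its parameters are not given in this text.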