Schroeder C, Jurkschat H, Meisel A, Reich J G, Krüger D
Gene. 1986;45(1):77-86. doi: 10.1016/0378-1119(86)90134-4.
Selected and counterselected oligodeoxynucleotide sequences were identified in the total sequence of bacteriophage T7 DNA using a statistical criterion derived for a probability model of the Markov chain type. All extremely rare tetra- and pentadeoxynucleotides are (or contain) recognition sequences for the Escherichia coli DNA methylases dam or dcm. Most of the 37 hexadeoxynucleotides absent from T7 DNA are recognition sequences for type II modification/restriction enzymes of E. coli or related species. In contrast to most restriction sites counterselected during evolution, the EcoP1 site GGTCT occurs 126 times in the T7 genome, and phage T7 replication is severely repressed in P1-lysogenic host cells. We demonstrate that the frequency of the EcoP1 site is determined by that of the overlapping recognition sites for T7 primase, an essential phage enzyme. The recognition site of a type III enzyme, EcoP15, is also not counterselected. In T7 DNA all 36 EcoP15 sites are arranged in such a manner that the sequence CAGCAG is confined to the H strand, the complementary sequence CTGCTG to the L strand. This "strand bias" is highly significant and, therefore, very probably selected. A functional relation between this strand bias and the refractive behaviour of phage T7 to EcoP15 restriction is suspected.
利用为马尔可夫链类型的概率模型推导的统计标准,在噬菌体T7 DNA的全序列中鉴定出了选择性和反选择性的寡脱氧核苷酸序列。所有极其罕见的四聚和五聚脱氧核苷酸都是(或包含)大肠杆菌DNA甲基化酶dam或dcm的识别序列。T7 DNA中不存在的37种六聚脱氧核苷酸中的大多数是大肠杆菌或相关物种的II型修饰/限制酶的识别序列。与进化过程中反选择的大多数限制位点不同,EcoP1位点GGTCT在T7基因组中出现126次,并且噬菌体T7在P1溶原性宿主细胞中的复制受到严重抑制。我们证明EcoP1位点的频率由T7引发酶(一种必需的噬菌体酶)的重叠识别位点的频率决定。III型酶EcoP15的识别位点也没有被反选择。在T7 DNA中,所有36个EcoP15位点都以这样的方式排列,即序列CAGCAG局限于H链,互补序列CTGCTG局限于L链。这种“链偏向”非常显著,因此很可能是被选择的。怀疑这种链偏向与噬菌体T7对EcoP15限制的抗性行为之间存在功能关系。