Beasty A M, Behe M J
Department of Chemistry, Lehigh University, Bethlehem, PA 18015.
Nucleic Acids Res. 1988 Feb 25;16(4):1517-28. doi: 10.1093/nar/16.4.1517.
Twenty-four DNA and RNA viral nucleotide sequences, comprising over 346 kilobases, have been analyzed for the occurrence of strings of contiguous purine or pyrimidine residues. On average, strings of 10 or more contiguous purines or pyrimidines are found three and a half times more frequently than would be expected for a random distribution of bases. Detailed analysis of the 172-kilobase Epstein-Barr viral sequence shows that the bias in favor of contiguous purine residues increases with the length of the purine string. These findings are similar to those seen for genomic DNA from higher eukaryotes. In contrast, no overrepresentation of oligopurine or oligopyrimidine strings is observed in 52 kilobases from eight bacteriophage and E. coli DNA sequences.
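The comparison of observed and expected string counts described above can be illustrated with a minimal Python sketch. The abstract does not state the exact random model, so this sketch assumes independent positions, each a purine (or pyrimidine) with the fraction observed in the sequence itself; the function names (`to_ry`, `observed_runs`, `expected_runs`, `run_bias`) are hypothetical and are not taken from the paper.

```python
import re

def to_ry(seq):
    """Map nucleotides to R (purine: A, G) or Y (pyrimidine: C, T, U).
    Any other symbol becomes N and breaks a run."""
    out = []
    for base in seq.upper():
        if base in "AG":
            out.append("R")
        elif base in "CTU":
            out.append("Y")
        else:
            out.append("N")
    return "".join(out)

def observed_runs(ry, symbol, min_len):
    """Count maximal runs of `symbol` whose length is at least min_len."""
    return sum(1 for m in re.finditer(f"{symbol}+", ry)
               if len(m.group()) >= min_len)

def expected_runs(length, p, min_len):
    """Expected number of maximal runs of length >= min_len, ASSUMING each
    position is independently `symbol` with probability p.
    A run starts at position 0 with probability p**min_len, and at each of
    the other (length - min_len) start positions with probability
    (1 - p) * p**min_len."""
    if length < min_len:
        return 0.0
    return p ** min_len * (1 + (length - min_len) * (1 - p))

def run_bias(seq, min_len=10):
    """Return {symbol: (observed, expected)} for purine (R) and pyrimidine (Y) runs."""
    ry = to_ry(seq)
    n = len(ry)
    result = {}
    for symbol in "RY":
        p = ry.count(symbol) / n if n else 0.0
        result[symbol] = (observed_runs(ry, symbol, min_len),
                          expected_runs(n, p, min_len))
    return result
```

Under these assumptions, dividing the observed count by the expected count for a given sequence gives a ratio comparable in spirit to the overrepresentation factor reported in the abstract, although the authors' own calculation may differ.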