Rogerson A C
Biology Department, St Lawrence University, Canton, NY 13617.
Nucleic Acids Res. 1989 Jul 25;17(14):5547-63. doi: 10.1093/nar/17.14.5547.
I have examined potential determinants of the asymmetric distribution of nucleotide sequences in the genome of Escherichia coli as cataloged in GenBank release 44. I have used the frequency of occurrence of all possible tetranucleotides in a given sequence catalog or derivative as a comparative measure of asymmetry. The GenBank-cataloged strand and its complement show statistically similar (not complementary) distributions. The distribution is statistically similar in comparisons between the protein coding subset and the total genome, the coding subset and selected non-coding genes, the coding subset and the remainder of the DNA, and the coding subset and stable RNA sequences. I have compared the distribution in the genome of E. coli with the distributions found in the cataloged genomes of Salmonella typhimurium, Bacillus subtilis, and of coliphages lambda and T7. The distribution summed in both strands of the cataloged DNA differs statistically only in comparisons with lytic bacteriophage T7 because only the two strands of T7 show statistically dissimilar distributions. Despite similarities in tetranucleotide distribution, the pattern of codon complementarity in B. subtilis is different than that documented for E. coli. Thus, sequence asymmetry does not seem related to specific DNA function or to documented similarities or differences in codon bias. The sequence asymmetry of the E. coli genome may thus reflect a hitherto unsuspected pattern impressed on both strands of DNA which is or can be packaged into bacterial genomes.
我研究了GenBank第44版中所收录的大肠杆菌基因组中核苷酸序列不对称分布的潜在决定因素。我将给定序列目录或其衍生物中所有可能的四核苷酸的出现频率用作不对称性的比较指标。GenBank收录的链及其互补链显示出统计学上相似(而非互补)的分布。在蛋白质编码子集与整个基因组、编码子集与选定的非编码基因、编码子集与DNA其余部分以及编码子集与稳定RNA序列之间的比较中,分布在统计学上是相似的。我将大肠杆菌基因组中的分布与鼠伤寒沙门氏菌、枯草芽孢杆菌以及噬菌体λ和T7的已收录基因组中的分布进行了比较。已收录DNA的两条链上的分布总和在统计学上仅在与裂解性噬菌体T7的比较中有所不同,因为只有T7的两条链显示出统计学上不同的分布。尽管四核苷酸分布存在相似性,但枯草芽孢杆菌中的密码子互补模式与大肠杆菌中记录的不同。因此,序列不对称似乎与特定的DNA功能或密码子偏好方面已记录的相似性或差异无关。大肠杆菌基因组的序列不对称可能因此反映了一种迄今未被怀疑的模式,这种模式印刻在DNA的两条链上,而这两条链是或能够被包装进细菌基因组中的。