Mrázek J, Karlin S
Department of Mathematics, Stanford University, 450 Serra Mall, Bldg. 380, Stanford, CA 94305-2125, USA.
Proc Natl Acad Sci U S A. 1998 Mar 31;95(7):3720-5. doi: 10.1073/pnas.95.7.3720.
Several bacterial genomes exhibit preference for G over C on the DNA leading strand extending from the origin of replication to the ter-region in the genomes of Escherichia coli, Mycoplasma genitalium, Bacillus subtilis, and marginally in Haemophilus influenzae, Mycoplasma pneumoniae, and Helicobacter pylori. Strand compositional asymmetry is not observed in the cyanobacterium Synechocystis sp. genome nor in the archaeal genomes of Methanococcus jannaschii, Methanobacterium thermoautotrophicum, and Archaeoglobus fulgidus. A strong strand compositional asymmetry is observed in beta-type but not alpha- or gamma-type human herpesviruses featuring G > C downstream of oriL and C > G upstream of oriL. Dinucleotide relative abundances (i.e., dinucleotide representations normalized by the component nucleotide frequencies) are consonant with respect to the leading and lagging strands. Strand compositional asymmetry may reflect on differences in replication synthesis of the leading versus lagging strand, on differences between template and coding strand associated with transcription-coupled repair mechanisms, on differences in gene density between the two strands, on differences in residue and codon biases in relation to gene function, expression level, or operon organization, or on differences in single or context-dependent base mutational rates. The absence of strand asymmetry in the archaeal genomes may reflect the presence of multiple origins of replication.
在大肠杆菌、生殖支原体、枯草芽孢杆菌的基因组中,以及在流感嗜血杆菌、肺炎支原体和幽门螺杆菌中略有体现,从复制起点延伸至基因组中ter区域的DNA前导链上,几种细菌基因组表现出对G的偏好超过C。在蓝藻集胞藻属基因组以及詹氏甲烷球菌、嗜热自养甲烷杆菌和嗜热栖热菌的古菌基因组中未观察到链组成不对称。在β型而非α型或γ型人类疱疹病毒中观察到强烈的链组成不对称,其oriL下游G > C,oriL上游C > G。二核苷酸相对丰度(即通过组成核苷酸频率归一化的二核苷酸表示)在前导链和后随链方面是一致的。链组成不对称可能反映在前导链与后随链复制合成的差异、与转录偶联修复机制相关的模板链和编码链之间的差异、两条链之间基因密度的差异、与基因功能、表达水平或操纵子组织相关的残基和密码子偏好的差异,或者单碱基或上下文依赖碱基突变率的差异。古菌基因组中不存在链不对称可能反映了多个复制起点的存在。