Price Morgan N, Alm Eric J, Arkin Adam P
Lawrence Berkeley National Lab, 1 Cyclotron Road, Mailstop 977-152, Berkeley, CA 94720, USA.
Nucleic Acids Res. 2005 Jun 7;33(10):3224-34. doi: 10.1093/nar/gki638. Print 2005.
In bacteria, most genes are on the leading strand of replication, a phenomenon attributed to collisions between the DNA and RNA polymerases. In Escherichia coli, these collisions slow the movement of the replication fork through actively transcribed genes only if they are coded on the lagging strand. For genes on both strands, however, these collisions sever nascent transcripts and interrupt gene expression. Based on these observations, we propose a new theory to explain strand bias: genes whose expression is important for fitness are selected to the leading strand because this reduces the duration of these interruptions. Our theory predicts that multi-gene operons, which are subject to longer interruptions, should be more strongly selected to the leading strand than singleton transcripts. We show that this is true even after controlling for the tendency for essential genes, which are strongly biased to the leading strand, to occur in operons. Our theory also predicts that other factors that are associated with strand bias should have stronger effects for genes that are in operons. We find that expression level and phylogenetic ubiquity are correlated with strand bias for both essential and non-essential genes, but only for genes in operons.
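The stratified comparison described in the abstract (operon versus singleton genes, examined separately for essential and non-essential genes) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' method: the gene records, the tuple layout, and the `leading_fraction` helper are hypothetical, and the toy values are invented rather than taken from the paper.

```python
from collections import defaultdict

# Hypothetical toy records: (on_leading_strand, in_operon, essential).
# These values are invented for illustration; they are NOT data from the paper.
GENES = [
    (True, True, True), (True, True, True), (True, True, True), (False, True, True),
    (True, False, True), (False, False, True),
    (True, True, False), (True, True, False), (True, True, False), (False, True, False),
    (True, False, False), (True, False, False), (False, False, False), (False, False, False),
]


def leading_fraction(genes):
    """Return {(essential, in_operon): fraction of genes on the leading strand}."""
    counts = defaultdict(lambda: [0, 0])  # [leading count, total count]
    for leading, in_operon, essential in genes:
        cell = counts[(essential, in_operon)]
        cell[0] += int(leading)
        cell[1] += 1
    return {key: lead / total for key, (lead, total) in counts.items()}


fractions = leading_fraction(GENES)

# Compare operon and singleton genes within each essentiality stratum, so that
# the tendency of essential genes to occur in operons cannot drive the result.
for essential in (True, False):
    operon = fractions[(essential, True)]
    singleton = fractions[(essential, False)]
    print(f"essential={essential}: operon={operon:.2f}, singleton={singleton:.2f}")
```

Splitting by essentiality before comparing is the simplest way to hold that factor fixed. A real analysis would also need a significance test and genome-scale annotations, which this sketch leaves out.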