Delaye Luis, Deluna Alexander, Lazcano Antonio, Becerra Arturo
Facultad de Ciencias, Universidad Nacional Autónoma de México, Apdo. Postal 70-407, Cd. Universitaria, 04510 México DF, México.
BMC Evol Biol. 2008 Jan 28;8:31. doi: 10.1186/1471-2148-8-31.
Overlapped genes originate by a) loss of a stop codon among contiguous genes coded in different frames; b) shift to an upstream initiation codon of one of the contiguous genes; or c) by overprinting, whereby a novel open reading frame originates through point mutation inside an existing gene. Although overlapped genes are common in viruses, it is not clear whether overprinting has led to new genes in prokaryotes.
Here we report the origin of a new gene through overprinting in Escherichia coli K12. The htgA gene coding for a positive regulator of the sigma 32 heat shock promoter arose by point mutation in a 123/213 phase within an open reading frame (yaaW) of unknown function, most likely in the lineage leading to E. coli and Shigella sp. Further, we show that yaaW sequences coding for htgA genes have a slower evolutionary rate than those lacking an overlapped htgA gene.
While overprinting has been shown to be rather frequent in the evolution of new genes in viruses, our results suggest that this mechanism has also contributed to the origin of a novel gene in a prokaryote. We propose the term janolog (from Jano, the two-faced Roman god) to describe the homology relationship that holds between two genes when one originated through overprinting of the other. One cannot dismiss the possibility that at least a small fraction of the large number of novel ORPhan genes detected in pan-genome and metagenomic studies arose by overprinting.
重叠基因的产生方式有:a) 不同阅读框编码的相邻基因中终止密码子缺失;b) 相邻基因之一的起始密码子向上游移动;或c) 通过套印,即通过现有基因内部的点突变产生新的开放阅读框。虽然重叠基因在病毒中很常见,但套印是否导致了原核生物中的新基因尚不清楚。
在此我们报道了大肠杆菌K12中通过套印产生新基因的情况。编码σ32热休克启动子正调控因子的htgA基因,是由一个功能未知的开放阅读框(yaaW)内123/213相位的点突变产生的,最有可能发生在导致大肠杆菌和志贺氏菌属的谱系中。此外,我们表明,编码htgA基因的yaaW序列的进化速率比那些没有重叠htgA基因的序列要慢。
虽然套印在病毒新基因进化中已被证明相当频繁,但我们的结果表明,这种机制也促成了原核生物中一个新基因的产生。我们提出“janolog”(源自两面罗马神亚努斯)这一术语,来描述当一个基因通过另一个基因的套印产生时,这两个基因之间存在的同源关系。人们不能排除这样一种可能性,即在泛基因组和宏基因组研究中检测到的大量新孤儿基因中,至少有一小部分是通过套印产生的。