Williams Bryony A P, Slamovits Claudio H, Patron Nicola J, Fast Naomi M, Keeling Patrick J
Canadian Institute for Advanced Research, Botany Department, University of British Columbia, 3529-6270 University Boulevard, Vancouver, BC, Canada V6T 1Z4.
Proc Natl Acad Sci U S A. 2005 Aug 2;102(31):10936-41. doi: 10.1073/pnas.0501321102. Epub 2005 Jul 21.
The gene density of eukaryotic nuclear genomes is generally low relative to prokaryotes, but several eukaryotic lineages (many parasites or endosymbionts) have independently evolved highly compacted, gene-dense genomes. The best studied of these are the microsporidia, highly adapted fungal parasites, and the nucleomorphs, relict nuclei of endosymbiotic algae found in cryptomonads and chlorarachniophytes. These systems are now models for the effects of compaction on the form and dynamics of the nuclear genome. Here we report a large-scale investigation of gene expression from compacted eukaryotic genomes. We have conducted EST surveys of the microsporidian Antonospora locustae and nucleomorphs of the cryptomonad Guillardia theta and the chlorarachniophyte Bigelowiella natans. In all three systems we find a high frequency of mRNA molecules that encode sequence from more than one gene. There is no bias for these genes to be on the same strand, so it is unlikely that these mRNAs represent operons. Instead, compaction appears to have reduced the intergenic regions to such an extent that control elements like promoters and terminators have been forced into or beyond adjacent genes, resulting in long untranslated regions that encode other genes. Normally, transcriptional overlap can interfere with expression of a gene, but these genomes cope with high frequencies of overlap and with termination signals within expressed genes. These findings also point to serious practical difficulties in studying expression in compacted genomes, because many techniques, such as arrays or serial analysis of gene expression will be misleading.
与原核生物相比,真核细胞核基因组的基因密度通常较低,但一些真核生物谱系(许多寄生虫或内共生体)已经独立进化出高度紧凑、基因密集的基因组。其中研究得最充分的是微孢子虫,即高度适应的真菌寄生虫,以及核质体,即在隐藻和绿藻门中发现的内共生藻类的残余细胞核。这些系统现在是基因组压缩对核基因组形式和动态影响的模型。在这里,我们报告了对紧凑真核基因组基因表达的大规模研究。我们对微孢子虫蝗虫微孢子虫以及隐藻吉氏隐藻和绿藻门毕氏绿藻的核质体进行了EST调查。在所有这三个系统中,我们发现编码来自多个基因序列的mRNA分子的频率很高。这些基因在同一条链上没有偏向性,因此这些mRNA不太可能代表操纵子。相反,基因组压缩似乎已将基因间区域减少到这样的程度,以至于启动子和终止子等控制元件被迫进入相邻基因或延伸到相邻基因之外,从而产生了编码其他基因的长非翻译区。正常情况下,转录重叠会干扰基因的表达,但这些基因组能够应对高频率的重叠以及表达基因内的终止信号。这些发现还指出了在研究紧凑基因组中的表达时存在的严重实际困难,因为许多技术,如基因芯片或基因表达序列分析,可能会产生误导。