Clermont Université, Université d'Auvergne, Laboratoire: Microorganismes Génome et Environnement, BP 10448, F-63000 CLERMONT-FERRAND.
BMC Genomics. 2009 Dec 15;10:607. doi: 10.1186/1471-2164-10-607.
Microsporidia are obligate intracellular eukaryotic parasites with genomes ranging in size from 2.3 Mbp to more than 20 Mbp. The extremely small (2.9 Mbp) and highly compact (approximately 1 gene/kb) genome of the human parasite Encephalitozoon cuniculi has been fully sequenced. The aim of this study was to characterize noncoding motifs that could be involved in regulation of gene expression in E. cuniculi and to show whether these motifs are conserved among the phylum Microsporidia.
To identify such signals, 5' and 3'RACE-PCR experiments were performed on different E. cuniculi mRNAs. This analysis confirmed that transcription overrun occurs in E. cuniculi and may result from stochastic recognition of the AAUAAA polyadenylation signal. Such experiments also showed highly reduced 5'UTR's (<7 nts). Most of the E. cuniculi genes presented a CCC-like motif immediately upstream from the coding start. To characterize other signals involved in differential transcriptional regulation, we then focused our attention on the gene family coding for ribosomal proteins. An AAATTT-like signal was identified upstream from the CCC-like motif. In rare cases the cytosine triplet was shown to be substituted by a GGG-like motif. Comparative genomic studies confirmed that these different signals are also located upstream from genes encoding ribosomal proteins in other microsporidian species including Antonospora locustae, Enterocytozoon bieneusi, Anncaliia algerae (syn. Brachiola algerae) and Nosema ceranae. Based on these results a systematic analysis of the approximately 2000 E. cuniculi coding DNA sequences was then performed and brings to highlight that 364 translation initiation codons (18.29% of total CDSs) had been badly predicted.
We identified various signals involved in the maturation of E. cuniculi mRNAs. Presence of such signals, in phylogenetically distant microsporidian species, suggests that a common regulatory mechanism exists among the microsporidia. Furthermore, 5'UTRs being strongly reduced, these signals can be used to ensure the accurate prediction of translation initiation codons for microsporidian genes and to improve microsporidian genome annotation.
微孢子虫是一种专性细胞内真核寄生虫,其基因组大小从 2.3 Mbp 到 20 Mbp 以上不等。人类寄生虫脑炎微孢子虫的极小(2.9 Mbp)和高度紧凑(约 1 个基因/kb)基因组已经完全测序。本研究的目的是描述可能参与脑炎微孢子虫基因表达调控的非编码模体,并展示这些模体是否在微孢子虫门中保守。
为了识别这些信号,我们对不同脑炎微孢子虫的 mRNA 进行了 5'和 3'RACE-PCR 实验。该分析证实,脑炎微孢子虫中转录超越发生,可能是由于随机识别 AAUAAA 多聚腺苷酸化信号。此类实验还显示出高度简化的 5'UTR(<7 个核苷酸)。大多数脑炎微孢子虫基因在编码起始点的上游都有一个 CCC 样模体。为了描述参与差异转录调控的其他信号,我们将注意力集中在编码核糖体蛋白的基因家族上。在 CCC 样模体的上游发现了一个 AAATTT 样信号。在少数情况下,三碱基胞嘧啶被 GGG 样模体取代。比较基因组研究证实,这些不同的信号也位于编码核糖体蛋白的基因上游,这些基因存在于其他微孢子虫物种中,包括 Antonospora locustae、Enterocytozoon bieneusi、Anncaliia algerae(原名 Brachiola algerae)和 Nosema ceranae。基于这些结果,我们对大约 2000 个脑炎微孢子虫编码 DNA 序列进行了系统分析,并强调了 364 个翻译起始密码子(总 CDS 的 18.29%)的预测存在严重错误。
我们确定了参与脑炎微孢子虫 mRNA 成熟的各种信号。这些信号存在于系统发育上相距甚远的微孢子虫物种中,表明微孢子虫之间存在共同的调控机制。此外,由于 5'UTR 被强烈简化,这些信号可用于准确预测微孢子虫基因的翻译起始密码子,并改善微孢子虫基因组注释。