Belkorchia Abdel, Gasc Cyrielle, Polonais Valérie, Parisot Nicolas, Gallois Nicolas, Ribière Céline, Lerat Emmanuelle, Gaspin Christine, Pombert Jean-François, Peyret Pierre, Peyretaillade Eric
Clermont Université, Université d'Auvergne, Laboratoire "Microorganismes: Génome et Environnement", BP 10448, F-63000, Clermont-Ferrand, France; CNRS, UMR 6023, LMGE, F-63171, Aubière, France.
Clermont Université, Université d'Auvergne, EA 4678 CIDAM, BP 10448, F-63001, Clermont-Ferrand, France.
PLoS One. 2015 Sep 30;10(9):e0139075. doi: 10.1371/journal.pone.0139075. eCollection 2015.
The proper prediction of the gene catalogue of an organism is essential to obtain a representative snapshot of its overall lifestyle, especially when it is not amenable to culturing. Microsporidia are obligate intracellular, sometimes hard to culture, eukaryotic parasites known to infect members of every animal phylum. To date, sequencing and annotation of microsporidian genomes have revealed a poor gene complement with highly reduced gene sizes. In the present paper, we investigated whether such small gene sizes may have biased the methodologies used for genome annotation, with an emphasis on the prediction of small coding sequences (CDSs). Using better delineated intergenic regions from four Encephalitozoon genomes, we predicted de novo new small CDSs with sizes ranging from 78 to 255 bp (median 168 bp) and corroborated these predictions by RACE-PCR experiments in Encephalitozoon cuniculi. Most of the newly found genes are present in other distantly related microsporidian species, suggesting their biological relevance. The present study provides a better framework for annotating microsporidian genomes and for training and evaluating new computational methods dedicated to detecting ultra-small genes in various organisms.
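As an illustration of what searching intergenic regions for small CDSs in the reported 78 to 255 bp range can involve, the following is a minimal Python sketch of a six-frame open reading frame scan. It is not the pipeline used in the study, which the abstract does not describe. The names `find_small_orfs` and `reverse_complement` are hypothetical, and the sketch assumes an ATG start codon, the standard stop codons, and a CDS length that includes the stop codon.

```python
# Minimal sketch (not the authors' method): scan a nucleotide sequence,
# e.g. an intergenic region, for small ATG..stop open reading frames.
# Assumptions: ATG start, standard stop codons, length counts the stop codon.

STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_small_orfs(seq, min_len=78, max_len=255):
    """Return (start, end, strand) tuples for ORFs of min_len..max_len bp.

    Coordinates are 0-based, half-open, and given on the forward strand.
    For each stop codon, only the first in-frame ATG after the previous
    stop is used, so each stop yields at most one (the longest) ORF.
    """
    seq = seq.upper()
    n = len(seq)
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None  # position of the currently open ATG, if any
            for i in range(frame, n - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    end = i + 3
                    if min_len <= end - start <= max_len:
                        if strand == "+":
                            hits.append((start, end, "+"))
                        else:
                            # Map reverse-strand coordinates back to forward
                            hits.append((n - end, n - start, "-"))
                    start = None  # close this ORF and look for the next ATG
    return hits
```

A scan of this kind only lists candidates. Support such as the RACE-PCR experiments and the conservation in distantly related species reported in the abstract would still be needed before treating any hit as a gene.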