Jaillon Olivier, Dossat Carole, Eckenberg Ralph, Eiglmeier Karin, Segurens Béatrice, Aury Jean-Marc, Roth Charles W, Scarpelli Claude, Brey Paul T, Weissenbach Jean, Wincker Patrick
Genoscope/Centre National de Séquençage and CNRS UMR 8030, 91057 Evry Cedex, France.
Genome Res. 2003 Jul;13(7):1595-9. doi: 10.1101/gr.922503.
We performed genome-wide sequence comparisons at the protein coding level between the genome sequences of Drosophila melanogaster and Anopheles gambiae. Such comparisons detect evolutionarily conserved regions (ecores) that can be used for a qualitative and quantitative evaluation of the available annotations of both genomes. They also provide novel candidate features for annotation. The percentage of ecores mapping outside annotations in the A. gambiae genome is about fourfold higher than in D. melanogaster. The A. gambiae genome assembly also contains a high proportion of duplicated ecores, possibly resulting from artefactual sequence duplications in the genome assembly. The occurrence of 4063 ecores in the D. melanogaster genome outside annotations suggests that some genes are not yet or only partially annotated. The present work illustrates the power of comparative genomics approaches towards an exhaustive and accurate establishment of gene models and gene catalogues in insect genomes.