Throude Mickael, Bolot Stéphanie, Bosio Mickael, Pont Caroline, Sarda Xavier, Quraishi Umar Masood, Bourgis Fabienne, Lessard Philippe, Rogowsky Peter, Ghesquiere Alain, Murigneux Alain, Charmet Gilles, Perez Pascual, Salse Jérôme
UMR 1095 INRA/UBP, Génétique, Diversité et Ecophysiologie des Céréales (GDEC), Domaine de Crouelle, 234, 63100 Clermont Ferrand, France.
Nucleic Acids Res. 2009 Mar;37(4):1248-59. doi: 10.1093/nar/gkn1048. Epub 2009 Jan 9.
Having a well-known history of genome duplication, rice is a good model for studying structural and functional evolution of paleo duplications. Improved sequence alignment criteria were used to characterize 10 major chromosome-to-chromosome duplication relationships associated with 1440 paralogous pairs, covering 47.8% of the rice genome, with 12.6% of genes that are conserved within sister blocks. Using a micro-array experiment, a genome-wide expression map has been produced, in which 2382 genes show significant differences of expression in root, leaf and grain. By integrating both structural (1440 paralogous pairs) and functional information (2382 differentially expressed genes), we identified 115 paralogous gene pairs for which at least one copy is differentially expressed in one of the three tissues. A vast majority of the 115 paralogous gene pairs have been neofunctionalized or subfunctionalized as 88%, 89% and 96% of duplicates, respectively, expressed in grain, leaf and root show distinct expression patterns. On the basis of a Gene Ontology analysis, we have identified and characterized the gene families that have been structurally and functionally preferentially retained in the duplication showing that the vast majority (>85%) of duplicated have been either lost or have been subfunctionalized or neofunctionalized during 50-70 million years of evolution.
水稻有着广为人知的基因组复制历史,是研究古多倍体结构和功能进化的良好模型。采用改进的序列比对标准来表征与1440个旁系同源基因对相关的10种主要染色体间复制关系,这些旁系同源基因对覆盖了47.8%的水稻基因组,其中12.6%的基因在姐妹区域内保守。通过微阵列实验,绘制了全基因组表达图谱,其中2382个基因在根、叶和籽粒中表现出显著的表达差异。通过整合结构信息(1440个旁系同源基因对)和功能信息(2382个差异表达基因),我们鉴定出115个旁系同源基因对,其中至少有一个拷贝在三种组织之一中差异表达。在这115个旁系同源基因对中,绝大多数已发生了新功能化或亚功能化,分别在籽粒、叶和根中表达的重复基因中,有88%、89%和96%表现出不同的表达模式。基于基因本体分析,我们鉴定并表征了在复制过程中在结构和功能上优先保留的基因家族,结果表明,在5000万至7000万年的进化过程中,绝大多数(>85%)的重复基因已丢失或发生了亚功能化或新功能化。