Yim Won Cheol, Lee Byung-Moo, Jang Cheol Seong
Department of Plant Biotechnology, Dongguk University, Seoul, 100-715, South Korea.
Mol Genet Genomics. 2009 May;281(5):483-93. doi: 10.1007/s00438-009-0425-y. Epub 2009 Jan 31.
Duplicate genes are believed to be a major source of new gene functions over evolutionary time. In order to evaluate the evolutionary dynamics of rice duplicate genes, formed principally by paleoployploidization prior to the speciation of the Poaceae family, we have employed a public microarray dataset including 155 gene expression omnibus sample plates and bioinformatics tools. At least 57.4% of old approximately 70 million years ago (MYA) duplicate gene pairs exhibit divergences in expression over the given experimental set, whereas at least 50.9% of young approximately 7.7-MYA duplicate gene pairs were shown to be divergent. When grouping the rice duplicate genes according to functional categories, we noted a striking and significant enrichment of divergent duplicate metabolism-associated genes, as compared to that observed in non-divergent duplicate genes. While both non-synonymous substitution (Ka) and synonymous substitution (Ks) values between non- and divergent duplicate gene pairs evidenced significant differences, the Ka/Ks values between them exhibited no significant differences. Interestingly, the average numbers of conserved motifs of the duplicate gene pairs revealed a pattern of decline along with an increase in expression diversity, partially supporting the subfunctionalization model with degenerative complementation in regulatory motifs. Duplicate gene pairs with high local similarity (HLS) segments, which might be formed via conversion between rice paleologs, evidenced higher expression correlations than were observed in the gene pairs without the HLS segments; this probably resulted in an increased likelihood of gene conversion in promoters of the gene pairs harboring HLS segments. More than 30% [corrected] of the rice gene families exhibited similar high expression diversity between members as compared to that of randomly selected gene pairs. These findings are likely reflective of the evolutionary dynamics of rice duplicate genes for gene retention.
在进化过程中,重复基因被认为是新基因功能的主要来源。为了评估水稻重复基因的进化动态,这些基因主要由禾本科物种形成之前的古多倍体化产生,我们使用了一个包含155个基因表达综合样本板的公共微阵列数据集和生物信息学工具。在给定的实验集中,至少57.4%的大约7000万年前(MYA)的古老重复基因对在表达上存在差异,而至少50.9%的大约770万年前的年轻重复基因对也表现出差异。当根据功能类别对水稻重复基因进行分组时,我们注意到与非差异重复基因相比,差异重复代谢相关基因显著富集。虽然非差异和差异重复基因对之间的非同义替换(Ka)和同义替换(Ks)值都显示出显著差异,但它们之间的Ka/Ks值没有显著差异。有趣的是,重复基因对的保守基序平均数量呈现出随着表达多样性增加而下降的模式,部分支持了调控基序中退化互补的亚功能化模型。具有高局部相似性(HLS)片段的重复基因对可能是通过水稻古同源基因之间的转换形成的,与没有HLS片段的基因对相比,它们表现出更高的表达相关性;这可能导致在含有HLS片段的基因对启动子中基因转换的可能性增加。超过30%[校正后]的水稻基因家族成员之间表现出与随机选择的基因对相似的高表达多样性。这些发现可能反映了水稻重复基因在基因保留方面的进化动态。