Plant Genome Mapping Laboratory, University of Georgia, Athens, GA 30602, USA.
BMC Genomics. 2013 Sep 24;14:652. doi: 10.1186/1471-2164-14-652.
Divergence in gene structure following gene duplication is not well understood. Gene duplication can occur via whole-genome duplication (WGD) and single-gene duplications including tandem, proximal and transposed duplications. Different modes of gene duplication may be associated with different types, levels, and patterns of structural divergence.
In Arabidopsis thaliana, we denote levels of structural divergence between duplicated genes by differences in coding-region lengths and average exon lengths, and the number of insertions/deletions (indels) and maximum indel length in their protein sequence alignment. Among recent duplicates of different modes, transposed duplicates diverge most dramatically in gene structure. In transposed duplications, parental loci tend to have longer coding-regions and exons, and smaller numbers of indels and maximum indel lengths than transposed loci, reflecting biased structural changes in transposed duplications. Structural divergence increases with evolutionary time for WGDs, but not transposed duplications, possibly because of biased gene losses following transposed duplications. Structural divergence has heterogeneous relationships with nucleotide substitution rates, but is consistently positively correlated with gene expression divergence. The NBS-LRR gene family shows higher-than-average levels of structural divergence.
Our study suggests that structural divergence between duplicated genes is greatly affected by the mechanisms of gene duplication and may be not proportional to evolutionary time, and that certain gene families are under selection on rapid evolution of gene structure.
基因结构在基因复制后的分歧尚不清楚。基因复制可以通过全基因组复制(WGD)和单基因复制(包括串联、近端和转座复制)发生。不同的基因复制模式可能与不同类型、水平和结构分歧模式相关。
在拟南芥中,我们通过编码区长度和平均外显子长度的差异,以及蛋白质序列比对中外显子插入/缺失(indels)的数量和最大 indel 长度,来表示重复基因之间结构分歧的程度。在不同模式的近期重复中,转座重复在基因结构上的分歧最为显著。在转座重复中,亲本基因座往往具有较长的编码区和外显子,以及较少的 indels 和最大 indel 长度,这反映了转座重复中的偏向性结构变化。结构分歧随着 WGD 的进化时间而增加,但不随转座重复而增加,这可能是因为转座重复后基因丢失存在偏向性。结构分歧与核苷酸替换率呈异质关系,但与基因表达分歧呈一致的正相关。NBS-LRR 基因家族表现出高于平均水平的结构分歧。
我们的研究表明,重复基因之间的结构分歧受基因复制机制的影响很大,可能与进化时间不成比例,某些基因家族在基因结构的快速进化中受到选择。