Pan Deng, Zhang Liqing
Department of Computer Science, Virginia Tech, Torgerson Hall, Blacksburg, Virginia 24061-0106, USA.
Genome Biol. 2007;8(8):R158. doi: 10.1186/gb-2007-8-8-r158.
The rate of gene duplication is an important parameter in the study of evolution, but the influence of gene conversion and technical problems have confounded previous attempts to provide a satisfying estimate. We propose a new strategy to estimate the rate that involves separate quantification of the rates of two different mechanisms of gene duplication and subsequent combination of the two rates, based on their respective contributions to the overall gene duplication rate.
Previous estimates of gene duplication rates are based on small gene families. Therefore, to assess the applicability of this to families of all sizes, we looked at both two-copy gene families and the entire genome. We studied unequal crossover and retrotransposition, and found that these mechanisms of gene duplication are largely independent and account for a substantial amount of duplicated genes. Unequal crossover contributed more to duplications in the entire genome than retrotransposition did, but this contribution was significantly less in two-copy gene families, and duplicated genes arising from this mechanism are more likely to be retained. Combining rates of duplication using the two mechanisms, we estimated the overall rates to be from approximately 0.515 to 1.49 x 10(-3) per gene per million years in human, and from approximately 1.23 to 4.23 x 10(-3) in mouse. The rates estimated from two-copy gene families are always lower than those from the entire genome, and so it is not appropriate to use small families to estimate the rate for the entire genome.
We present a novel strategy for estimating gene duplication rates. Our results show that different mechanisms contribute differently to the evolution of small and large gene families.
基因复制速率是进化研究中的一个重要参数,但基因转换的影响和技术问题使之前试图提供令人满意估计值的尝试变得复杂。我们提出了一种新的估计速率的策略,该策略涉及分别量化基因复制的两种不同机制的速率,并根据它们对总体基因复制速率的各自贡献将这两种速率进行合并。
先前对基因复制速率的估计是基于小基因家族的。因此,为了评估这一方法对所有大小基因家族的适用性,我们研究了双拷贝基因家族和整个基因组。我们研究了不等交换和逆转座,发现这些基因复制机制在很大程度上是独立的,并且占了大量的重复基因。不等交换对整个基因组中复制的贡献比逆转座更大,但在双拷贝基因家族中这种贡献明显较小,并且由这种机制产生的重复基因更有可能被保留。使用这两种机制合并复制速率后,我们估计人类中每个基因每百万年的总体速率约为0.515至1.49×10⁻³,小鼠中约为1.23至4.23×10⁻³。从双拷贝基因家族估计的速率总是低于从整个基因组估计的速率,因此使用小家族来估计整个基因组的速率是不合适的。
我们提出了一种估计基因复制速率的新策略。我们的结果表明,不同机制对大小基因家族的进化贡献不同。