Kondrashov Fyodor A, Kondrashov Alexey S
Rybka Research Institute, 25138 Woodfield School Rd., Gaithersburg, MD 20882, USA.
J Theor Biol. 2006 Mar 21;239(2):141-51. doi: 10.1016/j.jtbi.2005.08.033. Epub 2005 Oct 20.
New genes commonly appear through complete or partial duplications of pre-existing genes. Duplications of long DNA segments are constantly produced by rare mutations, may become fixed in a population by selection or random drift, and are subject to divergent evolution of the paralogous sequences after fixation, although gene conversion can impede this process. New data shed some light on each of these processes. Mutations which involve duplications can occur through at least two different mechanisms, backward strand slippage during DNA replication and unequal crossing-over. The background rate of duplication of a complete gene in humans is 10(-9)-10(-10) per generation, although many genes located within hot-spots of large-scale mutation are duplicated much more often. Many gene duplications affect fitness strongly, and are responsible, through gene dosage effects, for a number of genetic diseases. However, high levels of intrapopulation polymorphism caused by presence or absence of long, gene-containing DNA segments imply that some duplications are not under strong selection. The polymorphism to fixation ratios appear to be approximately the same for gene duplications and for presumably selectively neutral nucleotide substitutions, which, according to the McDonald-Kreitman test, is consistent with selective neutrality of duplications. However, this pattern can also be due to negative selection against most of segregating duplications and positive selection for at least some duplications which become fixed. Patterns in post-fixation evolution of duplicated genes do not easily reveal the causes of fixations. Many gene duplications which became fixed recently in a variety of organisms were positively selected because the increased expression of the corresponding genes was beneficial. The effects of gene dosage provide a unified framework for studying all phases of the life history of a gene duplication. Application of well-known methods of evolutionary genetics to accumulating data on new, polymorphic, and fixed duplication will enhance our understanding of the role of natural selection in the evolution by gene duplication.
新基因通常通过已有基因的全部或部分复制而出现。长DNA片段的复制不断地由罕见突变产生,可能通过选择或随机漂变在种群中固定下来,并且在固定之后会经历旁系同源序列的趋异进化,尽管基因转换可能会阻碍这一过程。新数据为这些过程中的每一个都提供了一些线索。涉及复制的突变可以通过至少两种不同的机制发生,即DNA复制过程中的后向链滑动和不等交换。人类中完整基因的复制背景速率为每代10^(-9)-10^(-10),尽管许多位于大规模突变热点区域内的基因被复制的频率要高得多。许多基因复制对适应性有强烈影响,并通过基因剂量效应导致多种遗传疾病。然而,由长的、含基因的DNA片段的存在或缺失所引起的种群内高水平多态性意味着一些复制不受强烈选择。基因复制与可能是选择性中性的核苷酸替换的多态性与固定比率似乎大致相同,根据麦克唐纳-克雷特曼检验,这与复制的选择性中性是一致的。然而,这种模式也可能是由于对大多数分离的复制进行负选择以及对至少一些固定下来的复制进行正选择所致。复制基因固定后的进化模式不容易揭示固定的原因。最近在各种生物体中固定下来的许多基因复制是经过正选择的,因为相应基因表达的增加是有益的。基因剂量效应为研究基因复制生命史的所有阶段提供了一个统一的框架。将著名的进化遗传学方法应用于积累的关于新的、多态的和固定的复制的数据,将增强我们对自然选择在基因复制进化中作用的理解。