Messer Philipp W, Arndt Peter F
Max-Planck-Institute for Molecular Genetics, Berlin, Germany.
Mol Biol Evol. 2007 May;24(5):1190-7. doi: 10.1093/molbev/msm035. Epub 2007 Feb 24.
Nucleotide substitutions, insertions, and deletions constitute the principal molecular mechanisms generating genetic variation on small length scales. In contrast to substitutions, the nature of short DNA insertions and deletions (indels) is far less understood. With the recent availability of whole-genome multiple alignments between human and other primates, detailed investigations on indel characteristics and origin have come within reach. Here, we show that the majority of short (1-100 bp) DNA insertions in the human lineage are tandem duplications of directly adjacent sequence segments with conserved polarity. Indels in microsatellites comprise only a small fraction. The underlying molecular processes generating indels do not necessarily rely on the presence of preexisting duplicates, as would be expected for unequal crossing over, as well as replication slippage. Instead, our findings point toward a mechanism that preferentially occurs in the male germline and is not recombination-mediated. Surprisingly, nonframeshifting tandem duplications and deletions in coding regions still occur at approximately 50% of their genomic background rates. As is already well established in the context of gene and segmental duplications, our results demonstrate that duplications are also likely to constitute the predominant process for rapid generation of new genetic material and function on smaller scales.
核苷酸替换、插入和缺失是在小长度尺度上产生遗传变异的主要分子机制。与替换不同,短DNA插入和缺失(indel)的本质还远未被充分理解。随着人类与其他灵长类动物全基因组多重比对的近期可得,对indel特征和起源的详细研究已触手可及。在这里,我们表明人类谱系中大多数短(1 - 100 bp)DNA插入是具有保守极性的直接相邻序列片段的串联重复。微卫星中的indel仅占一小部分。产生indel的潜在分子过程不一定依赖于预先存在的重复序列,而这是不等交换以及复制滑移所预期的情况。相反,我们的发现指向一种优先发生在雄性生殖系且不是由重组介导的机制。令人惊讶的是,编码区的非移码串联重复和缺失仍以其基因组背景率的约50%发生。正如在基因和片段重复的背景下已经确立的那样,我们的结果表明重复也可能是在较小尺度上快速产生新遗传物质和功能的主要过程。