Bioinformatics Research Center, C. F. Møllers Alle, Arhus, Denmark.
PLoS One. 2010 Jan 19;5(1):e8650. doi: 10.1371/journal.pone.0008650.
Although insertions and deletions (indels) account for a sizable portion of genetic changes within and among species, they have received little attention because they are difficult to type, are alignment dependent and their underlying mutational process is poorly understood. A fundamental question in this respect is whether insertions and deletions are governed by similar or different processes and, if so, what these differences are.
METHODOLOGY/PRINCIPAL FINDINGS: We use published resequencing data from Seattle SNPs and NIEHS human polymorphism databases to construct a genomewide data set of short polymorphic insertions and deletions in the human genome (n = 6228). We contrast these patterns of polymorphism with insertions and deletions fixed in the same regions since the divergence of human and chimpanzee (n = 10,546). The macaque genome is used to resolve all indels into insertions and deletions. We find that the ratio of deletions to insertions is greater within humans than between human and chimpanzee. Deletions segregate at lower frequency in humans, providing evidence for deletions being under stronger purifying selection than insertions. The insertion and deletion rates correlate with several genomic features and we find evidence that both insertions and deletions are associated with point mutations. Finally, we find no evidence for a direct effect of the local recombination rate on the insertion and deletion rate.
CONCLUSIONS/SIGNIFICANCE: Our data strongly suggest that deletions are more deleterious than insertions but that insertions and deletions are otherwise generally governed by the same genomic factors.
尽管插入和缺失(indels)在物种内和物种间的遗传变化中占据相当大的比例,但由于它们难以分型、依赖于比对以及其潜在的突变过程理解不足,因此一直受到关注。在这方面的一个基本问题是插入和缺失是否受到相似或不同的过程控制,如果是这样,这些差异是什么。
方法/主要发现:我们使用西雅图 SNPs 和 NIEHS 人类多态性数据库中发布的重测序数据,构建了人类基因组中短多态性插入和缺失的全基因组数据集(n = 6228)。我们将这些多态性模式与人类和黑猩猩分化以来同一区域固定的插入和缺失进行对比(n = 10546)。使用猕猴基因组将所有的插入和缺失解析为插入和缺失。我们发现,人类内部的缺失与插入的比例大于人类与黑猩猩之间的缺失与插入的比例。人类中缺失的分离频率较低,这表明缺失受到更强的净化选择。插入和缺失率与几种基因组特征相关,我们发现证据表明插入和缺失都与点突变有关。最后,我们没有发现局部重组率对插入和缺失率有直接影响的证据。
结论/意义:我们的数据强烈表明,缺失比插入更具危害性,但插入和缺失在其他方面通常受到相同的基因组因素的控制。