Fredman David, White Stefan J, Potter Susanna, Eichler Evan E, Den Dunnen Johan T, Brookes Anthony J
Center for Genomics and Bioinformatics, Karolinska Institute, Berzelius väg 35, S-171 77 Stockholm, Sweden.
Nat Genet. 2004 Aug;36(8):861-6. doi: 10.1038/ng1401. Epub 2004 Jul 11.
There is uncertainty about the true nature of predicted single-nucleotide polymorphisms (SNPs) in segmental duplications (duplicons) and whether these markers genuinely exist at increased density as indicated in public databases. We explored these issues by genotyping 157 predicted SNPs in duplicons and control regions in normal diploid genomes and fully homozygous complete hydatidiform moles. Our data identified many true SNPs in duplicon regions and few paralogous sequence variants. Twenty-eight percent of the polymorphic duplicon sequences we tested involved multisite variation, a new type of polymorphism representing the sum of the signals from many individual duplicon copies that vary in sequence content due to duplication, deletion or gene conversion. Multisite variations can masquerade as normal SNPs when genotyped. Given that duplicons comprise at least 5% of the genome and many are yet to be annotated in the genome draft, effective strategies to identify multisite variation must be established and deployed.
对于节段性重复(复制子)中预测的单核苷酸多态性(SNP)的真实性质,以及这些标记是否真如公共数据库所示以增加的密度存在,存在不确定性。我们通过对正常二倍体基因组和完全纯合的完全性葡萄胎中的复制子和对照区域的157个预测SNP进行基因分型,探讨了这些问题。我们的数据在复制子区域鉴定出许多真实的SNP,而旁系同源序列变异很少。我们测试的多态性复制子序列中有28%涉及多位点变异,这是一种新型的多态性,代表了许多因复制、缺失或基因转换而序列内容不同的单个复制子拷贝信号的总和。多位点变异在进行基因分型时可能会伪装成正常的SNP。鉴于复制子至少占基因组的5%,且许多复制子在基因组草图中尚未注释,必须建立并采用有效的策略来识别多位点变异。