Volkman Sarah K, Lozovsky Elena, Barry Alyssa E, Bedford Trevor, Bethke Lara, Myrick Alissa, Day Karen P, Hartl Daniel L, Wirth Dyann F, Sawyer Stanley A
Department of Immunology and Infectious Disease, Harvard School of Public Health, Boston, MA, USA.
Gene. 2007 Jan 31;387(1-2):1-6. doi: 10.1016/j.gene.2006.07.026. Epub 2006 Aug 14.
The density and distribution of single-nucleotide polymorphisms (SNPs) across the genome has important implications for linkage disequilibrium mapping and association studies, and the level of simple-sequence microsatellite polymorphisms has important implications for the use of oligonucleotide hybridization methods to genotype SNPs. To assess the density of these types of polymorphisms in P. falciparum, we sampled introns and noncoding DNA upstream and downstream of coding regions among a variety of geographically diverse parasites. Across 36,229 base pairs of noncoding sequence representing 41 genetic loci, a total of 307 polymorphisms including 248 polymorphic microsatellites and 39 SNPs were identified. We found a significant excess of microsatellite polymorphisms having a repeat unit length of one or two, compared to those with longer repeat lengths, as well as a nonrandom distribution of SNP polymorphisms. Almost half of the SNPs localized to only three of the 41 genetic loci sampled. Furthermore, we find significant differences in the frequency of polymorphisms across the two chromosomes (2 and 3) examined most extensively, with an excess of SNPs and a surplus of polymorphic microsatellites on chromosome 3 as compared to chromosome 2 (P=0.0001). Furthermore, at some individual genetic loci we also find a nonrandom distribution of polymorphisms between coding and flanking noncoding sequences, where completely monomorphic regions may flank highly polymorphic genes. These data, combined with our previous findings of nonrandom distribution of SNPs across chromosome 2, suggest that the Plasmodium falciparum genome may be a mosaic with regard to genetic diversity, containing chromosomal regions that are highly polymorphic interspersed with regions that are much less polymorphic.
单核苷酸多态性(SNP)在基因组中的密度和分布对连锁不平衡图谱绘制及关联研究具有重要意义,而简单序列微卫星多态性的水平对使用寡核苷酸杂交方法进行SNP基因分型具有重要意义。为评估恶性疟原虫中这些多态性类型的密度,我们在多种地理来源不同的寄生虫中,对编码区上游和下游的内含子及非编码DNA进行了采样。在代表41个基因座的36,229个碱基对的非编码序列中,共鉴定出307个多态性位点,包括248个多态性微卫星和39个SNP。我们发现,与重复长度较长的微卫星多态性相比,重复单元长度为1或2的微卫星多态性显著过量,并且SNP多态性存在非随机分布。几乎一半的SNP仅定位于所采样的41个基因座中的3个。此外,我们发现,在所广泛研究的两条染色体(2号和3号)上,多态性频率存在显著差异,与2号染色体相比,3号染色体上的SNP过量且多态性微卫星过剩(P = 0.0001)。此外,在一些单个基因座上,我们还发现编码序列和侧翼非编码序列之间的多态性存在非随机分布,其中完全单态性区域可能位于高度多态性基因的侧翼。这些数据,结合我们之前关于SNP在2号染色体上非随机分布的发现,表明恶性疟原虫基因组在遗传多样性方面可能是一个镶嵌体,包含高度多态性的染色体区域与多态性低得多的区域相互交错。