Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC 27599, USA.
J Hum Genet. 2010 Jun;55(6):366-74. doi: 10.1038/jhg.2010.41. Epub 2010 May 20.
Understanding genetic variation between populations is important because it affects the portability of human genome-wide analytical methods. We compared genetic variation and substructure between Malawians and other African and non-African HapMap populations. Allele frequencies and adjacent linkage disequilibrium (LD) were measured for 617 715 single nucleotide polymorphisms (SNPs) across subject genomes. Allele frequencies in the Malawian population (N=226) were highly correlated with allele frequencies in HapMap populations of African ancestry (AFA, N=376), namely Yoruban in Ibadan, Nigeria (Spearman's r(2)=0.97), Luhya in Webuye, Kenya (r(2)=0.97), African Americans in the southwest United States (r(2)=0.94) and Maasai in Kinyawa, Kenya (r(2)=0.91). This correlation was much lower between Malawians and other ancestry populations (r(2)<0.52). LD correlations between Malawians and HapMap populations were strongest for the populations of AFA (AFA r(2)>0.82, other ancestries r(2)<0.57). Principal components analyses revealed little population substructure within our Malawi sample but provided clear distinction between Malawians, AFA populations and two European populations. Five SNPs within the lactase gene (LCT) had substantially different allele frequencies between the Malawi population and Maasai in Kenyawa, Kenya (rs3769013, rs730005, rs3769012, rs2304370; P-values <1 x 10(-33)).
了解人群之间的遗传变异很重要,因为它会影响人类全基因组分析方法的可移植性。我们比较了马拉维人与其他非洲和非非洲 HapMap 人群之间的遗传变异和亚结构。在受试者基因组中,测量了 617715 个单核苷酸多态性(SNP)的等位基因频率和相邻连锁不平衡(LD)。马拉维人群(N=226)的等位基因频率与非洲裔 HapMap 人群(N=376)的等位基因频率高度相关,即尼日利亚伊巴丹的约鲁巴人(Spearman's r(2)=0.97)、肯尼亚韦布uye 的卢希亚人(r(2)=0.97)、美国西南部的非裔美国人(r(2)=0.94)和肯尼亚基尼亚瓦的马赛人(r(2)=0.91)。马拉维人与其他祖先人群之间的相关性要低得多(r(2)<0.52)。马拉维人与 HapMap 人群之间的 LD 相关性在 AFA 人群中最强(AFA r(2)>0.82,其他祖先 r(2)<0.57)。主成分分析显示,我们的马拉维样本中人群亚结构很小,但在马拉维人、AFA 人群和两个欧洲人群之间提供了明显的区分。乳糖酶基因(LCT)内的 5 个 SNP 在马拉维人群和肯尼亚 Maasai 人群之间的等位基因频率有很大差异(rs3769013、rs730005、rs3769012、rs2304370;P 值<1 x 10(-33))。