Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden.
PLoS One. 2011;6(8):e22547. doi: 10.1371/journal.pone.0022547. Epub 2011 Aug 4.
Patterns of genetic diversity have previously been shown to mirror geography on a global scale and within continents and individual countries. Using genome-wide SNP data on 5174 Swedes with extensive geographical coverage, we analyzed the genetic structure of the Swedish population. We observed strong differences between the far northern counties and the remaining counties. The population of Dalarna county, in north middle Sweden, which borders southern Norway, also appears to differ markedly from other counties, possibly due to this county having more individuals with remote Finnish or Norwegian ancestry than other counties. An analysis of genetic differentiation (based on pairwise F(st)) indicated that the population of Sweden's southernmost counties are genetically closer to the HapMap CEU samples of Northern European ancestry than to the populations of Sweden's northernmost counties. In a comparison of extended homozygous segments, we detected a clear divide between southern and northern Sweden with small differences between the southern counties and considerably more segments in northern Sweden. Both the increased degree of homozygosity in the north and the large genetic differences between the south and the north may have arisen due to a small population in the north and the vast geographical distances between towns and villages in the north, in contrast to the more densely settled southern parts of Sweden. Our findings have implications for future genome-wide association studies (GWAS) with respect to the matching of cases and controls and the need for within-county matching. We have shown that genetic differences within a single country may be substantial, even when viewed on a European scale. Thus, population stratification needs to be accounted for, even within a country like Sweden, which is often perceived to be relatively homogenous and a favourable resource for genetic mapping, otherwise inferences based on genetic data may lead to false conclusions.
先前的研究表明,遗传多样性模式在全球范围内以及在各大洲和各个国家内部与地理因素密切相关。我们利用覆盖范围广泛的 5174 名瑞典人的全基因组 SNP 数据,分析了瑞典人口的遗传结构。我们观察到最北部的县与其他县之间存在显著差异。位于瑞典中北部、与挪威南部接壤的达拉纳县的人口也与其他县明显不同,这可能是因为该县拥有更多具有芬兰或挪威远亲血统的个体。基于基因分化的分析(基于成对 F(st))表明,瑞典最南部各县的人口在遗传上与北欧 CEU 样本的哈普图(HapMap)更为接近,而与最北部各县的人口则存在差异。在对扩展纯合片段的比较中,我们发现瑞典最南部各县与哈普图(HapMap)的 CEU 样本在遗传上更为接近,而与瑞典最北部各县的人口则存在差异。在北部,同质性程度的增加和南北部之间的巨大遗传差异可能是由于北部人口较少,以及北部城镇和村庄之间的地理距离较大所致。我们的研究结果对未来的全基因组关联研究(GWAS)具有重要意义,包括病例与对照的匹配以及对县内匹配的需求。我们表明,即使在欧洲范围内,一个国家内部的遗传差异也可能很大。因此,即使在像瑞典这样通常被认为相对同质且是遗传图谱的有利资源的国家,也需要考虑人口分层问题,否则基于遗传数据的推断可能会导致错误的结论。