Department of Chemistry, Computational Life Science Cluster, Umeå University, Umeå, Sweden.
Department of Medical Biochemistry and Biophysics, Umeå University, Umeå, Sweden.
PLoS One. 2020 Sep 11;15(9):e0237721. doi: 10.1371/journal.pone.0237721. eCollection 2020.
The number of national reference populations that are whole-genome sequenced are rapidly increasing. Partly driving this development is the fact that genetic disease studies benefit from knowing the genetic variation typical for the geographical area of interest. A whole-genome sequenced Swedish national reference population (n = 1000) has been recently published but with few samples from northern Sweden. In the present study we have whole-genome sequenced a control population (n = 300) (ACpop) from Västerbotten County, a sparsely populated region in northern Sweden previously shown to be genetically different from southern Sweden. The aggregated variant frequencies within ACpop are publicly available (DOI 10.17044/NBIS/G000005) to function as a basic resource in clinical genetics and for genetic studies. Our analysis of ACpop, representing approximately 0.11% of the population in Västerbotten, indicates the presence of a genetic substructure within the county. Furthermore, a demographic analysis showed that the population from which samples were drawn was to a large extent geographically stationary, a finding that was corroborated in the genetic analysis down to the level of municipalities. Including ACpop in the reference population when imputing unknown variants in a Västerbotten cohort resulted in a strong increase in the number of high-confidence imputed variants (up to 81% for variants with minor allele frequency < 5%). ACpop was initially designed for cancer disease studies, but the genetic structure within the cohort will be of general interest for all genetic disease studies in northern Sweden.
越来越多的国家参考人群正在进行全基因组测序。推动这一发展的部分原因是,遗传疾病研究受益于了解特定地理区域的遗传变异。最近发表了一个经过全基因组测序的瑞典国家参考人群(n = 1000),但来自瑞典北部的样本很少。在本研究中,我们对来自瑞典北部人口稀少的瓦斯特堡县的一个对照人群(n = 300)(ACpop)进行了全基因组测序,该人群之前的研究表明与瑞典南部在遗传上存在差异。ACpop 中的聚合变异频率已公开提供(DOI 10.17044/NBIS/G000005),可作为临床遗传学和遗传研究的基本资源。我们对代表瓦斯特堡县人口约 0.11%的 ACpop 的分析表明,该县存在遗传亚结构。此外,人口分析表明,样本取自的人群在很大程度上是地理上静止的,这一发现在遗传分析中得到了证实,甚至到了市镇一级。在对瓦斯特堡队列中的未知变异进行推测时,将 ACpop 纳入参考人群会导致高置信度推测变异的数量大幅增加(对于频率小于 5%的变异,增加高达 81%)。ACpop 最初是为癌症疾病研究设计的,但该队列中的遗传结构将对瑞典北部所有遗传疾病研究具有普遍意义。