Department of Biological Sciences, University of Botswana, Gaborone, Botswana.
Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda.
Am J Hum Genet. 2018 May 3;102(5):731-743. doi: 10.1016/j.ajhg.2018.03.010. Epub 2018 Apr 26.
Large-scale, population-based genomic studies have provided a context for modern medical genetics. Among such studies, however, African populations have remained relatively underrepresented. The breadth of genetic diversity across the African continent argues for an exploration of local genomic context to facilitate burgeoning disease mapping studies in Africa. We sought to characterize genetic variation and to assess population substructure within a cohort of HIV-positive children from Botswana-a Southern African country that is regionally underrepresented in genomic databases. Using whole-exome sequencing data from 164 Batswana and comparisons with 150 similarly sequenced HIV-positive Ugandan children, we found that 13%-25% of variation observed among Batswana was not captured by public databases. Uncaptured variants were significantly enriched (p = 2.2 × 10) for coding variants with minor allele frequencies between 1% and 5% and included predicted-damaging non-synonymous variants. Among variants found in public databases, corresponding allele frequencies varied widely, with Botswana having significantly higher allele frequencies among rare (<1%) pathogenic and damaging variants. Batswana clustered with other Southern African populations, but distinctly from 1000 Genomes African populations, and had limited evidence for admixture with extra-continental ancestries. We also observed a surprising lack of genetic substructure in Botswana, despite multiple tribal ethnicities and language groups, alongside a higher degree of relatedness than purported founder populations from the 1000 Genomes project. Our observations reveal a complex, but distinct, ancestral history and genomic architecture among Batswana and suggest that disease mapping within similar Southern African populations will require a deeper repository of genetic variation and allelic dependencies than presently exists.
大规模的基于人群的基因组研究为现代医学遗传学提供了背景。然而,在这些研究中,非洲人群的代表性仍然相对较低。非洲大陆的遗传多样性广泛,这就需要探索当地的基因组背景,以促进非洲新兴的疾病图谱研究。我们试图描述博茨瓦纳艾滋病毒阳性儿童队列中的遗传变异,并评估其群体结构。博茨瓦纳是南部非洲国家,在基因组数据库中代表性不足。我们使用来自 164 名博茨瓦纳人的全外显子组测序数据,并与 150 名同样经过测序的乌干达艾滋病毒阳性儿童进行比较,发现博茨瓦纳人中观察到的 13%-25%的变异无法从公共数据库中捕获。未捕获的变异在编码变异中明显富集(p = 2.2×10),这些变异的次要等位基因频率在 1%-5%之间,包括预测的有害非同义变异。在公共数据库中发现的变异中,相应的等位基因频率差异很大,博茨瓦纳的罕见(<1%)致病性和破坏性变异的等位基因频率明显更高。博茨瓦纳人与其他南部非洲人群聚类,但与 1000 基因组项目的非洲人群明显不同,与非洲以外祖先的混合证据有限。我们还观察到博茨瓦纳令人惊讶的缺乏遗传结构,尽管存在多种部落种族和语言群体,以及比 1000 基因组项目中推测的创始人群体更高的亲缘关系。我们的观察结果揭示了博茨瓦纳复杂但独特的祖先历史和基因组结构,并表明在类似的南部非洲人群中进行疾病图谱研究需要比目前更深入的遗传变异和等位基因依赖性存储库。