Chimusa Emile R, Meintjies Ayton, Tchanga Milaine, Mulder Nicola, Seoighe Cathal, Soodyall Himla, Ramesar Rajkumar
Computational Biology Group, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa; Centre for Proteomic and Genomic Research, Cape Town, South Africa.
Computational Biology Group, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa.
PLoS Genet. 2015 Mar 26;11(3):e1005052. doi: 10.1371/journal.pgen.1005052. eCollection 2015 Mar.
We report a study of genome-wide, dense SNP (∼ 900K) and copy number polymorphism data of indigenous southern Africans. We demonstrate the genetic contribution to southern and eastern African populations, which involved admixture between indigenous San, Niger-Congo-speaking and populations of Eurasian ancestry. This finding illustrates the need to account for stratification in genome-wide association studies, and that admixture mapping would likely be a successful approach in these populations. We developed a strategy to detect the signature of selection prior to and following putative admixture events. Several genomic regions show an unusual excess of Niger-Kordofanian, and unusual deficiency of both San and Eurasian ancestry, which were considered the footprints of selection after population admixture. Several SNPs with strong allele frequency differences were observed predominantly between the admixed indigenous southern African populations, and their ancestral Eurasian populations. Interestingly, many candidate genes, which were identified within the genomic regions showing signals for selection, were associated with southern African-specific high-risk, mostly communicable diseases, such as malaria, influenza, tuberculosis, and human immunodeficiency virus/AIDs. This observation suggests a potentially important role that these genes might have played in adapting to the environment. Additionally, our analyses of haplotype structure, linkage disequilibrium, recombination, copy number variation and genome-wide admixture highlight, and support the unique position of San relative to both African and non-African populations. This study contributes to a better understanding of population ancestry and selection in south-eastern African populations; and the data and results obtained will support research into the genetic contributions to infectious as well as non-communicable diseases in the region.
我们报告了一项对南部非洲原住民全基因组、高密度单核苷酸多态性(约90万个)和拷贝数多态性数据的研究。我们证明了对南部和东部非洲人群的遗传贡献,其中涉及本土桑人、尼日尔 - 刚果语系人群和欧亚血统人群之间的混合。这一发现说明了在全基因组关联研究中考虑分层的必要性,并且混合映射在这些人群中可能是一种成功的方法。我们开发了一种策略来检测假定混合事件之前和之后的选择特征。几个基因组区域显示出尼日尔 - 科尔多凡尼血统异常过量,而桑人和欧亚血统则异常缺乏,这些被认为是人群混合后选择的印记。在混合的南部非洲本土人群与其祖先欧亚人群之间,主要观察到了几个具有强烈等位基因频率差异的单核苷酸多态性。有趣的是,在显示选择信号的基因组区域内鉴定出的许多候选基因,都与南部非洲特有的高风险、大多为传染病相关,如疟疾、流感、结核病和人类免疫缺陷病毒/艾滋病。这一观察结果表明这些基因在适应环境中可能发挥了潜在的重要作用。此外,我们对单倍型结构、连锁不平衡、重组、拷贝数变异和全基因组混合的分析突出并支持了桑人相对于非洲和非非洲人群的独特地位。这项研究有助于更好地理解东南部非洲人群的种群祖先和选择;所获得的数据和结果将支持对该地区传染病和非传染病遗传贡献的研究。