Marsden Clare Diana, Lee Yoosook, Kreppel Katharina, Weakley Allison, Cornel Anthony, Ferguson Heather M, Eskin Eleazar, Lanzaro Gregory C
Vector Genetics Laboratory, Department of Pathology, Microbiology, and Immunology, School of Veterinary Medicine, University of California-Davis, California 95616.
G3 (Bethesda). 2014 Jan 10;4(1):121-31. doi: 10.1534/g3.113.008326.
Association mapping is a widely applied method for elucidating the genetic basis of phenotypic traits. However, factors such as linkage disequilibrium and levels of genetic diversity influence the power and resolution of this approach. Moreover, the presence of population subdivision among samples can result in spurious associations if not accounted for. As such, it is useful to have a detailed understanding of these factors before conducting association mapping experiments. Here we conducted whole-genome sequencing on 24 specimens of the malaria mosquito vector, Anopheles arabiensis, to further understanding of patterns of genetic diversity, population subdivision and linkage disequilibrium in this species. We found high levels of genetic diversity within the An. arabiensis genome, with ~800,000 high-confidence, single- nucleotide polymorphisms detected. However, levels of nucleotide diversity varied significantly both within and between chromosomes. We observed lower diversity on the X chromosome, within some inversions, and near centromeres. Population structure was absent at the local scale (Kilombero Valley, Tanzania) but detected between distant populations (Cameroon vs. Tanzania) where differentiation was largely restricted to certain autosomal chromosomal inversions such as 2Rb. Overall, linkage disequilibrium within An. arabiensis decayed very rapidly (within 200 bp) across all chromosomes. However, elevated linkage disequilibrium was observed within some inversions, suggesting that recombination is reduced in those regions. The overall low levels of linkage disequilibrium suggests that association studies in this taxon will be very challenging for all but variants of large effect, and will require large sample sizes.
关联作图是一种广泛应用于阐明表型性状遗传基础的方法。然而,诸如连锁不平衡和遗传多样性水平等因素会影响该方法的效力和分辨率。此外,如果不加以考虑,样本中群体细分的存在可能会导致虚假关联。因此,在进行关联作图实验之前,详细了解这些因素是很有用的。在这里,我们对24个疟蚊媒介阿拉伯按蚊样本进行了全基因组测序,以进一步了解该物种的遗传多样性模式、群体细分和连锁不平衡情况。我们发现阿拉伯按蚊基因组内存在高水平的遗传多样性,检测到约80万个高可信度的单核苷酸多态性。然而,核苷酸多样性水平在染色体内和染色体间均有显著差异。我们观察到X染色体上、某些倒位区域内以及着丝粒附近的多样性较低。在局部尺度(坦桑尼亚基洛贝罗山谷)不存在群体结构,但在距离较远的群体(喀麦隆与坦桑尼亚)之间检测到了群体结构,其中分化主要局限于某些常染色体倒位,如2Rb。总体而言,阿拉伯按蚊所有染色体上的连锁不平衡衰减非常迅速(在200 bp内)。然而,在某些倒位区域内观察到连锁不平衡升高,这表明这些区域的重组减少。连锁不平衡的总体低水平表明,除了大效应变异外,对该分类群进行关联研究将极具挑战性,并且需要大样本量。