GenPhySE, Université de Toulouse, INRAE, INPT, INP-ENVT, Castanet Tolosan, France.
Roslin Institute, University of Edinburgh, Midlothian, UK.
Mol Ecol Resour. 2022 Nov;22(8):3068-3086. doi: 10.1111/1755-0998.13665. Epub 2022 Jun 27.
Honey bee subspecies originate from specific geographical areas in Africa, Europe and the Middle East, and beekeepers interested in specific phenotypes have imported genetic material to regions outside of the bees' original range for use either in pure lines or controlled crosses. Moreover, imported drones are present in the environment and mate naturally with queens from the local subspecies. The resulting admixture complicates population genetics analyses, and population stratification can be a major problem for association studies. To better understand Western European honey bee populations, we produced a whole-genome sequence and single nucleotide polymorphism (SNP) genotype data set from 870 haploid drones and demonstrate its utility for the identification of nine genetic backgrounds and various degrees of admixture in a subset of 629 samples. Of the nine backgrounds identified, five correspond to subspecies, two to isolated populations on islands and two to managed populations. We also highlight several large haplotype blocks, some of which coincide with the position of centromeres. The largest is 3.6 Mb long and represents 21% of chromosome 11, with two major haplotypes corresponding to the two dominant genetic backgrounds identified. This large, naturally phased data set is available as a single VCF file that can now serve as a reference for subsequent population genomics studies in the honey bee, such as (i) selecting individuals of verified homogeneous genetic backgrounds as references, (ii) imputing genotypes from a lower-density data set generated by an SNP chip or by low-pass sequencing, or (iii) selecting SNPs compatible with the requirements of genotyping chips.
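Use case (i), selecting individuals of homogeneous genetic background as references, can be illustrated with a minimal sketch. It assumes that per-sample ancestry proportions (one value per genetic background, summing to 1) have already been estimated by some admixture analysis. The function name `select_reference_samples`, the 0.95 threshold and the example values are all hypothetical and are not taken from the study.

```python
from typing import Dict, List


def select_reference_samples(
    proportions: Dict[str, List[float]],
    threshold: float = 0.95,  # hypothetical cut-off, not a value from the study
) -> Dict[str, int]:
    """Map each sufficiently homogeneous sample to the index of its main background.

    A sample is kept only if its largest ancestry proportion is at least
    `threshold`; admixed samples are left out.
    """
    selected: Dict[str, int] = {}
    for sample_id, q in proportions.items():
        # Index of the background with the largest estimated proportion.
        best = max(range(len(q)), key=lambda k: q[k])
        if q[best] >= threshold:
            selected[sample_id] = best
    return selected


# Made-up proportions for three hypothetical drones and three backgrounds.
example = {
    "drone_A": [0.99, 0.01, 0.00],
    "drone_B": [0.55, 0.40, 0.05],  # admixed, so excluded at the default threshold
    "drone_C": [0.02, 0.97, 0.01],
}

references = select_reference_samples(example)
print(references)  # {'drone_A': 0, 'drone_C': 1}
```

The threshold controls the trade-off between the purity of the reference panel and the number of samples retained; a suitable value would depend on the data set and the downstream analysis.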