School of Biological Sciences, The University of Hong Kong, Pok Fu Lam Road, Hong Kong SAR, China.
Immunogenetics. 2022 Jun;74(3):327-346. doi: 10.1007/s00251-022-01255-8. Epub 2022 Mar 1.
Duplicates of genes for major histocompatibility complex (MHC) molecules can be subjected to selection independently and vary markedly in their evolutionary rates, sequence polymorphism, and functional roles. Therefore, without a thorough understanding of their copy number variation (CNV) in the genome, the MHC-dependent fitness consequences within a species could be misinterpreted. Studying the intra-specific CNV of this highly polymorphic gene, however, has long been hindered by the difficulties in assigning alleles to loci and the lack of high-quality genomic data. Here, using the high-quality genome of the Siamese fighting fish (Betta splendens), a model for mate choice studies, and the whole-genome sequencing (WGS) data of 17 Betta species, we achieved locus-specific amplification of their three classical MHC class II genes - DAB1, DAB2, and DAB3. By performing quantitative PCR and depth-of-coverage analysis using the WGS data, we revealed intra-specific CNV at the DAB3 locus. We identified individuals that had two allelic copies (i.e., heterozygous or homozygous) or one allele (i.e., hemizygous) and individuals without this gene. The CNV was due to the deletion of a 20-kb-long genomic region harboring both the DAA3 and DAB3 genes. We further showed that the three DAB genes were under different modes of selection, which also applies to their corresponding DAA genes that share similar pattern of polymorphism. Our study demonstrates a combined approach to study CNV within a species, which is crucial for the understanding of multigene family evolution and the fitness consequences of CNV.
基因的复制品对于主要组织相容性复合体 (MHC) 分子可以独立地受到选择,并且在进化速度、序列多态性和功能作用方面有显著的差异。因此,如果不深入了解其在基因组中的拷贝数变异 (CNV),物种内的 MHC 依赖性适应性后果可能会被误解。然而,由于难以将等位基因分配到基因座以及缺乏高质量基因组数据,研究这个高度多态性基因的种内 CNV 一直受到阻碍。在这里,我们利用暹罗斗鱼 (Betta splendens) 的高质量基因组作为配偶选择研究的模型,以及 17 种贝塔鱼类的全基因组测序 (WGS) 数据,实现了它们三个经典 MHC 类 II 基因 - DAB1、DAB2 和 DAB3 的基因座特异性扩增。通过使用 WGS 数据进行定量 PCR 和覆盖深度分析,我们揭示了 DAB3 基因座的种内 CNV。我们鉴定出了具有两个等位基因拷贝(即杂合子或纯合子)或一个等位基因(即半合子)以及没有这个基因的个体。CNV 是由于包含 DAA3 和 DAB3 基因的 20kb 长基因组区域缺失所致。我们进一步表明,这三个 DAB 基因受到不同选择模式的影响,这也适用于它们共享相似多态性模式的相应 DAA 基因。我们的研究展示了一种研究物种内 CNV 的综合方法,这对于理解多基因家族进化和 CNV 的适应性后果至关重要。