Department of Biochemistry and Molecular Genetics, University of Louisville, Louisville, KY, USA.
Faculty of Engineering, Bar Ilan University, Ramat Gan, Israel.
Genes Immun. 2024 Aug;25(4):297-306. doi: 10.1038/s41435-024-00279-2. Epub 2024 Jun 6.
Immunoglobulins (IGs), critical components of the human immune system, are composed of heavy and light protein chains encoded at three genomic loci. The IG Kappa (IGK) chain locus consists of two large, inverted segmental duplications. The complexity of the IG loci has hindered use of standard high-throughput methods for characterizing genetic variation within these regions. To overcome these limitations, we use long-read sequencing to create haplotype-resolved IGK assemblies in an ancestrally diverse cohort (n = 36), representing the first comprehensive description of IGK haplotype variation. We identify extensive locus polymorphism, including novel single nucleotide variants (SNVs) and novel structural variants harboring functional IGKV genes. Among 47 functional IGKV genes, we identify 145 alleles, 67 of which were not previously curated. We report inter-population differences in allele frequencies for 10 IGKV genes, including alleles unique to specific populations within this dataset. We identify haplotypes carrying signatures of gene conversion that associate with SNV enrichment in the IGK distal region, and a haplotype with an inversion spanning the proximal and distal regions. These data provide a critical resource of curated genomic reference information from diverse ancestries, laying a foundation for advancing our understanding of population-level genetic variation in the IGK locus.
免疫球蛋白(IGs)是人体免疫系统的关键组成部分,由三个基因组位置编码的重链和轻链蛋白组成。IGK 链基因座由两个大的、反向的片段重复组成。IG 基因座的复杂性阻碍了使用标准高通量方法来描述这些区域内遗传变异的应用。为了克服这些限制,我们使用长读测序技术在一个具有祖先多样性的队列(n=36)中创建了单倍型解析的 IGK 组装,这是对 IGK 单倍型变异的首次全面描述。我们发现了广泛的基因座多态性,包括新的单核苷酸变异(SNVs)和具有功能 IGKV 基因的新结构变异。在 47 个功能性 IGKV 基因中,我们鉴定了 145 个等位基因,其中 67 个是以前未被注释的。我们报告了 10 个 IGKV 基因在等位基因频率上的种群间差异,包括在这个数据集的特定种群中特有的等位基因。我们鉴定了携带基因转换特征的单倍型,这些单倍型与 IGK 远端的 SNV 富集相关联,以及一个跨越近端和远端的倒置单倍型。这些数据提供了来自不同祖先的经过注释的基因组参考信息的重要资源,为我们理解 IGK 基因座的群体遗传变异奠定了基础。