Department of Biochemistry and Biotechnology, Pwani University, Kilifi, Kenya.
Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa.
PLoS One. 2023 Sep 28;18(9):e0291437. doi: 10.1371/journal.pone.0291437. eCollection 2023.
The Human Leukocyte Antigen (HLA) region plays an important role in autoimmune and infectious diseases. HLA is a highly polymorphic region and thus difficult to impute. We, therefore, sought to evaluate HLA imputation accuracy, specifically in a West African population, since they are understudied and are known to harbor high genetic diversity. The study sets were selected from 315 Gambian individuals within the Gambian Genome Variation Project (GGVP) Whole Genome Sequence datasets. Two different arrays, Illumina Omni 2.5 and Human Hereditary and Health in Africa (H3Africa), were assessed for the appropriateness of their markers, and these were used to test several imputation panels and tools. The reference panels were chosen from the 1000 Genomes (1kg-All), 1000 Genomes African (1kg-Afr), 1000 Genomes Gambian (1kg-Gwd), H3Africa, and the HLA Multi-ethnic datasets. HLA-A, HLA-B, and HLA-C alleles were imputed using HIBAG, SNP2HLA, CookHLA, and Minimac4, and concordance rate was used as an assessment metric. The best performing tool was found to be HIBAG, with a concordance rate of 0.84, while the best performing reference panel was the H3Africa panel, with a concordance rate of 0.62. Minimac4 (0.75) was shown to increase HLA-B allele imputation accuracy compared to HIBAG (0.71), SNP2HLA (0.51) and CookHLA (0.17). The H3Africa and Illumina Omni 2.5 array performances were comparable, showing that genotyping arrays have less influence on HLA imputation in West African populations. The findings show that using a larger population-specific reference panel and the HIBAG tool improves the accuracy of HLA imputation in a West African population.
人类白细胞抗原 (HLA) 区域在自身免疫和传染病中起着重要作用。HLA 是一个高度多态性区域,因此难以推断。因此,我们试图评估 HLA 推断的准确性,特别是在西非人群中,因为他们的研究较少,而且已知遗传多样性很高。研究集从 Gambian Genome Variation Project (GGVP) 全基因组序列数据集内的 315 名冈比亚个体中选择。评估了两种不同的数组,Illumina Omni 2.5 和 Human Hereditary and Health in Africa (H3Africa),以确定其标记的适宜性,并使用这些数组测试了几个推断面板和工具。参考面板是从 1000 Genomes (1kg-All)、1000 Genomes African (1kg-Afr)、1000 Genomes Gambian (1kg-Gwd)、H3Africa 和 HLA 多民族数据集选择的。使用 HIBAG、SNP2HLA、CookHLA 和 Minimac4 推断 HLA-A、HLA-B 和 HLA-C 等位基因,并使用一致性率作为评估指标。结果发现,性能最佳的工具是 HIBAG,一致性率为 0.84,而性能最佳的参考面板是 H3Africa 面板,一致性率为 0.62。与 HIBAG (0.71)、SNP2HLA (0.51) 和 CookHLA (0.17) 相比,Minimac4 (0.75) 提高了 HLA-B 等位基因推断的准确性。H3Africa 和 Illumina Omni 2.5 阵列的性能相当,表明基因分型阵列对西非人群 HLA 推断的影响较小。研究结果表明,使用更大的特定于人群的参考面板和 HIBAG 工具可提高西非人群 HLA 推断的准确性。