Center for Biological Sequence Analysis, Department of Bio and Health Informatics, Technical University of Denmark, DK-2800, Lyngby, Denmark.
Department of Twin Research and Genetic Epidemiology, Kings College London, London, UK.
BMC Bioinformatics. 2018 Jun 25;19(1):239. doi: 10.1186/s12859-018-2239-6.
The adaptive immune response intrinsically depends on hypervariable human leukocyte antigen (HLA) genes. Concomitantly, correct HLA phenotyping is crucial for successful donor-patient matching in organ transplantation. The cost and technical limitations of current laboratory techniques, together with advances in next-generation sequencing (NGS) methodologies, have increased the need for precise computational typing methods.
We tested two widespread HLA typing methods using high quality full genome sequencing data from 150 individuals in 50 family trios from the Genome Denmark project. First, we computed descendant accuracies assessing the agreement in the inheritance of alleles from parents to offspring. Second, we compared the locus-specific homozygosity rates as well as the allele frequencies; and we compared those to the observed values in related populations. We provide guidelines for testing the accuracy of HLA typing methods by comparing family information, which is independent of the availability of curated alleles.
Although current computational methods for HLA typing generally provide satisfactory results, our benchmark - using data with ultra-high sequencing depth - demonstrates the incompleteness of current reference databases, and highlights the importance of providing genomic databases addressing current sequencing standards, a problem yet to be resolved before benefiting fully from personalised medicine approaches HLA phenotyping is essential.
适应性免疫反应本质上依赖于高度多变的人类白细胞抗原(HLA)基因。同时,正确的 HLA 表型对于器官移植中成功的供体-患者匹配至关重要。当前实验室技术的成本和技术限制,以及下一代测序(NGS)方法的进步,增加了对精确计算分型方法的需求。
我们使用来自丹麦基因组计划 50 个三核苷酸家族中的 150 个人的高质量全基因组测序数据,测试了两种广泛使用的 HLA 分型方法。首先,我们计算了后代准确性,评估了从父母到后代的等位基因遗传的一致性。其次,我们比较了特定基因座的同质性率以及等位基因频率;并将其与相关人群中的观察值进行了比较。我们通过比较独立于已审定等位基因可用性的家族信息,提供了测试 HLA 分型方法准确性的指南。
尽管当前用于 HLA 分型的计算方法通常提供令人满意的结果,但我们的基准(使用具有超高测序深度的数据)表明,当前参考数据库不完整,并强调了提供针对当前测序标准的基因组数据库的重要性,这是在充分受益于个性化医疗方法之前尚未解决的问题。HLA 表型至关重要。