Matern Benedict M, Spierings Eric, Bandstra Selle, Madbouly Abeer, Schaub Stefan, Weimer Eric T, Niemann Matthias
PIRCHE AG, Berlin, Germany.
Center for Translational Immunology and Central Diagnostics Laboratory, University Medical Center, Utrecht, Netherlands.
Front Genet. 2024 Sep 25;15:1444554. doi: 10.3389/fgene.2024.1444554. eCollection 2024.
Modern histocompatibility algorithms depend on the comparison and analysis of high-resolution HLA protein sequences and structures, especially when considering epitope-based algorithms, which aim to model the interactions involved in antibody or T cell binding. HLA genotype imputation can be performed in the cases where only low/intermediate-resolution HLA genotype is available or if specific loci are missing, and by providing an individuals' race/ethnicity/ancestry information, imputation results can be more accurate. This study assesses the effect of imputing high-resolution genotypes on molecular mismatch scores under a variety of ancestry assumptions.
We compared molecular matching scores from "ground-truth" high-resolution genotypes against scores from genotypes which are imputed from low-resolution genotypes. Analysis was focused on a simulated patient-donor dataset and confirmed using two real-world datasets, and deviations were aggregated based on various ancestry assumptions.
We observed that using multiple imputation generally results in lower error in molecular matching scores compared to single imputation, and that using the correct ancestry assumptions can reduce error introduced during imputation.
We conclude that for epitope analysis, imputation is a valuable and low-risk strategy, as long as care is taken regarding epitope analysis context, ancestry assumptions, and (multiple) imputation strategy.
现代组织相容性算法依赖于对高分辨率HLA蛋白序列和结构的比较与分析,尤其是在考虑基于表位的算法时,这些算法旨在模拟抗体或T细胞结合中涉及的相互作用。在仅获得低分辨率/中等分辨率HLA基因型或特定基因座缺失的情况下,可以进行HLA基因型推算,并且通过提供个体的种族/民族/血统信息,推算结果会更准确。本研究评估了在各种血统假设下推算高分辨率基因型对分子错配分数的影响。
我们将“真实”高分辨率基因型的分子匹配分数与从低分辨率基因型推算出的基因型分数进行了比较。分析集中在一个模拟的患者-供体数据集上,并使用两个真实世界的数据集进行了验证,偏差基于各种血统假设进行汇总。
我们观察到,与单一推算相比,使用多重推算通常会使分子匹配分数的误差更低,并且使用正确的血统假设可以减少推算过程中引入的误差。
我们得出结论,对于表位分析而言,只要在表位分析背景、血统假设和(多重)推算策略方面加以注意,推算就是一种有价值且低风险的策略。