Karnaukhov Vadim, Paes Wayne, Woodhouse Isaac B, Partridge Thomas, Nicastri Annalisa, Brackenridge Simon, Scherbinin Dmitrii, Chudakov Dmitry M, Zvyagin Ivan V, Ternette Nicola, Koohy Hashem, Borrow Persephone, Shugay Mikhail
Center of Life Sciences, Skolkovo Institute of Science and Technology, Moscow, Russia.
Nuffield Department of Clinical Medicine, University of Oxford, Oxford, UK.
bioRxiv. 2021 Feb 17:2021.02.16.431395. doi: 10.1101/2021.02.16.431395.
Human leukocyte antigen (HLA) is highly polymorphic and plays a key role in guiding adaptive immune responses by presenting foreign and self peptides to T cells. Each HLA variant selects a minor fraction of peptides that match a certain motif required for optimal interaction with the peptide-binding groove. These restriction rules define the landscape of peptides presented to T cells. Given these limitations, one might suggest that the choice of peptides presented by HLA is non-random and there is preferential presentation of an array of peptides that is optimal for distinguishing self and foreign proteins. In this study we explore these preferences with a comparative analysis of self peptides enriched and depleted in HLA ligands. We show that HLAs exhibit preferences towards presenting peptides from certain proteins while disfavoring others with specific functions, and highlight differences between various HLA genes and alleles in those preferences. We link those differences to HLA anchor residue propensities and amino acid composition of preferentially presented proteins. The set of proteins that peptides presented by a given HLA are most likely to be derived from can be used to distinguish between class I and class II HLAs and HLA alleles. Our observations can be extrapolated to explain the protective effect of certain HLA alleles in infectious diseases, and we hypothesize that they can also explain susceptibility to certain autoimmune diseases and cancers. We demonstrate that these differences lead to differential presentation of HIV, influenza virus, SARS-CoV-1 and SARS-CoV-2 proteins by various HLA alleles. Finally, we show that the reported self peptidome preferences of distinct HLA variants can be compensated by combinations of HLA-A/HLA-B and HLA-A/HLA-C alleles in frequent haplotypes.
人类白细胞抗原(HLA)具有高度多态性,通过将外来和自身肽段呈递给T细胞,在引导适应性免疫反应中发挥关键作用。每个HLA变体选择一小部分与肽结合槽最佳相互作用所需特定基序匹配的肽段。这些限制规则定义了呈递给T细胞的肽段格局。鉴于这些局限性,有人可能会认为HLA呈递的肽段选择并非随机,存在一系列对区分自身和外来蛋白质最优化的肽段的优先呈递。在本研究中,我们通过对HLA配体中富集和耗尽的自身肽段进行比较分析来探索这些偏好。我们表明,HLA对呈递某些蛋白质的肽段表现出偏好,而不喜欢具有特定功能的其他蛋白质的肽段,并突出了不同HLA基因和等位基因在这些偏好上的差异。我们将这些差异与HLA锚定残基倾向和优先呈递蛋白质的氨基酸组成联系起来。给定HLA呈递的肽段最可能源自的蛋白质组可用于区分I类和II类HLA以及HLA等位基因。我们的观察结果可以外推来解释某些HLA等位基因在传染病中的保护作用,并且我们假设它们也可以解释对某些自身免疫性疾病和癌症的易感性。我们证明这些差异导致各种HLA等位基因对HIV、流感病毒、SARS-CoV-1和SARS-CoV-2蛋白的呈递存在差异。最后,我们表明,不同HLA变体报告的自身肽组偏好可以通过常见单倍型中HLA-A/HLA-B和HLA-A/HLA-C等位基因的组合得到补偿。