Ruiz Ortega María, Pogorelyy Mikhail V, Minervina Anastasia A, Thomas Paul G, Mora Thierry, Walczak Aleksandra M
Laboratoire de physique de l'École Normale Supérieure, CNRS, PSL Université, Sorbonne Université, and Université Paris-Cité, Paris, France.
Department of Host-Microbe Interactions, St. Jude Children's Research Hospital, Memphis, Tennessee, United States of America.
PLoS Comput Biol. 2025 Jan 6;21(1):e1012724. doi: 10.1371/journal.pcbi.1012724. eCollection 2025 Jan.
T cells recognize a wide range of pathogens using surface receptors that interact directly with peptides presented on major histocompatibility complexes (MHC) encoded by the HLA loci in humans. Understanding the association between T cell receptors (TCR) and HLA alleles is an important step towards predicting TCR-antigen specificity from sequences. Here we analyze the TCR alpha and beta repertoires of large cohorts of HLA-typed donors to systematically infer such associations, by looking for overrepresentation of TCRs in individuals with a common allele.TCRs, associated with a specific HLA allele, exhibit sequence similarities that suggest prior antigen exposure. Immune repertoire sequencing has produced large numbers of datasets, however the HLA type of the corresponding donors is rarely available. Using our TCR-HLA associations, we trained a computational model to predict the HLA type of individuals from their TCR repertoire alone. We propose an iterative procedure to refine this model by using data from large cohorts of untyped individuals, by recursively typing them using the model itself. The resulting model shows good predictive performance, even for relatively rare HLA alleles.
T细胞利用表面受体识别多种病原体,这些受体直接与人类HLA基因座编码的主要组织相容性复合体(MHC)上呈递的肽相互作用。了解T细胞受体(TCR)与HLA等位基因之间的关联是从序列预测TCR-抗原特异性的重要一步。在这里,我们分析了大量HLA分型供体的TCRα和β库,通过寻找具有共同等位基因的个体中TCR的过度代表性来系统地推断这种关联。与特定HLA等位基因相关的TCR表现出序列相似性,这表明先前有抗原暴露。免疫组库测序产生了大量数据集,然而相应供体的HLA类型很少可得。利用我们的TCR-HLA关联,我们训练了一个计算模型,仅根据个体的TCR库来预测其HLA类型。我们提出了一种迭代程序,通过使用来自大量未分型个体的数据,通过使用模型本身对他们进行递归分型来改进该模型。所得模型显示出良好的预测性能,即使对于相对罕见的HLA等位基因也是如此。