Frentzen Angela, Greenbaum Jason A, Kim Haeuk, Peters Bjoern, Koşaloğlu-Yalçın Zeynep
Center for Infectious Disease and Vaccine Research, La Jolla Institute for Immunology, San Diego, CA, United States.
Department of Medicine, University of California, San Diego, San Diego, CA, United States.
Front Genet. 2023 Jan 12;14:1082168. doi: 10.3389/fgene.2023.1082168. eCollection 2023.
Several novel MHC class I epitope prediction tools additionally incorporate the abundance levels of the peptides' source antigens and have shown improved performance for predicting immunogenicity. Such tools require the user to input the MHC alleles and peptide sequences of interest, as well as the abundance levels of the peptides' source proteins. However, such expression data is often not directly available to users, and retrieving the expression level of a peptide's source antigen from public databases is not trivial. We have developed the Peptide eXpression annotator (pepX), which takes a peptide as input, identifies from which proteins the peptide can be derived, and returns an estimate of the expression level of those source proteins from selected public databases. We have also investigated how the abundance level of a peptide can be best estimated in cases when it can originate from multiple transcripts and proteins and found that summing up transcript-level expression values performs best in distinguishing ligands from decoy peptides.
几种新型的MHC I类表位预测工具还纳入了肽源抗原的丰度水平,并在预测免疫原性方面表现出了更好的性能。这类工具要求用户输入感兴趣的MHC等位基因和肽序列,以及肽源蛋白的丰度水平。然而,此类表达数据通常无法直接提供给用户,而且从公共数据库中检索肽源抗原的表达水平并非易事。我们开发了肽表达注释器(pepX),它以肽为输入,识别该肽可源自哪些蛋白质,并从选定的公共数据库中返回这些源蛋白表达水平的估计值。我们还研究了在肽可能源自多个转录本和蛋白质的情况下,如何最好地估计肽的丰度水平,发现汇总转录本水平的表达值在区分配体和诱饵肽方面表现最佳。