Wu Jie, Qi Meng, Zhang Feiyan, Zheng Yuanjie
School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China.
School of Information Science and Engineering, Shandong Normal University, Jinan 250358, China.
Mol Immunol. 2023 May;157:30-41. doi: 10.1016/j.molimm.2023.03.010. Epub 2023 Mar 24.
T cell receptors (TCRs) selectively bind to antigens to fight pathogens with specific immunity. Current tools focus on the nature of amino acids within sequences and take less into account the nature of amino acids far apart and the relationship between sequences, leading to significant differences in the results from different datasets. We propose TPBTE, a model based on convolutional Transformer for Predicting the Binding of TCR to Epitope. It takes epitope sequences and the complementary decision region 3 (CDR3) sequences of TCRβ chain as inputs. And it uses a convolutional attention mechanism to learn amino acid representations between different positions of the sequences based on learning local features of the sequences. At the same time, it uses cross attention to learn the interaction information between TCR sequences and epitope sequences. A comprehensive evaluation of the TCR-epitope data shows that the average area under the curve of TPBTE outperforms the baseline model, and demonstrate an intentional performance. In addition, TPBTE can give the probability of binding TCR to epitopes, which can be used as the first step of epitope screening, narrowing the scope of epitope search and reducing the time of epitope search.
T细胞受体(TCR)通过特异性免疫选择性地结合抗原以对抗病原体。当前的工具侧重于序列中氨基酸的性质,而较少考虑相距较远的氨基酸的性质以及序列之间的关系,导致不同数据集的结果存在显著差异。我们提出了TPBTE,这是一种基于卷积Transformer的用于预测TCR与表位结合的模型。它将表位序列和TCRβ链的互补决定区3(CDR3)序列作为输入。并且它使用卷积注意力机制,基于学习序列的局部特征来学习序列不同位置之间的氨基酸表示。同时,它使用交叉注意力来学习TCR序列和表位序列之间的相互作用信息。对TCR-表位数据的综合评估表明,TPBTE的平均曲线下面积优于基线模型,并展示出良好的性能。此外,TPBTE可以给出TCR与表位结合的概率,这可以用作表位筛选的第一步,缩小表位搜索范围并减少表位搜索时间。