Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, USA.
Department of Mathematics, Boise State University, Boise, USA.
BMC Bioinformatics. 2022 Apr 28;23(1):152. doi: 10.1186/s12859-022-04690-2.
T cell receptors (TCRs) play critical roles in adaptive immune responses, and recent advances in genome technology have made it possible to examine the T cell receptor (TCR) repertoire at the individual sequence level. The analysis of the TCR repertoire with respect to clinical phenotypes can yield novel insights into the etiology and progression of immune-mediated diseases. However, methods for association analysis of the TCR repertoire have not been well developed.
We introduce an analysis tool, TCR-L, for evaluating the association between the TCR repertoire and disease outcomes. Our approach is developed under a mixed effect modeling, where the fixed effect represents features that can be explicitly extracted from TCR sequences while the random effect represents features that are hidden in TCR sequences and are difficult to be extracted. Statistical tests are developed to examine the two types of effects independently, and then the p values are combined.
Simulation studies demonstrate that (1) the proposed approach can control the type I error well; and (2) the power of the proposed approach is greater than approaches that consider fixed effect only or random effect only. The analysis of real data from a skin cutaneous melanoma study identifies an association between the TCR repertoire and the short/long-term survival of patients.
The TCR-L can accommodate features that can be extracted as well as features that are hidden in TCR sequences. TCR-L provides a powerful approach for identifying association between TCR repertoire and disease outcomes.
T 细胞受体(TCRs)在适应性免疫反应中起着至关重要的作用,而基因组技术的最新进展使得在个体序列水平上检测 T 细胞受体(TCR)库成为可能。从临床表型方面分析 TCR 库可以深入了解免疫介导疾病的病因和进展。然而,尚未很好地开发 TCR 库的关联分析方法。
我们引入了一种分析工具 TCR-L,用于评估 TCR 库与疾病结局之间的关联。我们的方法是在混合效应模型下开发的,其中固定效应代表可以从 TCR 序列中明确提取的特征,而随机效应代表隐藏在 TCR 序列中且难以提取的特征。开发了统计检验来分别检验这两种效应,然后合并 p 值。
模拟研究表明:(1)所提出的方法可以很好地控制第一类错误;(2)与仅考虑固定效应或仅考虑随机效应的方法相比,所提出的方法的功效更大。对皮肤黑色素瘤研究中真实数据的分析确定了 TCR 库与患者短期/长期生存之间的关联。
TCR-L 可以适应可以提取的特征以及隐藏在 TCR 序列中的特征。TCR-L 为识别 TCR 库与疾病结局之间的关联提供了一种强大的方法。