K.G. Jebsen Coeliac Disease Research Centre, University of Oslo, Oslo, 0372, Norway.
Department of Immunology, University of Oslo and Oslo University Hospital-Rikshospitalet, Oslo, 0372, Norway.
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab566.
T-cell receptor (TCR) sequencing has enabled the development of innovative diagnostic tests for cancers, autoimmune diseases and other applications. However, the rarity of many T-cell clonotypes presents a detection challenge, which may lead to misdiagnosis if diagnostically relevant TCRs remain undetected. To address this issue, we developed TCRpower, a novel computational pipeline for quantifying the statistical detection power of TCR sequencing methods. TCRpower calculates the probability of detecting a TCR sequence as a function of several key parameters: in-vivo TCR frequency, T-cell sample count, read sequencing depth and read cutoff. To calibrate TCRpower, we selected unique TCRs of 45 T-cell clones (TCCs) as spike-in TCRs. We sequenced the spike-in TCRs from TCCs, together with TCRs from peripheral blood, using a 5' RACE protocol. The 45 spike-in TCRs covered a wide range of sample frequencies, ranging from 5 per 100 to 1 per 1 million. The resulting spike-in TCR read counts and ground truth frequencies allowed us to calibrate TCRpower. In our TCR sequencing data, we observed a consistent linear relationship between sample and sequencing read frequencies. We were also able to reliably detect spike-in TCRs with frequencies as low as one per million. By implementing an optimized read cutoff, we eliminated most of the falsely detected sequences in our data (TCR α-chain 99.0% and TCR β-chain 92.4%), thereby improving diagnostic specificity. TCRpower is publicly available and can be used to optimize future TCR sequencing experiments, and thereby enable reliable detection of disease-relevant TCRs for diagnostic applications.
T 细胞受体(TCR)测序技术为癌症、自身免疫性疾病和其他应用的创新诊断测试提供了可能。然而,许多 T 细胞克隆型的稀有性带来了检测挑战,如果未能检测到与诊断相关的 TCR,可能会导致误诊。为了解决这个问题,我们开发了 TCRpower,这是一种用于量化 TCR 测序方法统计检测能力的新型计算管道。TCRpower 计算了检测 TCR 序列的概率,作为几个关键参数的函数:体内 TCR 频率、T 细胞样本数量、读取测序深度和读取截止值。为了校准 TCRpower,我们选择了 45 个 T 细胞克隆(TCC)的独特 TCR 作为 Spike-in TCR。我们使用 5'RACE 方案对来自 TCC 的 Spike-in TCR 与外周血中的 TCR 进行测序。45 个 Spike-in TCR 涵盖了广泛的样本频率范围,从每 100 个中有 5 个到每 100 万个中有 1 个。由此产生的 Spike-in TCR 读取计数和真实频率使我们能够校准 TCRpower。在我们的 TCR 测序数据中,我们观察到样本和测序读取频率之间存在一致的线性关系。我们还能够可靠地检测到频率低至每百万分之一的 Spike-in TCR。通过实施优化的读取截止值,我们消除了数据中大多数误报的序列(TCR α 链为 99.0%,TCR β 链为 92.4%),从而提高了诊断特异性。TCRpower 是公开可用的,可以用于优化未来的 TCR 测序实验,从而能够可靠地检测与疾病相关的 TCR 用于诊断应用。