Quantification of Inter-Sample Differences in T-Cell Receptor Repertoires Using Sequence-Based Information.

Author information

Yokota Ryo, Kaminaga Yuki, Kobayashi Tetsuya J

Affiliations

Institute of Industrial Science, The University of Tokyo, Tokyo, Japan.

Department of Electrical Engineering and Information Systems, Graduate School of Engineering, The University of Tokyo, Tokyo, Japan.

Publication information

Front Immunol. 2017 Nov 15;8:1500. doi: 10.3389/fimmu.2017.01500. eCollection 2017.

DOI:10.3389/fimmu.2017.01500

PMID:29187849

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5694755/

Abstract

Inter-sample comparisons of T-cell receptor (TCR) repertoires are crucial for gaining a better understanding of the immunological states determined by different collections of T cells from different donor sites, cell types, and genetic and pathological backgrounds. For quantitative comparison, most previous studies utilized conventional methods in ecology, which focus on TCR sequences that overlap between pairwise samples. Some recent studies attempted another approach that is categorized into Poisson abundance models using the abundance distribution of observed TCR sequences. However, these methods ignore the details of the measured sequences and are consequently unable to identify sub-repertoires that might have important contributions to the observed inter-sample differences. Moreover, the sparsity of sequence data due to the huge diversity of repertoires hampers the performance of these methods, especially when few overlapping sequences exist. In this paper, we propose a new approach for REpertoire COmparison in Low Dimensions (RECOLD) based on TCR sequence information, which can estimate the low-dimensional structure by embedding the pairwise sequence dissimilarities in high-dimensional sequence space. The inter-sample differences between repertoires are then quantified by information-theoretic measures among the distributions of data estimated in the embedded space. Using datasets of mouse and human TCR repertoires, we demonstrate that RECOLD can accurately identify the inter-sample hierarchical structures, which have a good correspondence with our intuitive understanding about sample conditions. Moreover, for the dataset of transgenic mice that have strong restrictions on the diversity of their repertoires, our estimated inter-sample structure was consistent with the structure estimated by previous methods based on abundance or overlapping sequence information. 
For the dataset of human healthy donors and Sézary syndrome patients, our method also showed robust estimation performance even under the condition of high sparsity in TCR sequences, while previous studies failed to estimate the structure. In addition, we identified the sequences that contribute to the pairwise-sample differences between the repertoires with the different genetic backgrounds of mice. Such identification of the sequences contributing to variation in immune cell repertoires may provide substantial insight for the development of new immunotherapies and vaccines.
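The pipeline outlined in the abstract (pairwise sequence dissimilarities, a low-dimensional embedding, distributions estimated in the embedded space, and an information-theoretic comparison) can be illustrated with a minimal sketch. The abstract does not name the specific dissimilarity, embedding, density estimator, or divergence, so every choice below is an assumption made for illustration: Levenshtein distance, classical multidimensional scaling, a weighted Gaussian kernel density on a grid, and the Jensen-Shannon divergence. The function names, the `bandwidth` and `grid_size` parameters, and the toy sequences are hypothetical and are not taken from the paper.

```python
import numpy as np


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def classical_mds(dist: np.ndarray, n_components: int = 2) -> np.ndarray:
    """Embed points so that Euclidean distances approximate `dist`."""
    n = dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    gram = -0.5 * centering @ (dist ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_components]
    # Negative eigenvalues (non-Euclidean distances) are clipped to zero.
    scale = np.sqrt(np.clip(eigvals[order], 0.0, None))
    return eigvecs[:, order] * scale


def kde_on_grid(points, weights, grid, bandwidth):
    """Weighted Gaussian kernel density on `grid`, normalized to sum to 1."""
    sq = ((grid[:, None, :] - points[None, :, :]) ** 2).sum(axis=2)
    dens = (np.exp(-0.5 * sq / bandwidth ** 2) * weights[None, :]).sum(axis=1)
    return dens / dens.sum()


def js_divergence(p: np.ndarray, q: np.ndarray) -> float:
    """Jensen-Shannon divergence in bits between two discrete distributions."""
    m = 0.5 * (p + q)

    def kl(a, b):
        mask = a > 0
        return float(np.sum(a[mask] * np.log2(a[mask] / b[mask])))

    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def compare_repertoires(samples, n_components=2, bandwidth=1.0, grid_size=20):
    """samples: {sample name: {sequence: count}} -> (names, divergence matrix)."""
    names = sorted(samples)
    seqs = sorted({s for rep in samples.values() for s in rep})
    n = len(seqs)

    # 1. Pairwise dissimilarities over the pooled set of sequences.
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            dist[i, j] = dist[j, i] = levenshtein(seqs[i], seqs[j])

    # 2. Shared low-dimensional embedding for all samples.
    coords = classical_mds(dist, n_components)

    # 3. Per-sample density, weighted by sequence abundance, on a common grid.
    axes = [
        np.linspace(coords[:, k].min() - bandwidth,
                    coords[:, k].max() + bandwidth, grid_size)
        for k in range(n_components)
    ]
    grid = np.stack(np.meshgrid(*axes, indexing="ij"), axis=-1)
    grid = grid.reshape(-1, n_components)
    densities = []
    for name in names:
        counts = np.array([samples[name].get(s, 0) for s in seqs], dtype=float)
        densities.append(kde_on_grid(coords, counts / counts.sum(), grid, bandwidth))

    # 4. Pairwise information-theoretic comparison between samples.
    k = len(names)
    jsd = np.zeros((k, k))
    for a in range(k):
        for b in range(a + 1, k):
            jsd[a, b] = jsd[b, a] = js_divergence(densities[a], densities[b])
    return names, jsd


# Toy input: made-up CDR3-like strings with made-up counts.
samples = {
    "a": {"CASSLGQYEQYF": 5, "CASSLGQYEQFF": 3},
    "b": {"CASSLGQYEQYF": 5, "CASSLGQYEQFF": 3},
    "c": {"CASRPDRGNTIYF": 4, "CASRPDRGNTLYF": 2},
}
names, jsd = compare_repertoires(samples)
```

In this sketch two samples can be compared even when they share no sequences, because the comparison is made between smoothed densities in the embedded space rather than between sets of identical sequences; samples `a` and `c` above have no overlap yet still receive a finite, nonzero divergence, while the identical samples `a` and `b` receive zero. The resulting matrix could then be passed to any hierarchical clustering routine to obtain an inter-sample structure.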

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4ba/5694755/6d7d05412c8e/fimmu-08-01500-g001.jpg
