Clustering based approach for population level identification of condition-associated T-cell receptor β-chain CDR3 sequences.

Affiliations

Research Programs Unit, Translational Immunology, University of Helsinki, Helsinki, Finland.

Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland.

Publication information

BMC Bioinformatics. 2021 Mar 25;22(1):159. doi: 10.1186/s12859-021-04087-7.

DOI:10.1186/s12859-021-04087-7

PMID:33765908

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7993519/

Abstract

BACKGROUND

Deep immune receptor sequencing, RepSeq, provides unprecedented opportunities for identifying and studying condition-associated T-cell clonotypes, represented by T-cell receptor (TCR) CDR3 sequences. However, due to the immense diversity of the immune repertoire, identification of condition-relevant TCR CDR3s from total repertoires has mostly been limited to either "public" CDR3 sequences or to comparisons of CDR3 frequencies observed in a single individual. A methodology for the identification of condition-associated TCR CDR3s by direct population-level comparison of RepSeq samples is currently lacking.

RESULTS

We present a method for direct population-level comparison of RepSeq samples using immune repertoire sub-units (or sub-repertoires) that are shared across individuals. The method first performs unsupervised clustering of CDR3s within each sample. It then finds matching clusters across samples, called immune sub-repertoires, and performs statistical differential abundance testing at the level of the identified sub-repertoires. Finally, it ranks CDR3s in differentially abundant sub-repertoires for relevance to the condition. We applied the method to total TCR CDR3β RepSeq datasets of celiac disease patients, as well as to public datasets of yellow fever vaccination. The method successfully identified celiac disease-associated CDR3β sequences, as evidenced by considerable agreement of TRBV-gene and positional amino acid usage patterns in the detected CDR3β sequences with previously known CDR3βs specific to gluten in celiac disease. It also recovered significantly higher numbers of previously known CDR3β sequences relevant to each condition than would be expected by chance.
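The four steps above (within-sample clustering, cross-sample matching, differential abundance testing, CDR3 ranking) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the clustering features, matching rule, or statistical test, so the sketch assumes single-linkage clustering at Hamming distance 1, matching of clusters by near-identical consensus sequences, an exact permutation test on the fraction of a sample's distinct CDR3s, and ranking by case-minus-control occurrence. All function names and the toy CDR3 sequences are hypothetical and invented for illustration.

```python
from collections import Counter
from itertools import combinations

def hamming(a, b):
    """Mismatch count for equal-length strings; infinite for unequal lengths."""
    if len(a) != len(b):
        return float("inf")
    return sum(x != y for x, y in zip(a, b))

def single_linkage(seqs, max_dist=1):
    """Union-find grouping: sequences within max_dist mismatches share a cluster."""
    seqs = sorted(set(seqs))
    parent = list(range(len(seqs)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(seqs)), 2):
        if hamming(seqs[i], seqs[j]) <= max_dist:
            parent[find(i)] = find(j)
    clusters = {}
    for i, s in enumerate(seqs):
        clusters.setdefault(find(i), []).append(s)
    return list(clusters.values())

def consensus(cluster):
    """Most common residue per position (a cluster holds equal-length CDR3s)."""
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*cluster))

def sub_repertoires(samples, max_dist=1):
    """Steps 1-2: cluster within each sample, then match clusters across samples."""
    by_consensus = {}
    for sid, seqs in samples.items():
        for cluster in single_linkage(seqs, max_dist):
            by_consensus.setdefault(consensus(cluster), []).append((sid, cluster))
    return [[m for c in group for m in by_consensus[c]]
            for group in single_linkage(by_consensus, max_dist)]

def abundance(members, samples):
    """Fraction of each sample's distinct CDR3s that fall in the sub-repertoire."""
    counts = Counter()
    for sid, cluster in members:
        counts[sid] += len(cluster)
    return {sid: counts[sid] / len(set(seqs)) for sid, seqs in samples.items()}

def permutation_p(values, cases):
    """Step 3: exact one-sided permutation p-value for higher mean in cases."""
    ids = sorted(values)

    def diff(group):
        rest = [i for i in ids if i not in group]
        return (sum(values[i] for i in group) / len(group)
                - sum(values[i] for i in rest) / len(rest))

    observed = diff(cases)
    perms = [diff(set(g)) for g in combinations(ids, len(cases))]
    return sum(d >= observed - 1e-12 for d in perms) / len(perms)

def rank_cdr3s(members, cases):
    """Step 4: rank CDR3s by (case samples - control samples) containing them."""
    score = Counter()
    for sid, cluster in members:
        for s in cluster:
            score[s] += 1 if sid in cases else -1
    return sorted(score, key=lambda s: (-score[s], s))

# Toy, made-up repertoires: three "cases" share a family of similar CDR3s.
samples = {
    "case1": ["CASSLRSTDTQYF", "CASSLRATDTQYF", "CASSQETQYF"],
    "case2": ["CASSLRSTDTQYF", "CASSLRYTDTQYF", "CASRGGNEQFF"],
    "case3": ["CASSLRSTDTQYF", "CASSIRSTDTQYF", "CSARDRGYEQYF"],
    "ctrl1": ["CASSPGQGYEQYF", "CASSQETQYF"],
    "ctrl2": ["CASSLGETQYF", "CASRGGNEQFF"],
    "ctrl3": ["CASSQETQYF", "CSARDRGYEQYF"],
}
cases = {"case1", "case2", "case3"}

results = []
for members in sub_repertoires(samples):
    p = permutation_p(abundance(members, samples), cases)
    results.append((p, rank_cdr3s(members, cases)))
results.sort()
best_p, best_ranked = results[0]
print(best_p, best_ranked)
```

On this toy input the sub-repertoire with the smallest p-value is the family of four similar 13-residue sequences found only in the case samples. With three cases and three controls there are only 20 possible labelings, so the smallest attainable p-value is 0.05; real cohorts would also need multiple-testing correction across sub-repertoires, which the sketch omits.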

CONCLUSION

We conclude that immune sub-repertoires with similar immuno-genomic features shared across unrelated individuals can serve as viable units of immune repertoire comparison and as a proxy for identifying condition-associated CDR3s.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f5a8/7993519/545caec1939d/12859_2021_4087_Fig1_HTML.jpg
