• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从 B 细胞受体深度测序数据推断每个样本的免疫球蛋白种系。

Per-sample immunoglobulin germline inference from B cell receptor deep sequencing data.

机构信息

Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America.

出版信息

PLoS Comput Biol. 2019 Jul 22;15(7):e1007133. doi: 10.1371/journal.pcbi.1007133. eCollection 2019 Jul.

DOI:10.1371/journal.pcbi.1007133
PMID:31329576
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6675132/
Abstract

The collection of immunoglobulin genes in an individual's germline, which gives rise to B cell receptors via recombination, is known to vary significantly across individuals. In humans, for example, each individual has only a fraction of the several hundred known V alleles. Furthermore, the currently-accepted set of known V alleles is both incomplete (particularly for non-European samples), and contains a significant number of spurious alleles. The resulting uncertainty as to which immunoglobulin alleles are present in any given sample results in inaccurate B cell receptor sequence annotations, and in particular inaccurate inferred naive ancestors. In this paper we first show that the currently widespread practice of aligning each sequence to its closest match in the full set of IMGT alleles results in a very large number of spurious alleles that are not in the sample's true set of germline V alleles. We then describe a new method for inferring each individual's germline gene set from deep sequencing data, and show that it improves upon existing methods by making a detailed comparison on a variety of simulated and real data samples. This new method has been integrated into the partis annotation and clonal family inference package, available at https://github.com/psathyrella/partis, and is run by default without affecting overall run time.

摘要

个体的免疫球蛋白基因在个体的胚系中收集,通过重组产生 B 细胞受体,已知在个体之间存在显著差异。例如,在人类中,每个个体只有几百个已知 V 等位基因中的一部分。此外,目前公认的已知 V 等位基因集既不完整(特别是对于非欧洲样本),也包含大量虚假等位基因。由于对任何给定样本中存在哪些免疫球蛋白等位基因存在不确定性,导致 B 细胞受体序列注释不准确,特别是推断的幼稚祖先不准确。在本文中,我们首先表明,目前广泛采用的将每个序列与完整的 IMGT 等位基因集中的最接近匹配进行对齐的做法,会导致大量虚假等位基因,这些基因不在样本的真实胚系 V 等位基因集中。然后,我们描述了一种从深度测序数据推断每个个体的胚系基因集的新方法,并表明它通过在各种模拟和真实数据样本上进行详细比较,改进了现有方法。该新方法已集成到 partis 注释和克隆家族推断包中,可在 https://github.com/psathyrella/partis 上获得,并默认运行,不会影响整体运行时间。

相似文献

1
Per-sample immunoglobulin germline inference from B cell receptor deep sequencing data.从 B 细胞受体深度测序数据推断每个样本的免疫球蛋白种系。
PLoS Comput Biol. 2019 Jul 22;15(7):e1007133. doi: 10.1371/journal.pcbi.1007133. eCollection 2019 Jul.
2
Inferred Allelic Variants of Immunoglobulin Receptor Genes: A System for Their Evaluation, Documentation, and Naming.推断的免疫球蛋白受体基因等位变体:一种用于评估、记录和命名的系统。
Front Immunol. 2019 Mar 18;10:435. doi: 10.3389/fimmu.2019.00435. eCollection 2019.
3
Critical steps for computational inference of the 3'-end of novel alleles of immunoglobulin heavy chain variable genes - illustrated by an allele of IGHV3-7.计算推断免疫球蛋白重链可变基因新型等位基因 3’末端的关键步骤 - 以 IGHV3-7 等位基因为例。
Mol Immunol. 2018 Nov;103:1-6. doi: 10.1016/j.molimm.2018.08.018. Epub 2018 Aug 30.
4
Automated analysis of high-throughput B-cell sequencing data reveals a high frequency of novel immunoglobulin V gene segment alleles.高通量B细胞测序数据的自动化分析揭示了新型免疫球蛋白V基因片段等位基因的高频率。
Proc Natl Acad Sci U S A. 2015 Feb 24;112(8):E862-70. doi: 10.1073/pnas.1417683112. Epub 2015 Feb 9.
5
IMGT(®) tools for the nucleotide analysis of immunoglobulin (IG) and T cell receptor (TR) V-(D)-J repertoires, polymorphisms, and IG mutations: IMGT/V-QUEST and IMGT/HighV-QUEST for NGS.用于免疫球蛋白(IG)和T细胞受体(TR)V-(D)-J基因库、多态性及IG突变核苷酸分析的IMGT(®)工具:用于二代测序的IMGT/V-QUEST和IMGT/HighV-QUEST
Methods Mol Biol. 2012;882:569-604. doi: 10.1007/978-1-61779-842-9_32.
6
Using B cell receptor lineage structures to predict affinity.利用 B 细胞受体谱系结构预测亲和力。
PLoS Comput Biol. 2020 Nov 11;16(11):e1008391. doi: 10.1371/journal.pcbi.1008391. eCollection 2020 Nov.
7
IGHV allele similarity clustering improves genotype inference from adaptive immune receptor repertoire sequencing data.IGHV 等位基因相似聚类可提高适应性免疫受体谱系测序数据的基因型推断。
Nucleic Acids Res. 2023 Sep 8;51(16):e86. doi: 10.1093/nar/gkad603.
8
TypeLoader2: Automated submission of novel HLA and killer-cell immunoglobulin-like receptor alleles in full length.TypeLoader2:全长自动提交新的 HLA 和杀伤细胞免疫球蛋白样受体等位基因。
HLA. 2019 Apr;93(4):195-202. doi: 10.1111/tan.13508.
9
Consistency of VDJ Rearrangement and Substitution Parameters Enables Accurate B Cell Receptor Sequence Annotation.VDJ重排和替换参数的一致性可实现准确的B细胞受体序列注释。
PLoS Comput Biol. 2016 Jan 11;12(1):e1004409. doi: 10.1371/journal.pcbi.1004409. eCollection 2016 Jan.
10
High-Quality Library Preparation for NGS-Based Immunoglobulin Germline Gene Inference and Repertoire Expression Analysis.基于下一代测序的高质量文库制备用于免疫球蛋白胚系基因推断和库表达分析。
Front Immunol. 2019 Apr 5;10:660. doi: 10.3389/fimmu.2019.00660. eCollection 2019.

引用本文的文献

1
Thrifty wide-context models of B cell receptor somatic hypermutation.B细胞受体体细胞超突变的节俭宽背景模型
Elife. 2025 Aug 29;14:RP105471. doi: 10.7554/eLife.105471.
2
A Sitewise Model of Natural Selection on Individual Antibodies via a Transformer-Encoder.一种通过Transformer编码器对个体抗体进行自然选择的位点特异性模型。
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf186.
3
Genotyping of selected germline adaptive immune system loci using short-read sequencing data.利用短读长测序数据对选定的种系适应性免疫系统基因座进行基因分型。

本文引用的文献

1
Worldwide genetic variation of the IGHV and TRBV immune receptor gene families in humans.人类免疫球蛋白重链和 T 细胞受体β可变基因家族的全球遗传变异。
Life Sci Alliance. 2019 Feb 26;2(2). doi: 10.26508/lsa.201800221. Print 2019 Apr.
2
Gene-Specific Substitution Profiles Describe the Types and Frequencies of Amino Acid Changes during Antibody Somatic Hypermutation.基因特异性替换图谱描述了抗体体细胞高频突变过程中氨基酸变化的类型和频率。
Front Immunol. 2017 May 10;8:537. doi: 10.3389/fimmu.2017.00537. eCollection 2017.
3
Comment on "A Database of Human Immune Receptor Alleles Recovered from Population Sequencing Data".
Genome Res. 2025 Sep 2;35(9):2076-2086. doi: 10.1101/gr.280314.124.
4
Nucleotide context models outperform protein language models for predicting antibody affinity maturation.在预测抗体亲和力成熟方面,核苷酸上下文模型优于蛋白质语言模型。
bioRxiv. 2025 Jun 18:2025.06.16.659977. doi: 10.1101/2025.06.16.659977.
5
Mapping essential somatic hypermutations in a CD4-binding site bNAb informs HIV-1 vaccine design.绘制CD4结合位点广谱中和抗体中的关键体细胞超突变可指导HIV-1疫苗设计。
Cell Rep. 2025 May 27;44(5):115713. doi: 10.1016/j.celrep.2025.115713. Epub 2025 May 15.
6
Detection of PCR chimeras in adaptive immune receptor repertoire sequencing using hidden Markov models.使用隐马尔可夫模型在适应性免疫受体组库测序中检测PCR嵌合体。
bioRxiv. 2025 Feb 26:2025.02.21.638809. doi: 10.1101/2025.02.21.638809.
7
Thrifty wide-context models of B cell receptor somatic hypermutation.B细胞受体体细胞超突变的节俭宽背景模型
bioRxiv. 2025 May 19:2024.11.26.625407. doi: 10.1101/2024.11.26.625407.
8
Learning antibody sequence constraints from allelic inclusion.从等位基因包含中学习抗体序列约束条件。
bioRxiv. 2024 Oct 25:2024.10.22.619760. doi: 10.1101/2024.10.22.619760.
9
Ultrasensitive allele inference from immune repertoire sequencing data with MiXCR.使用MiXCR从免疫组库测序数据中进行超灵敏等位基因推断。
Genome Res. 2024 Dec 23;34(12):2293-2303. doi: 10.1101/gr.278775.123.
10
Full-length single-cell BCR sequencing paired with RNA sequencing reveals convergent responses to pneumococcal vaccination.全长单细胞 BCR 测序与 RNA 测序相结合揭示了对肺炎球菌疫苗接种的趋同反应。
Commun Biol. 2024 Sep 28;7(1):1208. doi: 10.1038/s42003-024-06823-0.
关于《从群体测序数据中恢复的人类免疫受体等位基因数据库》的评论
J Immunol. 2017 May 1;198(9):3371-3373. doi: 10.4049/jimmunol.1700306.
4
A Database of Human Immune Receptor Alleles Recovered from Population Sequencing Data.从群体测序数据中获得的人类免疫受体等位基因数据库。
J Immunol. 2017 Mar 1;198(5):2202-2210. doi: 10.4049/jimmunol.1601710. Epub 2017 Jan 23.
5
Dysregulation of B Cell Repertoire Formation in Myasthenia Gravis Patients Revealed through Deep Sequencing.通过深度测序揭示重症肌无力患者B细胞库形成的失调
J Immunol. 2017 Feb 15;198(4):1460-1473. doi: 10.4049/jimmunol.1601415. Epub 2017 Jan 13.
6
Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity.生成个体化 V 基因数据库揭示了高度的免疫球蛋白遗传多样性。
Nat Commun. 2016 Dec 20;7:13642. doi: 10.1038/ncomms13642.
7
Likelihood-Based Inference of B Cell Clonal Families.基于似然性的B细胞克隆家族推断
PLoS Comput Biol. 2016 Oct 17;12(10):e1005086. doi: 10.1371/journal.pcbi.1005086. eCollection 2016 Oct.
8
A Model of Somatic Hypermutation Targeting in Mice Based on High-Throughput Ig Sequencing Data.基于高通量Ig序列数据的小鼠体细胞超突变靶向模型。
J Immunol. 2016 Nov 1;197(9):3566-3574. doi: 10.4049/jimmunol.1502263. Epub 2016 Oct 5.
9
IGHV1-69 polymorphism modulates anti-influenza antibody repertoires, correlates with IGHV utilization shifts and varies by ethnicity.IGHV1-69基因多态性调节抗流感抗体库,与IGHV基因使用的变化相关,且因种族而异。
Sci Rep. 2016 Feb 16;6:20842. doi: 10.1038/srep20842.
10
Consistency of VDJ Rearrangement and Substitution Parameters Enables Accurate B Cell Receptor Sequence Annotation.VDJ重排和替换参数的一致性可实现准确的B细胞受体序列注释。
PLoS Comput Biol. 2016 Jan 11;12(1):e1004409. doi: 10.1371/journal.pcbi.1004409. eCollection 2016 Jan.