• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

对 40 亿个人类抗体可变区的大规模数据挖掘揭示了治疗性抗体和天然抗体之间的趋同,这限制了生物药物发现的搜索空间。

Large-scale data mining of four billion human antibody variable regions reveals convergence between therapeutic and natural antibodies that constrains search space for biologics drug discovery.

机构信息

NaturalAntibody, Szczecin, Poland.

Discovery Data Science, Genmab, Copenhagen, Denmark.

出版信息

MAbs. 2024 Jan-Dec;16(1):2361928. doi: 10.1080/19420862.2024.2361928. Epub 2024 Jun 6.

DOI:10.1080/19420862.2024.2361928
PMID:38844871
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11164219/
Abstract

The naïve human antibody repertoire has theoretical access to an estimated > 10 antibodies. Identifying subsets of this prohibitively large space where therapeutically relevant antibodies may be found is useful for development of these agents. It was previously demonstrated that, despite the immense sequence space, different individuals can produce the same antibodies. It was also shown that therapeutic antibodies, which typically follow seemingly unnatural development processes, can arise independently naturally. To check for biases in how the sequence space is explored, we data mined public repositories to identify 220 bioprojects with a combined seven billion reads. Of these, we created a subset of human bioprojects that we make available as the AbNGS database (https://naturalantibody.com/ngs/). AbNGS contains 135 bioprojects with four billion productive human heavy variable region sequences and 385 million unique complementarity-determining region (CDR)-H3s. We find that 270,000 (0.07% of 385 million) unique CDR-H3s are highly public in that they occur in at least five of 135 bioprojects. Of 700 unique therapeutic CDR-H3, a total of 6% has direct matches in the small set of 270,000. This observation extends to a match between CDR-H3 and V-gene call as well. Thus, the subspace of shared ('public') CDR-H3s shows utility for serving as a starting point for therapeutic antibody design.

摘要

人类幼稚抗体库理论上可以接触到估计超过 10 亿种抗体。识别出这个庞大空间中的亚群,其中可能存在有治疗意义的抗体,这对于这些药物的开发很有用。先前已经证明,尽管序列空间巨大,但不同的个体可以产生相同的抗体。也表明,尽管治疗性抗体通常遵循看似非自然的开发过程,但它们可以自然独立地产生。为了检查序列空间的探索是否存在偏差,我们从公共存储库中挖掘数据,以确定 220 个生物项目,共有 70 亿个读数。在这些项目中,我们创建了一个人类生物项目子集,我们将其作为 AbNGS 数据库(https://naturalantibody.com/ngs/)提供。AbNGS 包含 135 个生物项目,其中包含 40 亿个人类重链可变区序列和 3.85 亿个独特的互补决定区(CDR)-H3。我们发现,27 万(3850 万的 0.07%)个独特的 CDR-H3 在至少 135 个生物项目中的 5 个以上高度公开。在 700 个独特的治疗性 CDR-H3 中,共有 6%在这 27 万的小集中有直接匹配。这一观察结果也扩展到了 CDR-H3 和 V 基因调用之间的匹配。因此,共享(“公共”)CDR-H3 的子空间对于作为治疗性抗体设计的起点很有用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/ce70ddf4a784/KMAB_A_2361928_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/5d44bd24fcfc/KMAB_A_2361928_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/8e3dc2d6bd25/KMAB_A_2361928_F0002_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/437f9d325e19/KMAB_A_2361928_F0003_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/ce70ddf4a784/KMAB_A_2361928_F0004_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/5d44bd24fcfc/KMAB_A_2361928_F0001_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/8e3dc2d6bd25/KMAB_A_2361928_F0002_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/437f9d325e19/KMAB_A_2361928_F0003_OC.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da0b/11164219/ce70ddf4a784/KMAB_A_2361928_F0004_OC.jpg

相似文献

1
Large-scale data mining of four billion human antibody variable regions reveals convergence between therapeutic and natural antibodies that constrains search space for biologics drug discovery.对 40 亿个人类抗体可变区的大规模数据挖掘揭示了治疗性抗体和天然抗体之间的趋同,这限制了生物药物发现的搜索空间。
MAbs. 2024 Jan-Dec;16(1):2361928. doi: 10.1080/19420862.2024.2361928. Epub 2024 Jun 6.
2
The functional repertoire of rabbit antibodies and antibody discovery via next-generation sequencing.兔抗体的功能库及通过下一代测序进行抗体发现
MAbs. 2014 May-Jun;6(3):628-36. doi: 10.4161/mabs.28059. Epub 2014 Jan 30.
3
Analysis of 5518 unique, productively rearranged human VH3-23*01 gene sequences reveals CDR-H3 length-dependent usage of the IGHD2 gene family.对5518个独特的、发生有效重排的人类VH3-23*01基因序列进行分析,揭示了IGHD2基因家族的互补决定区H3(CDR-H3)长度依赖性使用情况。
Protein Eng Des Sel. 2017 Sep 1;30(9):603-609. doi: 10.1093/protein/gzx027.
4
Looking for therapeutic antibodies in next-generation sequencing repositories.从下一代测序数据库中寻找治疗性抗体。
MAbs. 2019 Oct;11(7):1197-1205. doi: 10.1080/19420862.2019.1633884. Epub 2019 Jul 17.
5
Diversity in the Cow Ultralong CDR H3 Antibody Repertoire.奶牛超长互补决定区H3抗体库的多样性。
Front Immunol. 2018 Jun 4;9:1262. doi: 10.3389/fimmu.2018.01262. eCollection 2018.
6
The Unusual Genetics and Biochemistry of Bovine Immunoglobulins.牛免疫球蛋白的不寻常遗传学和生物化学。
Adv Immunol. 2018;137:135-164. doi: 10.1016/bs.ai.2017.12.004. Epub 2018 Feb 9.
7
Alterations in B cell development, CDR-H3 repertoire and dsDNA-binding antibody production among C57BL/6 ΔD-iD mice congenic for the lupus susceptibility loci sle1, sle2 or sle3.在狼疮易感性位点sle1、sle2或sle3同基因的C57BL/6 ΔD-iD小鼠中B细胞发育、互补决定区-重链3(CDR-H3)库和双链DNA结合抗体产生的改变。
Autoimmunity. 2017 Feb;50(1):42-51. doi: 10.1080/08916934.2016.1272597.
8
Comparison of antibody repertoires produced by HIV-1 infection, other chronic and acute infections, and systemic autoimmune disease.比较 HIV-1 感染、其他慢性和急性感染以及系统性自身免疫性疾病产生的抗体库。
PLoS One. 2011 Mar 30;6(3):e16857. doi: 10.1371/journal.pone.0016857.
9
Landscape of Non-canonical Cysteines in Human V Repertoire Revealed by Immunogenetic Analysis.免疫遗传学分析揭示人类 V 区中非典型半胱氨酸景观。
Cell Rep. 2020 Jun 30;31(13):107831. doi: 10.1016/j.celrep.2020.107831.
10
A Broad Role for Cysteines in Bovine Antibody Diversity.半胱氨酸在牛抗体多样性中的广泛作用。
Immunohorizons. 2019 Oct 16;3(10):478-487. doi: 10.4049/immunohorizons.1900058.

引用本文的文献

1
Conserved heavy/light contacts and germline preferences revealed by a large-scale analysis of natively paired human antibody sequences and structural data.通过对天然配对的人类抗体序列和结构数据的大规模分析揭示的保守重链/轻链接触和种系偏好。
Commun Biol. 2025 Jul 26;8(1):1110. doi: 10.1038/s42003-025-08388-y.
2
Applying computational protein design to therapeutic antibody discovery - current state and perspectives.将计算蛋白质设计应用于治疗性抗体发现——现状与展望。
Front Immunol. 2025 May 22;16:1571371. doi: 10.3389/fimmu.2025.1571371. eCollection 2025.
3
RIOT-Rapid Immunoglobulin Overview Tool-annotation of nucleotide and amino acid immunoglobulin sequences using an open germline database.

本文引用的文献

1
Benchmarking antibody clustering methods using sequence, structural, and machine learning similarity measures for antibody discovery applications.使用序列、结构和机器学习相似性度量对抗体聚类方法进行基准测试,以用于抗体发现应用。
Front Mol Biosci. 2024 Mar 28;11:1352508. doi: 10.3389/fmolb.2024.1352508. eCollection 2024.
2
Insights into next generation sequencing guided antibody selection strategies.下一代测序指导的抗体选择策略的见解。
Sci Rep. 2023 Oct 26;13(1):18370. doi: 10.1038/s41598-023-45538-w.
3
Structural modeling of antibody variable regions using deep learning-progress and perspectives on drug discovery.
RIOT-快速免疫球蛋白概述工具-使用开放种系数据库对核苷酸和氨基酸免疫球蛋白序列进行注释。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae632.
利用深度学习进行抗体可变区的结构建模——药物发现的进展与展望
Front Mol Biosci. 2023 Jul 7;10:1214424. doi: 10.3389/fmolb.2023.1214424. eCollection 2023.
4
Identifying developability risks for clinical progression of antibodies using high-throughput in vitro and in silico approaches.利用高通量体外和计算方法鉴定抗体临床进展的可开发性风险。
MAbs. 2023 Jan-Dec;15(1):2200540. doi: 10.1080/19420862.2023.2200540.
5
Germline-encoded amino acid-binding motifs drive immunodominant public antibody responses.胚系编码的氨基酸结合基序驱动免疫显性公共抗体反应。
Science. 2023 Apr 7;380(6640):eadc9498. doi: 10.1126/science.adc9498.
6
2022 FDA approvals.2022年美国食品药品监督管理局的批准情况。
Nat Rev Drug Discov. 2023 Feb;22(2):83-88. doi: 10.1038/d41573-023-00001-3.
7
Antibodies to watch in 2023.2023 年值得关注的抗体药物
MAbs. 2023 Jan-Dec;15(1):2153410. doi: 10.1080/19420862.2022.2153410.
8
Simultaneous affinity maturation and developability enhancement using natural liability-free CDRs.利用天然无不良倾向的 CDR 同时进行亲和力成熟和可开发性增强。
MAbs. 2022 Jan-Dec;14(1):2115200. doi: 10.1080/19420862.2022.2115200.
9
Deciphering the language of antibodies using self-supervised learning.利用自监督学习破解抗体语言。
Patterns (N Y). 2022 May 18;3(7):100513. doi: 10.1016/j.patter.2022.100513. eCollection 2022 Jul 8.
10
Machine-designed biotherapeutics: opportunities, feasibility and advantages of deep learning in computational antibody discovery.机器设计的生物疗法:深度学习在计算性抗体发现中的机会、可行性和优势。
Brief Bioinform. 2022 Jul 18;23(4). doi: 10.1093/bib/bbac267.