基于机器学习的抗体设计无约束尺度的计算机原理证明。

In silico proof of principle of machine learning-based antibody design at unconstrained scale.

机构信息

Department of Immunology, Oslo University Hospital Rikshospitalet and University of Oslo, Norway.

Department of Biosystems Science and Engineering, ETH Zürich, Basel, Switzerland.

出版信息

MAbs. 2022 Jan-Dec;14(1):2031482. doi: 10.1080/19420862.2022.2031482.

DOI:10.1080/19420862.2022.2031482

PMID:35377271

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8986205/

Abstract

Generative machine learning (ML) has been postulated to become a major driver in the computational design of antigen-specific monoclonal antibodies (mAb). However, efforts to confirm this hypothesis have been hindered by the infeasibility of testing arbitrarily large numbers of antibody sequences for their most critical design parameters: paratope, epitope, affinity, and developability. To address this challenge, we leveraged a lattice-based antibody-antigen binding simulation framework, which incorporates a wide range of physiological antibody-binding parameters. The simulation framework enables the computation of synthetic antibody-antigen 3D-structures, and it functions as an oracle for unrestricted prospective evaluation and benchmarking of antibody design parameters of ML-generated antibody sequences. We found that a deep generative model, trained exclusively on antibody sequence (one dimensional: 1D) data can be used to design conformational (three dimensional: 3D) epitope-specific antibodies, matching, or exceeding the training dataset in affinity and developability parameter value variety. Furthermore, we established a lower threshold of sequence diversity necessary for high-accuracy generative antibody ML and demonstrated that this lower threshold also holds on experimental real-world data. Finally, we show that transfer learning enables the generation of high-affinity antibody sequences from low-N training data. Our work establishes a priori feasibility and the theoretical foundation of high-throughput ML-based mAb design.

摘要

生成式机器学习（ML）被认为是抗原特异性单克隆抗体（mAb）计算设计的主要驱动力。然而，由于无法测试任意数量的抗体序列的最关键设计参数：表位、抗原结合亲和力和可开发性，验证这一假设的努力受到了阻碍。为了解决这一挑战，我们利用了基于格点的抗体-抗原结合模拟框架，该框架结合了广泛的生理抗体结合参数。该模拟框架能够计算合成的抗体-抗原 3D 结构，并作为不受限制的前瞻性评估和基准测试 ML 生成的抗体序列的抗体设计参数的工具。我们发现，仅基于抗体序列（一维：1D）数据训练的深度生成模型可用于设计构象（三维：3D）表位特异性抗体，其亲和力和可开发性参数值的多样性与训练数据集相匹配或超过。此外，我们确定了高精度生成性抗体 ML 所需的序列多样性的下限，并证明该下限在实验真实世界数据上同样适用。最后，我们表明，迁移学习能够从低 N 训练数据生成高亲和力的抗体序列。我们的工作建立了基于 ML 的高通量 mAb 设计的先验可行性和理论基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ee3c/8986205/b9bf2b5c2412/KMAB_A_2031482_F0001_OC.jpg

相似文献

In silico proof of principle of machine learning-based antibody design at unconstrained scale.基于机器学习的抗体设计无约束尺度的计算机原理证明。

MAbs. 2022 Jan-Dec;14(1):2031482. doi: 10.1080/19420862.2022.2031482.

Unconstrained generation of synthetic antibody-antigen structures to guide machine learning methodology for antibody specificity prediction.无约束生成合成抗体-抗原结构，以指导用于抗体特异性预测的机器学习方法。

Nat Comput Sci. 2022 Dec;2(12):845-865. doi: 10.1038/s43588-022-00372-4. Epub 2022 Dec 19.

A compact vocabulary of paratope-epitope interactions enables predictability of antibody-antigen binding.一套简洁的互补位-表位相互作用词汇表能够实现抗体-抗原结合的可预测性。

Cell Rep. 2021 Mar 16;34(11):108856. doi: 10.1016/j.celrep.2021.108856.

Prediction of Paratope-Epitope Pairs Using Convolutional Neural Networks.使用卷积神经网络预测表位-抗体互补决定区（CDR）对。

Int J Mol Sci. 2024 May 16;25(10):5434. doi: 10.3390/ijms25105434.

Progress and challenges for the machine learning-based design of fit-for-purpose monoclonal antibodies.基于机器学习的定制化单克隆抗体设计的进展与挑战。

MAbs. 2022 Jan-Dec;14(1):2008790. doi: 10.1080/19420862.2021.2008790.

Limited conformational flexibility in the paratope may be responsible for degenerate specificity of HIV epitope recognition.变构表位的构象灵活性有限可能是 HIV 表位识别特异性退化的原因。

Int Immunol. 2013 Feb;25(2):77-90. doi: 10.1093/intimm/dxs093. Epub 2012 Sep 11.

Beyond B-Cell Epitopes: Curating Positive Data on Antipeptide Paratope Binding to Support Peptide-Based Vaccine Design.超越 B 细胞表位：整理抗肽变构结合的阳性数据以支持基于肽的疫苗设计。

Protein Pept Lett. 2021;28(8):953-962. doi: 10.2174/0929866528666210218215624.

Machine-learning-based structural analysis of interactions between antibodies and antigens.基于机器学习的抗体与抗原相互作用的结构分析。

Biosystems. 2024 Sep;243:105264. doi: 10.1016/j.biosystems.2024.105264. Epub 2024 Jul 2.

Predicting monoclonal antibody binding sequences from a sparse sampling of all possible sequences.从所有可能序列的稀疏采样中预测单克隆抗体结合序列。

Commun Biol. 2024 Aug 12;7(1):979. doi: 10.1038/s42003-024-06650-3.

Inadequate Reference Datasets Biased toward Short Non-epitopes Confound B-cell Epitope Prediction.偏向短非表位的不充分参考数据集混淆了B细胞表位预测。

J Biol Chem. 2016 Jul 8;291(28):14585-99. doi: 10.1074/jbc.M116.729020. Epub 2016 May 9.

引用本文的文献

Nanodesigner: resolving the complex-CDR interdependency with iterative refinement.纳米设计师：通过迭代优化解决复杂的互补决定区相互依赖性。

J Cheminform. 2025 Aug 7;17(1):120. doi: 10.1186/s13321-025-01069-2.

Applications of Artificial Intelligence in Biotech Drug Discovery and Product Development.人工智能在生物技术药物发现与产品开发中的应用。

MedComm (2020). 2025 Jul 30;6(8):e70317. doi: 10.1002/mco2.70317. eCollection 2025 Aug.

Profiling antigen-binding affinity of B cell repertoires in tumors by deep learning predicts immune-checkpoint inhibitor treatment outcomes.通过深度学习分析肿瘤中B细胞受体库的抗原结合亲和力可预测免疫检查点抑制剂的治疗效果。

Nat Cancer. 2025 Jun 27. doi: 10.1038/s43018-025-01001-5.

NanoBinder: a machine learning assisted nanobody binding prediction tool using Rosetta energy scores.纳米抗体结合预测器：一种使用罗塞塔能量分数的机器学习辅助纳米抗体结合预测工具。

J Cheminform. 2025 Jun 16;17(1):96. doi: 10.1186/s13321-025-01040-1.

AI in optimized cancer treatment: laying the groundwork for interdisciplinary progress.人工智能在优化癌症治疗中的应用：为跨学科进展奠定基础。

Oxf Open Immunol. 2025 May 12;6(1):iqaf004. doi: 10.1093/oxfimm/iqaf004. eCollection 2025.

$\mathcal{S}$ able: bridging the gap in protein structure understanding with an empowering and versatile pre-training paradigm.$\mathcal{S}$ able：通过一种强大且通用的预训练范式弥合蛋白质结构理解方面的差距。

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf120.

Revolutionizing oncology: the role of Artificial Intelligence (AI) as an antibody design, and optimization tools.肿瘤学的变革：人工智能（AI）作为抗体设计与优化工具的作用。

Biomark Res. 2025 Mar 29;13(1):52. doi: 10.1186/s40364-025-00764-4.

A Novel Human Anti-FV mAb as a Potential Tool for Diagnostic and Coagulation Inhibitory Approaches.一种新型人抗FV单克隆抗体作为诊断和凝血抑制方法的潜在工具。

Int J Mol Sci. 2025 Mar 18;26(6):2721. doi: 10.3390/ijms26062721.

Simulation of adaptive immune receptors and repertoires with complex immune information to guide the development and benchmarking of AIRR machine learning.利用复杂免疫信息模拟适应性免疫受体和库，以指导适应性免疫受体库（AIRR）机器学习的开发和基准测试。

Nucleic Acids Res. 2025 Jan 24;53(3). doi: 10.1093/nar/gkaf025.

Deep learning-based design and experimental validation of a medicine-like human antibody library.基于深度学习的类药物人源抗体文库设计与实验验证

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf023.

本文引用的文献

The immuneML ecosystem for machine learning analysis of adaptive immune receptor repertoires.用于适应性免疫受体库机器学习分析的immuneML生态系统。

Nat Mach Intell. 2021 Nov;3(11):936-944. doi: 10.1038/s42256-021-00413-z. Epub 2021 Nov 16.

Deciphering the language of antibodies using self-supervised learning.利用自监督学习破解抗体语言。

Patterns (N Y). 2022 May 18;3(7):100513. doi: 10.1016/j.patter.2022.100513. eCollection 2022 Jul 8.

Massively multiplexed affinity characterization of therapeutic antibodies against SARS-CoV-2 variants.针对严重急性呼吸综合征冠状病毒2（SARS-CoV-2）变体的治疗性抗体的大规模多重亲和特性分析。

Antib Ther. 2022 May 12;5(2):130-137. doi: 10.1093/abt/tbac011. eCollection 2022 Apr.

Antibody structure prediction using interpretable deep learning.使用可解释深度学习进行抗体结构预测。

Patterns (N Y). 2021 Dec 9;3(2):100406. doi: 10.1016/j.patter.2021.100406. eCollection 2022 Feb 11.

Neural networks to learn protein sequence-function relationships from deep mutational scanning data.神经网络从深度突变扫描数据中学习蛋白质序列-功能关系。

Proc Natl Acad Sci U S A. 2021 Nov 30;118(48). doi: 10.1073/pnas.2104878118.

Factors of Influence for Transfer Learning Across Diverse Appearance Domains and Task Types.跨不同外观领域和任务类型的迁移学习影响因素。

IEEE Trans Pattern Anal Mach Intell. 2022 Dec;44(12):9298-9314. doi: 10.1109/TPAMI.2021.3129870. Epub 2022 Nov 7.

Machine Learning Detects Anti-DENV Signatures in Antibody Repertoire Sequences.机器学习可在抗体库序列中检测抗登革病毒特征。

Front Artif Intell. 2021 Oct 11;4:715462. doi: 10.3389/frai.2021.715462. eCollection 2021.

Differentiable biology: using deep learning for biophysics-based and data-driven modeling of molecular mechanisms.可微分生物学：基于深度学习的生物物理和数据驱动的分子机制建模。

Nat Methods. 2021 Oct;18(10):1169-1180. doi: 10.1038/s41592-021-01283-4. Epub 2021 Oct 4.

The challenges with developing therapeutic monoclonal antibodies for pandemic application.开发用于大流行应用的治疗性单克隆抗体所面临的挑战。

Expert Opin Drug Discov. 2022 Jan;17(1):5-8. doi: 10.1080/17460441.2021.1976141. Epub 2021 Sep 10.

: A 3D structural affinity model for multi-epitope vaccine simulations.用于多表位疫苗模拟的三维结构亲和模型。

iScience. 2021 Aug 14;24(9):102979. doi: 10.1016/j.isci.2021.102979. eCollection 2021 Sep 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于机器学习的抗体设计无约束尺度的计算机原理证明。

In silico proof of principle of machine learning-based antibody design at unconstrained scale.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献