Ig-VAE: Generative modeling of protein structure by direct 3D coordinate generation.

Affiliations

Department of Biochemistry, Stanford University, Stanford, California, United States of America.

Department of Statistics, Stanford University, Stanford, California, United States of America.

Publication information

PLoS Comput Biol. 2022 Jun 27;18(6):e1010271. doi: 10.1371/journal.pcbi.1010271. eCollection 2022 Jun.

DOI:10.1371/journal.pcbi.1010271

PMID:35759518

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9269947/

Abstract

While deep learning models have seen increasing applications in protein science, few have been implemented for protein backbone generation, an important task in structure-based problems such as active site and interface design. We present a new approach to building class-specific backbones, using a variational auto-encoder to directly generate the 3D coordinates of immunoglobulins. Our model is torsion- and distance-aware, learns a high-resolution embedding of the dataset, and generates novel, high-quality structures compatible with existing design tools. We show that the Ig-VAE can be used with Rosetta to create a computational model of a SARS-CoV2-RBD binder via latent space sampling. We further demonstrate that the model's generative prior is a powerful tool for guiding computational protein design, motivating a new paradigm under which backbone design is solved as a constrained optimization problem in the latent space of a generative model.
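The idea of treating backbone design as constrained optimization in a latent space can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not the Ig-VAE implementation: the "decoder" is a hypothetical fixed affine map from a 2-D latent vector to three 3-D points, the constraint is a single target distance between two of those points, the prior term is the negative log-density of a standard normal (up to a constant), and gradients are taken by finite differences where a real model would use automatic differentiation.

```python
import math
import random

# Hypothetical stand-in for a trained decoder: a fixed affine map from a
# 2-D latent vector to three 3-D points. A real decoder would be a neural
# network producing full backbone coordinates.
BIAS = [0.0, 0.0, 0.0, 1.5, 1.0, 0.0, 3.0, 0.0, 0.0]
WEIGHTS = [
    [-0.5, 0.0], [0.0, 0.0], [0.0, 0.0],  # point 0 (x, y, z)
    [0.0, 0.0], [0.0, 0.5], [0.0, 0.0],   # point 1 (x, y, z)
    [0.5, 0.0], [0.0, 0.0], [0.0, 0.3],   # point 2 (x, y, z)
]


def decode(z):
    """Map a latent vector to a list of three (x, y, z) tuples."""
    flat = [b + sum(w * v for w, v in zip(row, z))
            for row, b in zip(WEIGHTS, BIAS)]
    return [tuple(flat[i:i + 3]) for i in range(0, len(flat), 3)]


def distance(p, q):
    """Euclidean distance between two points."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def objective(z, target, prior_weight=0.1):
    """Squared constraint violation plus a weighted standard-normal prior term."""
    coords = decode(z)
    constraint = (distance(coords[0], coords[2]) - target) ** 2
    prior = 0.5 * sum(v * v for v in z)
    return constraint + prior_weight * prior


def numerical_gradient(f, z, eps=1e-5):
    """Central finite-difference gradient (a stand-in for autodiff)."""
    grad = []
    for i in range(len(z)):
        hi, lo = list(z), list(z)
        hi[i] += eps
        lo[i] -= eps
        grad.append((f(hi) - f(lo)) / (2 * eps))
    return grad


def optimize_latent(target, z_start, steps=2000, lr=0.05):
    """Plain gradient descent on the latent vector; decoder stays fixed."""
    z = list(z_start)
    for _ in range(steps):
        grad = numerical_gradient(lambda v: objective(v, target), z)
        z = [zi - lr * gi for zi, gi in zip(z, grad)]
    return z


if __name__ == "__main__":
    rng = random.Random(0)
    start = [rng.gauss(0.0, 1.0) for _ in range(2)]  # sample from the prior
    best = optimize_latent(3.8, start)
    pts = decode(best)
    print("latent:", best)
    print("distance between point 0 and point 2:", distance(pts[0], pts[2]))
```

In this sketch the prior term keeps the solution close to the region the decoder was (hypothetically) trained on, so the constraint is met only approximately; raising or lowering `prior_weight` trades constraint satisfaction against staying near the prior.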

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0185/9269947/365c57a45946/pcbi.1010271.g001.jpg

Similar articles

ProtWave-VAE: Integrating Autoregressive Sampling with Latent-Based Inference for Data-Driven Protein Design.

ACS Synth Biol. 2023 Dec 15;12(12):3544-3561. doi: 10.1021/acssynbio.3c00261. Epub 2023 Nov 21.

Deep Generative Models for Molecular Science.

Mol Inform. 2018 Jan;37(1-2). doi: 10.1002/minf.201700133. Epub 2018 Feb 6.

Efficient 3D Molecular Design with an E(3) Invariant Transformer VAE.

J Phys Chem A. 2023 Sep 21;127(37):7844-7852. doi: 10.1021/acs.jpca.3c04188. Epub 2023 Sep 5.

Molecular substructure tree generative model for de novo drug design.

Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab592.

3D-Scaffold: A Deep Learning Framework to Generate 3D Coordinates of Drug-like Molecules with Desired Scaffolds.

J Phys Chem B. 2021 Nov 11;125(44):12166-12176. doi: 10.1021/acs.jpcb.1c06437. Epub 2021 Oct 18.

Embedding high-dimensional Bayesian optimization via generative modeling: Parameter personalization of cardiac electrophysiological models.

Med Image Anal. 2020 May;62:101670. doi: 10.1016/j.media.2020.101670. Epub 2020 Feb 27.

Searching for protein variants with desired properties using deep generative models.

BMC Bioinformatics. 2023 Jul 21;24(1):297. doi: 10.1186/s12859-023-05415-9.

Clustering Analysis via Deep Generative Models With Mixture Models.

IEEE Trans Neural Netw Learn Syst. 2022 Jan;33(1):340-350. doi: 10.1109/TNNLS.2020.3027761. Epub 2022 Jan 5.

Decoding regulatory structures and features from epigenomics profiles: A Roadmap-ENCODE Variational Auto-Encoder (RE-VAE) model.

Methods. 2021 May;189:44-53. doi: 10.1016/j.ymeth.2019.10.012. Epub 2019 Oct 28.

Cited by

Applications of Artificial Intelligence in Biotech Drug Discovery and Product Development.

MedComm (2020). 2025 Jul 30;6(8):e70317. doi: 10.1002/mco2.70317. eCollection 2025 Aug.

Assessing generative model coverage of protein structures with SHAPES.

Cell Syst. 2025 Jul 23:101347. doi: 10.1016/j.cels.2025.101347.

Artificial intelligence-driven computational methods for antibody design and optimization.

MAbs. 2025 Dec;17(1):2528902. doi: 10.1080/19420862.2025.2528902. Epub 2025 Jul 18.

Applying computational protein design to therapeutic antibody discovery - current state and perspectives.

Front Immunol. 2025 May 22;16:1571371. doi: 10.3389/fimmu.2025.1571371. eCollection 2025.

Revolutionizing oncology: the role of Artificial Intelligence (AI) as an antibody design, and optimization tools.

Biomark Res. 2025 Mar 29;13(1):52. doi: 10.1186/s40364-025-00764-4.

Engineering Dehalogenase Enzymes Using Variational Autoencoder-Generated Latent Spaces and Microfluidics.

JACS Au. 2025 Feb 13;5(2):838-850. doi: 10.1021/jacsau.4c01101. eCollection 2025 Feb 24.

Assessing Generative Model Coverage of Protein Structures with SHAPES.

bioRxiv. 2025 Jan 17:2025.01.09.632260. doi: 10.1101/2025.01.09.632260.

Deep learning-based design and experimental validation of a medicine-like human antibody library.

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf023.

De novo protein design with a denoising diffusion network independent of pretrained structure prediction models.

Nat Methods. 2024 Nov;21(11):2107-2116. doi: 10.1038/s41592-024-02437-w. Epub 2024 Oct 9.

Deep learning in template-free de novo biosynthetic pathway design of natural products.

Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae495.

References

Antibody structure prediction using interpretable deep learning.

Patterns (N Y). 2021 Dec 9;3(2):100406. doi: 10.1016/j.patter.2021.100406. eCollection 2022 Feb 11.

Protein sequence design with a learned potential.

Nat Commun. 2022 Feb 8;13(1):746. doi: 10.1038/s41467-022-28313-9.

De novo protein design by deep network hallucination.

Nature. 2021 Dec;600(7889):547-552. doi: 10.1038/s41586-021-04184-w. Epub 2021 Dec 1.

Structure-based protein design with deep learning.

Curr Opin Chem Biol. 2021 Dec;65:136-144. doi: 10.1016/j.cbpa.2021.08.004. Epub 2021 Sep 20.

Protein design and variant prediction using autoregressive generative models.

Nat Commun. 2021 Apr 23;12(1):2403. doi: 10.1038/s41467-021-22732-w.

Neutralizing nanobodies bind SARS-CoV-2 spike RBD and block interaction with ACE2.

Nat Struct Mol Biol. 2020 Sep;27(9):846-854. doi: 10.1038/s41594-020-0469-6. Epub 2020 Jul 13.

Potential Role of ACE2 in Coronavirus Disease 2019 (COVID-19) Prevention and Management.

J Transl Int Med. 2020 May 9;8(1):9-19. doi: 10.2478/jtim-2020-0003. eCollection 2020 Mar.

Computational design of closely related proteins that adopt two well-defined but structurally divergent folds.

Proc Natl Acad Sci U S A. 2020 Mar 31;117(13):7208-7215. doi: 10.1073/pnas.1914808117. Epub 2020 Mar 18.

Structure-guided discovery of a single-domain antibody agonist against human apelin receptor.

Sci Adv. 2020 Jan 15;6(3):eaax7379. doi: 10.1126/sciadv.aax7379. eCollection 2020 Jan.

Improved protein structure prediction using potentials from deep learning.

Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.
