Suppr超能文献

基于每个残基二级结构约束条件,利用整合长短期记忆网络(LSTM)和注意力机制的轻量级扩散模型进行大型多肽的从头设计。

De Novo Design of Large Polypeptides Using a Lightweight Diffusion Model Integrating LSTM and Attention Mechanism Under Per-Residue Secondary Structure Constraints.

作者信息

Liao Sisheng, Xu Gang, Jin Li, Ma Jianpeng

机构信息

School of Life Sciences, Fudan University, Shanghai 200433, China.

Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200433, China.

出版信息

Molecules. 2025 Feb 28;30(5):1116. doi: 10.3390/molecules30051116.

Abstract

This study presents PolypeptideDesigner (PPD), a novel conditional diffusion-based model for de novo polypeptide sequence design and generation based on per-residue secondary structure conditions. By integrating a lightweight LSTM-attention neural network as the denoiser within a diffusion framework, PPD offers an innovative and efficient approach to polypeptide generation. Evaluations demonstrate that the PPD model can generate diverse and novel polypeptide sequences across various testing conditions, achieving high pLDDT scores when folded by ESMFold. In comparison to the ProteinDiffusionGenerator B (PDG-B) model, a relevant benchmark in the field, PPD exhibits the ability to produce longer and more diverse polypeptide sequences. This improvement is attributed to PPD's optimized architecture and expanded training dataset, which enhance its understanding of protein structural pattern. The PPD model shows significant potential for optimizing functional polypeptides with known structures, paving the way for advancements in biomaterial design. Future work will focus on further refining the model and exploring its broader applications in polypeptide engineering.

摘要

本研究介绍了多肽设计器(PPD),这是一种基于每个残基二级结构条件的、用于从头进行多肽序列设计和生成的新型条件扩散模型。通过在扩散框架内集成一个轻量级的长短期记忆注意力神经网络作为去噪器,PPD提供了一种创新且高效的多肽生成方法。评估表明,PPD模型能够在各种测试条件下生成多样且新颖的多肽序列,经ESMFold折叠后可获得较高的pLDDT分数。与该领域的相关基准模型蛋白质扩散生成器B(PDG-B)相比,PPD展现出能够生成更长且更多样化的多肽序列的能力。这种改进归因于PPD优化的架构和扩展的训练数据集,它们增强了模型对蛋白质结构模式的理解。PPD模型在优化具有已知结构的功能性多肽方面显示出巨大潜力,为生物材料设计的进步铺平了道路。未来的工作将集中于进一步优化该模型,并探索其在多肽工程中的更广泛应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/47f2c7b16597/molecules-30-01116-g001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验