• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于每个残基二级结构约束条件,利用整合长短期记忆网络(LSTM)和注意力机制的轻量级扩散模型进行大型多肽的从头设计。

De Novo Design of Large Polypeptides Using a Lightweight Diffusion Model Integrating LSTM and Attention Mechanism Under Per-Residue Secondary Structure Constraints.

作者信息

Liao Sisheng, Xu Gang, Jin Li, Ma Jianpeng

机构信息

School of Life Sciences, Fudan University, Shanghai 200433, China.

Multiscale Research Institute of Complex Systems, Fudan University, Shanghai 200433, China.

出版信息

Molecules. 2025 Feb 28;30(5):1116. doi: 10.3390/molecules30051116.

DOI:10.3390/molecules30051116
PMID:40076339
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11902264/
Abstract

This study presents PolypeptideDesigner (PPD), a novel conditional diffusion-based model for de novo polypeptide sequence design and generation based on per-residue secondary structure conditions. By integrating a lightweight LSTM-attention neural network as the denoiser within a diffusion framework, PPD offers an innovative and efficient approach to polypeptide generation. Evaluations demonstrate that the PPD model can generate diverse and novel polypeptide sequences across various testing conditions, achieving high pLDDT scores when folded by ESMFold. In comparison to the ProteinDiffusionGenerator B (PDG-B) model, a relevant benchmark in the field, PPD exhibits the ability to produce longer and more diverse polypeptide sequences. This improvement is attributed to PPD's optimized architecture and expanded training dataset, which enhance its understanding of protein structural pattern. The PPD model shows significant potential for optimizing functional polypeptides with known structures, paving the way for advancements in biomaterial design. Future work will focus on further refining the model and exploring its broader applications in polypeptide engineering.

摘要

本研究介绍了多肽设计器(PPD),这是一种基于每个残基二级结构条件的、用于从头进行多肽序列设计和生成的新型条件扩散模型。通过在扩散框架内集成一个轻量级的长短期记忆注意力神经网络作为去噪器,PPD提供了一种创新且高效的多肽生成方法。评估表明,PPD模型能够在各种测试条件下生成多样且新颖的多肽序列,经ESMFold折叠后可获得较高的pLDDT分数。与该领域的相关基准模型蛋白质扩散生成器B(PDG-B)相比,PPD展现出能够生成更长且更多样化的多肽序列的能力。这种改进归因于PPD优化的架构和扩展的训练数据集,它们增强了模型对蛋白质结构模式的理解。PPD模型在优化具有已知结构的功能性多肽方面显示出巨大潜力,为生物材料设计的进步铺平了道路。未来的工作将集中于进一步优化该模型,并探索其在多肽工程中的更广泛应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/b03beb08773b/molecules-30-01116-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/47f2c7b16597/molecules-30-01116-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/faee3d92c774/molecules-30-01116-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/24eef1199a6d/molecules-30-01116-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/d97d622b8faf/molecules-30-01116-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/1a32e823a3f2/molecules-30-01116-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/f545ccd48a65/molecules-30-01116-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/b03beb08773b/molecules-30-01116-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/47f2c7b16597/molecules-30-01116-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/faee3d92c774/molecules-30-01116-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/24eef1199a6d/molecules-30-01116-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/d97d622b8faf/molecules-30-01116-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/1a32e823a3f2/molecules-30-01116-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/f545ccd48a65/molecules-30-01116-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1d54/11902264/b03beb08773b/molecules-30-01116-g007.jpg

相似文献

1
De Novo Design of Large Polypeptides Using a Lightweight Diffusion Model Integrating LSTM and Attention Mechanism Under Per-Residue Secondary Structure Constraints.基于每个残基二级结构约束条件,利用整合长短期记忆网络(LSTM)和注意力机制的轻量级扩散模型进行大型多肽的从头设计。
Molecules. 2025 Feb 28;30(5):1116. doi: 10.3390/molecules30051116.
2
Random, de novo, and conserved proteins: How structure and disorder predictors perform differently.随机、从头开始和保守的蛋白质:结构和无序预测器的表现有何不同。
Proteins. 2024 Jun;92(6):757-767. doi: 10.1002/prot.26652. Epub 2024 Jan 16.
3
Generative design of proteins based on secondary structure constraints using an attention-based diffusion model.基于二级结构约束,使用基于注意力的扩散模型进行蛋白质的生成式设计。
Chem. 2023 Jul 13;9(7):1828-1849. doi: 10.1016/j.chempr.2023.03.020. Epub 2023 Apr 20.
4
Simulated folding in polypeptides of diversified molecular tacticity: implications for protein folding and de novo design.不同分子规整性多肽的模拟折叠:对蛋白质折叠和从头设计的启示
Biopolymers. 2005 Jun 5;78(2):96-105. doi: 10.1002/bip.20241.
5
ILMCNet: A Deep Neural Network Model That Uses PLM to Process Features and Employs CRF to Predict Protein Secondary Structure.ILMCNet:一种利用 PLM 处理特征并采用 CRF 预测蛋白质二级结构的深度神经网络模型。
Genes (Basel). 2024 Oct 21;15(10):1350. doi: 10.3390/genes15101350.
6
Recurrent Neural Network Model for Constructive Peptide Design.用于构建肽设计的递归神经网络模型。
J Chem Inf Model. 2018 Feb 26;58(2):472-479. doi: 10.1021/acs.jcim.7b00414. Epub 2018 Jan 22.
7
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
8
Protein secondary structure assignment using residual networks.使用残差网络进行蛋白质二级结构预测。
J Mol Model. 2022 Aug 23;28(9):269. doi: 10.1007/s00894-022-05271-z.
9
Direct Prediction of the Toxic Gas Diffusion Rule in a Real Environment Based on LSTM.基于 LSTM 的真实环境下有毒气体扩散规律的直接预测。
Int J Environ Res Public Health. 2019 Jun 17;16(12):2133. doi: 10.3390/ijerph16122133.
10
De novo profile generation based on sequence context specificity with the long short-term memory network.基于序列上下文特异性的长短期记忆网络从头生成模型。
BMC Bioinformatics. 2018 Jul 18;19(1):272. doi: 10.1186/s12859-018-2284-1.

本文引用的文献

1
Rapid in silico directed evolution by a protein language model with EVOLVEpro.通过带有EVOLVEpro的蛋白质语言模型进行快速计算机辅助定向进化。
Science. 2025 Jan 24;387(6732):eadr6006. doi: 10.1126/science.adr6006.
2
De novo protein design with a denoising diffusion network independent of pretrained structure prediction models.基于去噪扩散网络的从头蛋白质设计,无需预先训练的结构预测模型。
Nat Methods. 2024 Nov;21(11):2107-2116. doi: 10.1038/s41592-024-02437-w. Epub 2024 Oct 9.
3
Multistate and functional protein design using RoseTTAFold sequence space diffusion.
使用RoseTTAFold序列空间扩散进行多状态和功能性蛋白质设计。
Nat Biotechnol. 2024 Sep 25. doi: 10.1038/s41587-024-02395-w.
4
Unsupervised evolution of protein and antibody complexes with a structure-informed language model.无监督的蛋白质和抗体复合物的进化与结构信息语言模型。
Science. 2024 Jul 5;385(6704):46-53. doi: 10.1126/science.adk8946. Epub 2024 Jul 4.
5
Accurate structure prediction of biomolecular interactions with AlphaFold 3.利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
6
Generalized biomolecular modeling and design with RoseTTAFold All-Atom.基于 RoseTTAFold All-Atom 的广义生物分子建模与设计。
Science. 2024 Apr 19;384(6693):eadl2528. doi: 10.1126/science.adl2528.
7
Single-sequence protein structure prediction using supervised transformer protein language models.使用监督式转换器蛋白质语言模型进行单序列蛋白质结构预测。
Nat Comput Sci. 2022 Dec;2(12):804-814. doi: 10.1038/s43588-022-00373-3. Epub 2022 Dec 19.
8
Illuminating protein space with a programmable generative model.用可编程生成模型照亮蛋白质空间。
Nature. 2023 Nov;623(7989):1070-1078. doi: 10.1038/s41586-023-06728-8. Epub 2023 Nov 15.
9
PET image denoising based on denoising diffusion probabilistic model.基于去噪扩散概率模型的 PET 图像去噪。
Eur J Nucl Med Mol Imaging. 2024 Jan;51(2):358-368. doi: 10.1007/s00259-023-06417-8. Epub 2023 Oct 3.
10
Generative design of proteins based on secondary structure constraints using an attention-based diffusion model.基于二级结构约束,使用基于注意力的扩散模型进行蛋白质的生成式设计。
Chem. 2023 Jul 13;9(7):1828-1849. doi: 10.1016/j.chempr.2023.03.020. Epub 2023 Apr 20.