• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CPL-Diff:一种用于从头设计固定长度功能肽序列的扩散模型。

CPL-Diff: A Diffusion Model for De Novo Design of Functional Peptide Sequences with Fixed Length.

作者信息

Luo Zhenjie, Geng Aoyun, Wei Leyi, Zou Quan, Cui Feifei, Zhang Zilong

机构信息

College of Computer Science and Technology, Hainan University, No. 58, Renmin Avenue, Haikou, 570228, China.

Centre for Artificial Intelligence driven Drug Discovery, Faculty of Applied Science, Macao Polytechnic University, Macao SAR, 999078, China.

出版信息

Adv Sci (Weinh). 2025 May;12(20):e2412926. doi: 10.1002/advs.202412926. Epub 2025 Apr 15.

DOI:10.1002/advs.202412926
PMID:40231709
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12120732/
Abstract

Peptides are recognized as next-generation therapeutic drugs due to their unique properties and are essential for treating human diseases. In recent years, a number of deep generation models for generating peptides have been proposed and have shown great potential. However, these models cannot well control the length of the generated sequence, while the sequence length has a very important impact on the physical and chemical properties and therapeutic effects of peptides. Here, a diffusion model is introduced, capable of controlling the length of generated functional peptide sequences, named CPL-Diff. CPL-Diff can control the length of generated polypeptide sequences using only attention masking. Additionally, CPL-Diff can generate single-functional polypeptide sequences based on given conditional information. Experiments demonstrate that the peptides generated by CPL-Diff exhibit lower perplexity and similarity compared to those produced by the current state-of-the-art models, and further exhibit relevant physicochemical properties similar to real sequences. The interpretability analysis is also performed on CPL-Diff to understand how it controls the length of generated sequences and the decision-making process involved in generating polypeptide sequences, with the aim of providing important theoretical guidance for polypeptide design. The code for CPL-Diff is available at https://github.com/luozhenjie1997/CPL-Diff.

摘要

由于其独特的性质,肽被认为是下一代治疗药物,对治疗人类疾病至关重要。近年来,已经提出了许多用于生成肽的深度生成模型,并显示出巨大的潜力。然而,这些模型不能很好地控制生成序列的长度,而序列长度对肽的物理化学性质和治疗效果有非常重要的影响。在此,引入了一种能够控制生成的功能性肽序列长度的扩散模型,名为CPL-Diff。CPL-Diff仅使用注意力掩码就能控制生成的多肽序列的长度。此外,CPL-Diff可以根据给定的条件信息生成单功能多肽序列。实验表明,与当前最先进的模型生成的肽相比,CPL-Diff生成的肽具有更低的困惑度和相似度,并且进一步表现出与真实序列相似的相关物理化学性质。还对CPL-Diff进行了解释性分析,以了解它如何控制生成序列的长度以及生成多肽序列所涉及的决策过程,旨在为多肽设计提供重要的理论指导。CPL-Diff的代码可在https://github.com/luozhenjie1997/CPL-Diff获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/8ee5cb84b72b/ADVS-12-2412926-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/9752aad7f34b/ADVS-12-2412926-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/93c521f6d742/ADVS-12-2412926-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/bb565733e7fa/ADVS-12-2412926-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/a7d5a2233bef/ADVS-12-2412926-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/b099e9328432/ADVS-12-2412926-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/789b1ae16294/ADVS-12-2412926-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/f217c89654ca/ADVS-12-2412926-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/caf8fa4ad2f6/ADVS-12-2412926-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/8ee5cb84b72b/ADVS-12-2412926-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/9752aad7f34b/ADVS-12-2412926-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/93c521f6d742/ADVS-12-2412926-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/bb565733e7fa/ADVS-12-2412926-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/a7d5a2233bef/ADVS-12-2412926-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/b099e9328432/ADVS-12-2412926-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/789b1ae16294/ADVS-12-2412926-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/f217c89654ca/ADVS-12-2412926-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/caf8fa4ad2f6/ADVS-12-2412926-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/097d/12120732/8ee5cb84b72b/ADVS-12-2412926-g007.jpg

相似文献

1
CPL-Diff: A Diffusion Model for De Novo Design of Functional Peptide Sequences with Fixed Length.CPL-Diff:一种用于从头设计固定长度功能肽序列的扩散模型。
Adv Sci (Weinh). 2025 May;12(20):e2412926. doi: 10.1002/advs.202412926. Epub 2025 Apr 15.
2
ProT-Diff: A Modularized and Efficient Strategy for De Novo Generation of Antimicrobial Peptide Sequences by Integrating Protein Language and Diffusion Models.ProT-Diff:一种通过整合蛋白质语言模型和扩散模型从头生成抗菌肽序列的模块化高效策略。
Adv Sci (Weinh). 2024 Nov;11(43):e2406305. doi: 10.1002/advs.202406305. Epub 2024 Sep 25.
3
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
4
GM-Pep: A High Efficiency Strategy to De Novo Design Functional Peptide Sequences.GM-Pep:从头设计功能肽序列的高效策略。
J Chem Inf Model. 2022 May 23;62(10):2617-2629. doi: 10.1021/acs.jcim.2c00089. Epub 2022 May 9.
5
De Novo Design of Large Polypeptides Using a Lightweight Diffusion Model Integrating LSTM and Attention Mechanism Under Per-Residue Secondary Structure Constraints.基于每个残基二级结构约束条件,利用整合长短期记忆网络(LSTM)和注意力机制的轻量级扩散模型进行大型多肽的从头设计。
Molecules. 2025 Feb 28;30(5):1116. doi: 10.3390/molecules30051116.
6
PFB-Diff: Progressive Feature Blending diffusion for text-driven image editing.PFB-Diff:用于文本驱动图像编辑的渐进特征融合扩散
Neural Netw. 2025 Jan;181:106777. doi: 10.1016/j.neunet.2024.106777. Epub 2024 Oct 9.
7
Diff-Retinex++: Retinex-Driven Reinforced Diffusion Model for Low-Light Image Enhancement.Diff-Retinex++:用于低光图像增强的基于视网膜模型的强化扩散模型
IEEE Trans Pattern Anal Mach Intell. 2025 Aug;47(8):6823-6841. doi: 10.1109/TPAMI.2025.3563612.
8
HelixGAN a deep-learning methodology for conditional de novo design of α-helix structures.HelixGAN:一种用于从头设计α-螺旋结构的条件生成对抗网络方法。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btad036.
9
Identification of human viral protein-derived ligands recognized by individual MHCI-restricted T-cell receptors.鉴定由单个MHC I类限制性T细胞受体识别的人类病毒蛋白衍生配体。
Immunol Cell Biol. 2016 Jul;94(6):573-82. doi: 10.1038/icb.2016.12. Epub 2016 Feb 5.
10
Deep Learning-Based Bioactive Therapeutic Peptide Generation and Screening.基于深度学习的生物活性治疗性肽生成与筛选。
J Chem Inf Model. 2023 Feb 13;63(3):835-845. doi: 10.1021/acs.jcim.2c01485. Epub 2023 Feb 1.

本文引用的文献

1
Stack-AVP: A Stacked Ensemble Predictor Based on Multi-view Information for Fast and Accurate Discovery of Antiviral Peptides.堆叠式抗血管加压素:一种基于多视图信息的堆叠集成预测器,用于快速准确地发现抗病毒肽。
J Mol Biol. 2025 Mar 15;437(6):168853. doi: 10.1016/j.jmb.2024.168853. Epub 2024 Nov 6.
2
ProT-Diff: A Modularized and Efficient Strategy for De Novo Generation of Antimicrobial Peptide Sequences by Integrating Protein Language and Diffusion Models.ProT-Diff:一种通过整合蛋白质语言模型和扩散模型从头生成抗菌肽序列的模块化高效策略。
Adv Sci (Weinh). 2024 Nov;11(43):e2406305. doi: 10.1002/advs.202406305. Epub 2024 Sep 25.
3
Design and Synthesis of Antifungal Peptides Guided by Quantitative Antifungal Activity.
定量抗真菌活性指导的抗真菌肽的设计与合成。
J Chem Inf Model. 2024 May 27;64(10):4277-4285. doi: 10.1021/acs.jcim.4c00142. Epub 2024 May 14.
4
ForceGen: End-to-end de novo protein generation based on nonlinear mechanical unfolding responses using a language diffusion model.ForceGen:基于语言扩散模型的非线性机械展开响应的从头开始的蛋白质从头生成。
Sci Adv. 2024 Feb 9;10(6):eadl4000. doi: 10.1126/sciadv.adl4000. Epub 2024 Feb 7.
5
Generative models for protein sequence modeling: recent advances and future directions.蛋白质序列建模的生成模型:最新进展和未来方向。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad358.
6
Structure of a fungal 1,3-β-glucan synthase.真菌 1,3-β-葡聚糖合酶的结构。
Sci Adv. 2023 Sep 15;9(37):eadh7820. doi: 10.1126/sciadv.adh7820. Epub 2023 Sep 13.
7
Generative design of proteins based on secondary structure constraints using an attention-based diffusion model.基于二级结构约束,使用基于注意力的扩散模型进行蛋白质的生成式设计。
Chem. 2023 Jul 13;9(7):1828-1849. doi: 10.1016/j.chempr.2023.03.020. Epub 2023 Apr 20.
8
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
9
Discovering highly potent antimicrobial peptides with deep generative model HydrAMP.利用深度生成模型HydrAMP发现高效抗菌肽。
Nat Commun. 2023 Mar 15;14(1):1453. doi: 10.1038/s41467-023-36994-z.
10
Large language models generate functional protein sequences across diverse families.大型语言模型可生成不同家族的功能性蛋白质序列。
Nat Biotechnol. 2023 Aug;41(8):1099-1106. doi: 10.1038/s41587-022-01618-2. Epub 2023 Jan 26.