• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物活性肽的分类:模型与编码的系统基准测试

Classification of bioactive peptides: A systematic benchmark of models and encodings.

作者信息

Bizzotto Edoardo, Zampieri Guido, Treu Laura, Filannino Pasquale, Di Cagno Raffaella, Campanaro Stefano

机构信息

Department of Biology, University of Padua, Via U. Bassi 58/b, Padova 35131, Italy.

Department of Soil, Plant and Food Science, University of Bari Aldo Moro, Via G. Amendola 165/a, Bari 70126, Italy.

出版信息

Comput Struct Biotechnol J. 2024 May 23;23:2442-2452. doi: 10.1016/j.csbj.2024.05.040. eCollection 2024 Dec.

DOI:10.1016/j.csbj.2024.05.040
PMID:38867723
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11168199/
Abstract

Bioactive peptides are short amino acid chains possessing biological activity and exerting physiological effects relevant to human health. Despite their therapeutic value, their identification remains a major problem, as it mainly relies on time-consuming in vitro tests. While bioinformatic tools for the identification of bioactive peptides are available, they are focused on specific functional classes and have not been systematically tested on realistic settings. To tackle this problem, bioactive peptide sequences and functions were here gathered from a variety of databases to generate a unified collection of bioactive peptides from microbial fermentation. This collection was organized into nine functional classes including some previously studied and some unexplored such as immunomodulatory, opioid and cardiovascular peptides. Upon assessing their sequence properties, four alternative encoding methods were tested in combination with a multitude of machine learning algorithms, from basic classifiers like logistic regression to advanced algorithms like BERT. Tests on a total of 171 models showed that, while some functions are intrinsically easier to detect, no single combination of classifiers and encoders worked universally well for all classes. For this reason, we unified all the best individual models for each class and generated CICERON (Classification of bIoaCtive pEptides fRom micrObial fermeNtation), a classification tool for the functional classification of peptides. State-of-the-art classifiers were found to underperform on our realistic benchmark dataset compared to the models included in CICERON. Altogether, our work provides a tool for real-world peptide classification and can serve as a benchmark for future model development.

摘要

生物活性肽是具有生物活性的短氨基酸链,对人体健康发挥着相关生理作用。尽管它们具有治疗价值,但其鉴定仍然是一个主要问题,因为这主要依赖于耗时的体外试验。虽然有用于鉴定生物活性肽的生物信息学工具,但它们专注于特定的功能类别,尚未在实际环境中进行系统测试。为了解决这个问题,这里从各种数据库收集了生物活性肽序列和功能,以生成一个来自微生物发酵的生物活性肽统一集合。这个集合被组织成九个功能类别,包括一些以前研究过的和一些未探索的类别,如免疫调节肽、阿片样肽和心血管肽。在评估它们的序列特性后,测试了四种替代编码方法,并与多种机器学习算法相结合,从逻辑回归等基本分类器到BERT等先进算法。对总共171个模型的测试表明,虽然有些功能本质上更容易检测,但没有一种分类器和编码器的组合对所有类别都普遍适用。因此,我们统一了每个类别的所有最佳个体模型,并生成了CICERON(来自微生物发酵的生物活性肽分类),这是一种用于肽功能分类的工具。与CICERON中包含的模型相比,发现最先进的分类器在我们的实际基准数据集上表现不佳。总之,我们的工作提供了一种用于现实世界肽分类的工具,并可作为未来模型开发的基准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/2eb1684bc264/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/bd2624648771/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/297a7881ddc7/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/ee54cbfe105e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/2eb1684bc264/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/bd2624648771/ga1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/297a7881ddc7/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/ee54cbfe105e/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/2eb1684bc264/gr3.jpg

相似文献

1
Classification of bioactive peptides: A systematic benchmark of models and encodings.生物活性肽的分类:模型与编码的系统基准测试
Comput Struct Biotechnol J. 2024 May 23;23:2442-2452. doi: 10.1016/j.csbj.2024.05.040. eCollection 2024 Dec.
2
CAPTURE: Comprehensive anti-cancer peptide predictor with a unique amino acid sequence encoder.CAPTURE:具有独特氨基酸序列编码器的综合抗癌肽预测器。
Comput Biol Med. 2024 Jun;176:108538. doi: 10.1016/j.compbiomed.2024.108538. Epub 2024 May 3.
3
Encodings and models for antimicrobial peptide classification for multi-resistant pathogens.用于多重耐药病原体抗菌肽分类的编码与模型
BioData Min. 2019 Mar 4;12:7. doi: 10.1186/s13040-019-0196-x. eCollection 2019.
4
Bioactive Peptide Recognition Based on NLP Pre-Train Algorithm.基于 NLP 预训练算法的生物活性肽识别。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Nov-Dec;20(6):3809-3819. doi: 10.1109/TCBB.2023.3323295. Epub 2023 Dec 25.
5
Deep2Pep: A deep learning method in multi-label classification of bioactive peptide.Deep2Pep:一种用于生物活性肽多标签分类的深度学习方法。
Comput Biol Chem. 2024 Apr;109:108021. doi: 10.1016/j.compbiolchem.2024.108021. Epub 2024 Jan 22.
6
UMPred-FRL: A New Approach for Accurate Prediction of Umami Peptides Using Feature Representation Learning.UMPred-FRL:一种使用特征表示学习准确预测鲜味肽的新方法。
Int J Mol Sci. 2021 Dec 4;22(23):13124. doi: 10.3390/ijms222313124.
7
SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.支持向量机折叠法:一种用于判别式多类别蛋白质折叠和超家族识别的工具。
BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.
8
NTpred: a robust and precise machine learning framework for in silico identification of Tyrosine nitration sites in protein sequences.NTpred:一种用于在蛋白质序列中通过计算机模拟鉴定酪氨酸硝化位点的强大且精确的机器学习框架。
Brief Funct Genomics. 2024 Mar 20;23(2):163-179. doi: 10.1093/bfgp/elad018.
9
PredAPP: Predicting Anti-Parasitic Peptides with Undersampling and Ensemble Approaches.PredAPP:采用欠采样和集成方法预测抗寄生虫肽。
Interdiscip Sci. 2022 Mar;14(1):258-268. doi: 10.1007/s12539-021-00484-x. Epub 2021 Oct 4.
10
Putting microbes to work: dairy fermentation, cell factories and bioactive peptides. Part II: bioactive peptide functions.让微生物发挥作用:乳制品发酵、细胞工厂与生物活性肽。第二部分:生物活性肽的功能
Biotechnol J. 2007 Apr;2(4):435-49. doi: 10.1002/biot.200700045.

引用本文的文献

1
Recent advances in bioactive hydrogel microspheres: Material engineering strategies and biomedical prospects.生物活性水凝胶微球的最新进展:材料工程策略与生物医学前景
Mater Today Bio. 2025 Feb 25;31:101614. doi: 10.1016/j.mtbio.2025.101614. eCollection 2025 Apr.
2
AI Methods for Antimicrobial Peptides: Progress and Challenges.抗菌肽的人工智能方法:进展与挑战
Microb Biotechnol. 2025 Jan;18(1):e70072. doi: 10.1111/1751-7915.70072.
3
A Novel Workflow for In Silico Prediction of Bioactive Peptides: An Exploration of By-Products.

本文引用的文献

1
Anticancer Potential of Antimicrobial Peptides: Focus on Buforins.抗菌肽的抗癌潜力:聚焦于蟾蜍抗菌肽
Polymers (Basel). 2024 Mar 7;16(6):728. doi: 10.3390/polym16060728.
2
Antioxidant and Renin Inhibitory Activities of Peptides from Food Proteins on Hypertension: A Review.食物蛋白肽的抗氧化和肾素抑制活性对高血压的作用:综述。
Plant Foods Hum Nutr. 2023 Sep;78(3):493-505. doi: 10.1007/s11130-023-01085-3. Epub 2023 Aug 14.
3
ACP-MLC: A two-level prediction engine for identification of anticancer peptides and multi-label classification of their functional types.
一种新型生物活性肽计算机预测工作流程:对副产物的探索。
Biomolecules. 2024 Jul 31;14(8):930. doi: 10.3390/biom14080930.
ACP-MLC:一种用于识别抗癌肽及其功能类型多标签分类的两级预测引擎。
Comput Biol Med. 2023 May;158:106844. doi: 10.1016/j.compbiomed.2023.106844. Epub 2023 Apr 4.
4
UniDL4BioPep: a universal deep learning architecture for binary classification in peptide bioactivity.UniDL4BioPep:用于肽生物活性二元分类的通用深度学习架构。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad135.
5
Designing antimicrobial peptides using deep learning and molecular dynamic simulations.利用深度学习和分子动力学模拟设计抗菌肽。
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad058.
6
Spent Yeast Waste Streams as a Sustainable Source of Bioactive Peptides for Skin Applications.废酵母废料流作为皮肤应用的生物活性肽的可持续来源。
Int J Mol Sci. 2023 Jan 23;24(3):2253. doi: 10.3390/ijms24032253.
7
Prediction of celiac disease associated epitopes and motifs in a protein.预测蛋白质中与乳糜泻相关的表位和基序。
Front Immunol. 2023 Jan 19;14:1056101. doi: 10.3389/fimmu.2023.1056101. eCollection 2023.
8
Prediction of antioxidant peptides using a quantitative structure-activity relationship predictor (AnOxPP) based on bidirectional long short-term memory neural network and interpretable amino acid descriptors.基于双向长短期记忆神经网络和可解释氨基酸描述符的抗氧化肽定量构效关系预测器 (AnOxPP) 的抗氧化肽预测。
Comput Biol Med. 2023 Mar;154:106591. doi: 10.1016/j.compbiomed.2023.106591. Epub 2023 Jan 24.
9
AFP-MFL: accurate identification of antifungal peptides using multi-view feature learning.AFP-MFL:使用多视图特征学习准确识别抗真菌肽
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac606.
10
The Therapeutic Potential of Naturally Occurring Peptides in Counteracting SH-SY5Y Cells Injury.天然肽在对抗 SH-SY5Y 细胞损伤中的治疗潜力。
Int J Mol Sci. 2022 Oct 4;23(19):11778. doi: 10.3390/ijms231911778.