生物活性肽的分类：模型与编码的系统基准测试

Classification of bioactive peptides: A systematic benchmark of models and encodings.

作者信息

Bizzotto Edoardo, Zampieri Guido, Treu Laura, Filannino Pasquale, Di Cagno Raffaella, Campanaro Stefano

机构信息

Department of Biology, University of Padua, Via U. Bassi 58/b, Padova 35131, Italy.

Department of Soil, Plant and Food Science, University of Bari Aldo Moro, Via G. Amendola 165/a, Bari 70126, Italy.

出版信息

Comput Struct Biotechnol J. 2024 May 23;23:2442-2452. doi: 10.1016/j.csbj.2024.05.040. eCollection 2024 Dec.

DOI:10.1016/j.csbj.2024.05.040

PMID:38867723

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11168199/

Abstract

Bioactive peptides are short amino acid chains possessing biological activity and exerting physiological effects relevant to human health. Despite their therapeutic value, their identification remains a major problem, as it mainly relies on time-consuming in vitro tests. While bioinformatic tools for the identification of bioactive peptides are available, they are focused on specific functional classes and have not been systematically tested on realistic settings. To tackle this problem, bioactive peptide sequences and functions were here gathered from a variety of databases to generate a unified collection of bioactive peptides from microbial fermentation. This collection was organized into nine functional classes including some previously studied and some unexplored such as immunomodulatory, opioid and cardiovascular peptides. Upon assessing their sequence properties, four alternative encoding methods were tested in combination with a multitude of machine learning algorithms, from basic classifiers like logistic regression to advanced algorithms like BERT. Tests on a total of 171 models showed that, while some functions are intrinsically easier to detect, no single combination of classifiers and encoders worked universally well for all classes. For this reason, we unified all the best individual models for each class and generated CICERON (Classification of bIoaCtive pEptides fRom micrObial fermeNtation), a classification tool for the functional classification of peptides. State-of-the-art classifiers were found to underperform on our realistic benchmark dataset compared to the models included in CICERON. Altogether, our work provides a tool for real-world peptide classification and can serve as a benchmark for future model development.

摘要

生物活性肽是具有生物活性的短氨基酸链，对人体健康发挥着相关生理作用。尽管它们具有治疗价值，但其鉴定仍然是一个主要问题，因为这主要依赖于耗时的体外试验。虽然有用于鉴定生物活性肽的生物信息学工具，但它们专注于特定的功能类别，尚未在实际环境中进行系统测试。为了解决这个问题，这里从各种数据库收集了生物活性肽序列和功能，以生成一个来自微生物发酵的生物活性肽统一集合。这个集合被组织成九个功能类别，包括一些以前研究过的和一些未探索的类别，如免疫调节肽、阿片样肽和心血管肽。在评估它们的序列特性后，测试了四种替代编码方法，并与多种机器学习算法相结合，从逻辑回归等基本分类器到BERT等先进算法。对总共171个模型的测试表明，虽然有些功能本质上更容易检测，但没有一种分类器和编码器的组合对所有类别都普遍适用。因此，我们统一了每个类别的所有最佳个体模型，并生成了CICERON（来自微生物发酵的生物活性肽分类），这是一种用于肽功能分类的工具。与CICERON中包含的模型相比，发现最先进的分类器在我们的实际基准数据集上表现不佳。总之，我们的工作提供了一种用于现实世界肽分类的工具，并可作为未来模型开发的基准。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e33e/11168199/bd2624648771/ga1.jpg

相似文献

Classification of bioactive peptides: A systematic benchmark of models and encodings.

Comput Struct Biotechnol J. 2024 May 23;23:2442-2452. doi: 10.1016/j.csbj.2024.05.040. eCollection 2024 Dec.

CAPTURE: Comprehensive anti-cancer peptide predictor with a unique amino acid sequence encoder.

Comput Biol Med. 2024 Jun;176:108538. doi: 10.1016/j.compbiomed.2024.108538. Epub 2024 May 3.

Encodings and models for antimicrobial peptide classification for multi-resistant pathogens.

BioData Min. 2019 Mar 4;12:7. doi: 10.1186/s13040-019-0196-x. eCollection 2019.

Bioactive Peptide Recognition Based on NLP Pre-Train Algorithm.

IEEE/ACM Trans Comput Biol Bioinform. 2023 Nov-Dec;20(6):3809-3819. doi: 10.1109/TCBB.2023.3323295. Epub 2023 Dec 25.

Deep2Pep: A deep learning method in multi-label classification of bioactive peptide.

Comput Biol Chem. 2024 Apr;109:108021. doi: 10.1016/j.compbiolchem.2024.108021. Epub 2024 Jan 22.

UMPred-FRL: A New Approach for Accurate Prediction of Umami Peptides Using Feature Representation Learning.

Int J Mol Sci. 2021 Dec 4;22(23):13124. doi: 10.3390/ijms222313124.

SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.

BMC Bioinformatics. 2007 May 22;8 Suppl 4(Suppl 4):S2. doi: 10.1186/1471-2105-8-S4-S2.

NTpred: a robust and precise machine learning framework for in silico identification of Tyrosine nitration sites in protein sequences.

Brief Funct Genomics. 2024 Mar 20;23(2):163-179. doi: 10.1093/bfgp/elad018.

PredAPP: Predicting Anti-Parasitic Peptides with Undersampling and Ensemble Approaches.

Interdiscip Sci. 2022 Mar;14(1):258-268. doi: 10.1007/s12539-021-00484-x. Epub 2021 Oct 4.

Putting microbes to work: dairy fermentation, cell factories and bioactive peptides. Part II: bioactive peptide functions.

Biotechnol J. 2007 Apr;2(4):435-49. doi: 10.1002/biot.200700045.

引用本文的文献

Recent advances in bioactive hydrogel microspheres: Material engineering strategies and biomedical prospects.

Mater Today Bio. 2025 Feb 25;31:101614. doi: 10.1016/j.mtbio.2025.101614. eCollection 2025 Apr.

AI Methods for Antimicrobial Peptides: Progress and Challenges.

Microb Biotechnol. 2025 Jan;18(1):e70072. doi: 10.1111/1751-7915.70072.

A Novel Workflow for In Silico Prediction of Bioactive Peptides: An Exploration of By-Products.

Biomolecules. 2024 Jul 31;14(8):930. doi: 10.3390/biom14080930.

本文引用的文献

Anticancer Potential of Antimicrobial Peptides: Focus on Buforins.

Polymers (Basel). 2024 Mar 7;16(6):728. doi: 10.3390/polym16060728.

Antioxidant and Renin Inhibitory Activities of Peptides from Food Proteins on Hypertension: A Review.

Plant Foods Hum Nutr. 2023 Sep;78(3):493-505. doi: 10.1007/s11130-023-01085-3. Epub 2023 Aug 14.

ACP-MLC: A two-level prediction engine for identification of anticancer peptides and multi-label classification of their functional types.

Comput Biol Med. 2023 May;158:106844. doi: 10.1016/j.compbiomed.2023.106844. Epub 2023 Apr 4.

UniDL4BioPep: a universal deep learning architecture for binary classification in peptide bioactivity.

Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad135.

Designing antimicrobial peptides using deep learning and molecular dynamic simulations.

Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad058.

Spent Yeast Waste Streams as a Sustainable Source of Bioactive Peptides for Skin Applications.

Int J Mol Sci. 2023 Jan 23;24(3):2253. doi: 10.3390/ijms24032253.

Prediction of celiac disease associated epitopes and motifs in a protein.

Front Immunol. 2023 Jan 19;14:1056101. doi: 10.3389/fimmu.2023.1056101. eCollection 2023.

Prediction of antioxidant peptides using a quantitative structure-activity relationship predictor (AnOxPP) based on bidirectional long short-term memory neural network and interpretable amino acid descriptors.

Comput Biol Med. 2023 Mar;154:106591. doi: 10.1016/j.compbiomed.2023.106591. Epub 2023 Jan 24.

AFP-MFL: accurate identification of antifungal peptides using multi-view feature learning.

Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac606.

The Therapeutic Potential of Naturally Occurring Peptides in Counteracting SH-SY5Y Cells Injury.

Int J Mol Sci. 2022 Oct 4;23(19):11778. doi: 10.3390/ijms231911778.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

生物活性肽的分类：模型与编码的系统基准测试

Classification of bioactive peptides: A systematic benchmark of models and encodings.

作者信息

Bizzotto Edoardo, Zampieri Guido, Treu Laura, Filannino Pasquale, Di Cagno Raffaella, Campanaro Stefano

机构信息

Department of Biology, University of Padua, Via U. Bassi 58/b, Padova 35131, Italy.

Department of Soil, Plant and Food Science, University of Bari Aldo Moro, Via G. Amendola 165/a, Bari 70126, Italy.

出版信息

Comput Struct Biotechnol J. 2024 May 23;23:2442-2452. doi: 10.1016/j.csbj.2024.05.040. eCollection 2024 Dec.

DOI:10.1016/j.csbj.2024.05.040

PMID:38867723

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11168199/

Abstract

摘要

生物活性肽的分类：模型与编码的系统基准测试

Classification of bioactive peptides: A systematic benchmark of models and encodings.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

生物活性肽的分类：模型与编码的系统基准测试

Classification of bioactive peptides: A systematic benchmark of models and encodings.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献