• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 NLP 预训练算法的生物活性肽识别。

Bioactive Peptide Recognition Based on NLP Pre-Train Algorithm.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2023 Nov-Dec;20(6):3809-3819. doi: 10.1109/TCBB.2023.3323295. Epub 2023 Dec 25.

DOI:10.1109/TCBB.2023.3323295
PMID:37815965
Abstract

Bioactive peptides are defined as peptide sequences within a protein that can regulate important bodily functions through their myriad activities. With the development of machine learning, more computational methods were proposed for bioactive peptides recognition so that this task does not only rely on tedious and time-consuming wet-experiment. But the training and testing process of existing models are limited to small datasets, which affects model performance. Inspired by the success of sequence classification in natural language processing with unlabeled data, we proposed a pre-training method for Bioactive peptides recognition. By pre-trained with large-scale of protein sequences, our method achieved the best performance in multiple functional peptides identification including anti-cancer, anti-diabetic, anti-hypertensive, anti-inflammatory and anti-microbial peptides. Compared with the advanced model, our model's precision, coverage, accuracy and absolute true are improved by 7.2%, 6.9%, 6.1% and 4.2% in the result of 5-fold cross-validation. In addition, the results indicate the model has superior prediction performance in single functional peptides recognition, especially for anti-cancer peptides and anti-microbial peptides which with longer sequences.

摘要

生物活性肽是指蛋白质中的肽序列,通过其多种活性可以调节重要的身体功能。随着机器学习的发展,提出了更多用于生物活性肽识别的计算方法,使这项任务不仅依赖于繁琐且耗时的湿实验。但是,现有模型的训练和测试过程仅限于小数据集,这会影响模型性能。受自然语言处理中使用未标记数据进行序列分类成功的启发,我们提出了一种用于生物活性肽识别的预训练方法。通过对大规模蛋白质序列进行预训练,我们的方法在多种功能肽识别中取得了最佳性能,包括抗癌肽、抗糖尿病肽、抗高血压肽、抗炎肽和抗菌肽。与先进模型相比,我们的模型在 5 倍交叉验证的结果中,精度、覆盖度、准确率和绝对真度分别提高了 7.2%、6.9%、6.1%和 4.2%。此外,结果表明该模型在单功能肽识别方面具有卓越的预测性能,尤其是对于序列较长的抗癌肽和抗菌肽。

相似文献

1
Bioactive Peptide Recognition Based on NLP Pre-Train Algorithm.基于 NLP 预训练算法的生物活性肽识别。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Nov-Dec;20(6):3809-3819. doi: 10.1109/TCBB.2023.3323295. Epub 2023 Dec 25.
2
A novel antibacterial peptide recognition algorithm based on BERT.基于 BERT 的新型抗菌肽识别算法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab200.
3
Identifying multi-functional bioactive peptide functions using multi-label deep learning.利用多标签深度学习识别多功能生物活性肽功能。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab414.
4
An Intelligent System for Classifying Patient Complaints Using Machine Learning and Natural Language Processing: Development and Validation Study.一种使用机器学习和自然语言处理对患者投诉进行分类的智能系统:开发与验证研究。
J Med Internet Res. 2025 Jan 8;27:e55721. doi: 10.2196/55721.
5
Can We Geographically Validate a Natural Language Processing Algorithm for Automated Detection of Incidental Durotomy Across Three Independent Cohorts From Two Continents?能否通过来自两大洲的三个独立队列对用于自动检测偶然硬脊膜切开术的自然语言处理算法进行地理验证?
Clin Orthop Relat Res. 2022 Sep 1;480(9):1766-1775. doi: 10.1097/CORR.0000000000002200. Epub 2022 Apr 12.
6
Development of machine learning and natural language processing algorithms for preoperative prediction and automated identification of intraoperative vascular injury in anterior lumbar spine surgery.开发机器学习和自然语言处理算法,用于在前路腰椎手术中进行术前预测和术中血管损伤的自动识别。
Spine J. 2021 Oct;21(10):1635-1642. doi: 10.1016/j.spinee.2020.04.001. Epub 2020 Apr 12.
7
A clinical text classification paradigm using weak supervision and deep representation.一种使用弱监督和深度表示的临床文本分类范式。
BMC Med Inform Decis Mak. 2019 Jan 7;19(1):1. doi: 10.1186/s12911-018-0723-6.
8
DeepBP: Ensemble deep learning strategy for bioactive peptide prediction.DeepBP:用于生物活性肽预测的集成深度学习策略。
BMC Bioinformatics. 2024 Nov 11;25(1):352. doi: 10.1186/s12859-024-05974-5.
9
CAPTURE: Comprehensive anti-cancer peptide predictor with a unique amino acid sequence encoder.CAPTURE:具有独特氨基酸序列编码器的综合抗癌肽预测器。
Comput Biol Med. 2024 Jun;176:108538. doi: 10.1016/j.compbiomed.2024.108538. Epub 2024 May 3.
10
MFP-MFL: Leveraging Graph Attention and Multi-Feature Integration for Superior Multifunctional Bioactive Peptide Prediction.MFP-MFL:利用图注意力和多特征整合实现卓越的多功能生物活性肽预测
Int J Mol Sci. 2025 Feb 4;26(3):1317. doi: 10.3390/ijms26031317.

引用本文的文献

1
MFP-MFL: Leveraging Graph Attention and Multi-Feature Integration for Superior Multifunctional Bioactive Peptide Prediction.MFP-MFL:利用图注意力和多特征整合实现卓越的多功能生物活性肽预测
Int J Mol Sci. 2025 Feb 4;26(3):1317. doi: 10.3390/ijms26031317.
2
DeepBP: Ensemble deep learning strategy for bioactive peptide prediction.DeepBP:用于生物活性肽预测的集成深度学习策略。
BMC Bioinformatics. 2024 Nov 11;25(1):352. doi: 10.1186/s12859-024-05974-5.