• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的防御素家族和亚家族预测方法的最新进展。

Recent development of machine learning-based methods for the prediction of defensin family and subfamily.

作者信息

Charoenkwan Phasit, Schaduangrat Nalini, Mahmud S M Hasan, Thinnukool Orawit, Shoombuatong Watshara

机构信息

Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai, Thailand, 50200.

Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand, 10700.

出版信息

EXCLI J. 2022 May 5;21:757-771. doi: 10.17179/excli2022-4913. eCollection 2022.

DOI:10.17179/excli2022-4913
PMID:35949489
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9360473/
Abstract

Nearly all living species comprise of host defense peptides called defensins, that are crucial for innate immunity. These peptides work by activating the immune system which kills the microbes directly or indirectly, thus providing protection to the host. Thus far, numerous preclinical and clinical trials for peptide-based drugs are currently being evaluated. Although, experimental methods can help to precisely identify the defensin peptide family and subfamily, these approaches are often time-consuming and cost-ineffective. On the other hand, machine learning (ML) methods are able to effectively employ protein sequence information without the knowledge of a protein's three-dimensional structure, thus highlighting their predictive ability for the large-scale identification. To date, several ML methods have been developed for the identification of the defensin peptide family and subfamily. Therefore, summarizing the advantages and disadvantages of the existing methods is urgently needed in order to provide useful suggestions for the development and improvement of new computational models for the identification of the defensin peptide family and subfamily. With this goal in mind, we first provide a comprehensive survey on a collection of six state-of-the-art computational approaches for predicting the defensin peptide family and subfamily. Herein, we cover different important aspects, including the dataset quality, feature encoding methods, feature selection schemes, ML algorithms, cross-validation methods and web server availability/usability. Moreover, we provide our thoughts on the limitations of existing methods and future perspectives for improving the prediction performance and model interpretability. The insights and suggestions gained from this review are anticipated to serve as a valuable guidance for researchers for the development of more robust and useful predictors.

摘要

几乎所有生物物种都包含被称为防御素的宿主防御肽,这些肽对先天免疫至关重要。这些肽通过激活免疫系统来发挥作用,免疫系统直接或间接地杀死微生物,从而为宿主提供保护。到目前为止,许多基于肽的药物的临床前和临床试验正在进行评估。虽然实验方法有助于精确识别防御素肽家族和亚家族,但这些方法通常既耗时又成本低效。另一方面,机器学习(ML)方法能够在不了解蛋白质三维结构的情况下有效利用蛋白质序列信息,从而突出了它们在大规模识别方面的预测能力。迄今为止,已经开发了几种用于识别防御素肽家族和亚家族的ML方法。因此,迫切需要总结现有方法的优缺点,以便为开发和改进用于识别防御素肽家族和亚家族的新计算模型提供有用的建议。出于这个目标,我们首先对六种用于预测防御素肽家族和亚家族的最先进计算方法进行了全面综述。在此,我们涵盖了不同的重要方面,包括数据集质量、特征编码方法、特征选择方案、ML算法、交叉验证方法以及网络服务器的可用性/易用性。此外,我们对现有方法的局限性以及提高预测性能和模型可解释性的未来前景提出了自己的看法。预计从本综述中获得的见解和建议将为研究人员开发更强大、更有用的预测器提供有价值的指导。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/78fe11df9a68/EXCLI-21-757-g-002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/d7a7b743dc64/EXCLI-21-757-t-001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/f46741ee86f2/EXCLI-21-757-t-002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/3e21d4aea53f/EXCLI-21-757-t-003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/10fa61b058b7/EXCLI-21-757-t-004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/39e96298604b/EXCLI-21-757-t-005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/caf336e7e113/EXCLI-21-757-g-001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/78fe11df9a68/EXCLI-21-757-g-002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/d7a7b743dc64/EXCLI-21-757-t-001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/f46741ee86f2/EXCLI-21-757-t-002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/3e21d4aea53f/EXCLI-21-757-t-003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/10fa61b058b7/EXCLI-21-757-t-004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/39e96298604b/EXCLI-21-757-t-005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/caf336e7e113/EXCLI-21-757-g-001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ebb/9360473/78fe11df9a68/EXCLI-21-757-g-002.jpg

相似文献

1
Recent development of machine learning-based methods for the prediction of defensin family and subfamily.基于机器学习的防御素家族和亚家族预测方法的最新进展。
EXCLI J. 2022 May 5;21:757-771. doi: 10.17179/excli2022-4913. eCollection 2022.
2
Empirical comparison and analysis of machine learning-based predictors for predicting and analyzing of thermophilic proteins.用于预测和分析嗜热蛋白的基于机器学习的预测器的实证比较与分析
EXCLI J. 2022 Mar 2;21:554-570. doi: 10.17179/excli2022-4723. eCollection 2022.
3
Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification.大规模比较综述与评估抗癌肽鉴定的计算方法。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa312.
4
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.全面综述和评估基于 RNA 序列预测 RNA 转录后修饰位点的计算方法。
Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112.
5
iDPF-PseRAAAC: A Web-Server for Identifying the Defensin Peptide Family and Subfamily Using Pseudo Reduced Amino Acid Alphabet Composition.iDPF-PseRAAAC:一个使用伪简化氨基酸字母组成来识别防御素肽家族和亚家族的网络服务器。
PLoS One. 2015 Dec 29;10(12):e0145541. doi: 10.1371/journal.pone.0145541. eCollection 2015.
6
Defensinpred: defensin and defensin types prediction server.防御素预测工具:防御素及防御素类型预测服务器。
Protein Pept Lett. 2012 Dec;19(12):1318-23. doi: 10.2174/092986612803521594.
7
In Silico Approaches for the Prediction and Analysis of Antiviral Peptides: A Review.计算机方法在抗病毒肽预测和分析中的应用:综述
Curr Pharm Des. 2021;27(18):2180-2188. doi: 10.2174/1381612826666201102105827.
8
iDEF-PseRAAC: Identifying the Defensin Peptide by Using Reduced Amino Acid Composition Descriptor.iDEF-PseRAAC:利用简化氨基酸组成描述符鉴定防御素肽
Evol Bioinform Online. 2019 Jul 31;15:1176934319867088. doi: 10.1177/1176934319867088. eCollection 2019.
9
Empirical comparison and analysis of machine learning-based approaches for druggable protein identification.基于机器学习的可成药蛋白识别方法的实证比较与分析
EXCLI J. 2023 Aug 29;22:915-927. doi: 10.17179/excli2023-6410. eCollection 2023.
10
Using reduced amino acid composition to predict defensin family and subfamily: Integrating similarity measure and structural alphabet.使用简化的氨基酸组成预测防御素家族和亚家族:整合相似性度量和结构字母。
Peptides. 2009 Oct;30(10):1788-93. doi: 10.1016/j.peptides.2009.06.032. Epub 2009 Jul 8.

本文引用的文献

1
Empirical comparison and analysis of machine learning-based predictors for predicting and analyzing of thermophilic proteins.用于预测和分析嗜热蛋白的基于机器学习的预测器的实证比较与分析
EXCLI J. 2022 Mar 2;21:554-570. doi: 10.17179/excli2022-4723. eCollection 2022.
2
SCORPION is a stacking-based ensemble learning framework for accurate prediction of phage virion proteins.SCORPION 是一个基于堆叠的集成学习框架,用于准确预测噬菌体病毒蛋白。
Sci Rep. 2022 Mar 8;12(1):4106. doi: 10.1038/s41598-022-08173-5.
3
Large-scale comparative review and assessment of computational methods for phage virion proteins identification.
噬菌体病毒粒子蛋白质鉴定计算方法的大规模比较综述与评估
EXCLI J. 2022 Jan 3;21:11-29. doi: 10.17179/excli2021-4411. eCollection 2022.
4
StackDPPIV: A novel computational approach for accurate prediction of dipeptidyl peptidase IV (DPP-IV) inhibitory peptides.StackDPPIV:一种用于准确预测二肽基肽酶 IV(DPP-IV)抑制肽的新型计算方法。
Methods. 2022 Aug;204:189-198. doi: 10.1016/j.ymeth.2021.12.001. Epub 2021 Dec 6.
5
Tool for Predicting, Scanning, and Designing Defensins.用于预测、扫描和设计防御素的工具。
Front Immunol. 2021 Nov 22;12:780610. doi: 10.3389/fimmu.2021.780610. eCollection 2021.
6
Review and Comparative Analysis of Machine Learning-based Predictors for Predicting and Analyzing Anti-angiogenic Peptides.基于机器学习的抗血管生成肽预测和分析预测因子的回顾与比较分析。
Curr Med Chem. 2022;29(5):849-864. doi: 10.2174/0929867328666210810145806.
7
Deep-4mCW2V: A sequence-based predictor to identify N4-methylcytosine sites in Escherichia coli.Deep-4mCW2V:一种基于序列的预测工具,用于鉴定大肠杆菌中的 N4-甲基胞嘧啶位点。
Methods. 2022 Jul;203:558-563. doi: 10.1016/j.ymeth.2021.07.011. Epub 2021 Aug 2.
8
Porpoise: a new approach for accurate prediction of RNA pseudouridine sites.海豚:一种准确预测 RNA 假尿嘧啶位点的新方法。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab245.
9
Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method.利用新型灵活评分卡方法提高肽类抗癌活性的预测和表征。
Sci Rep. 2021 Feb 4;11(1):3017. doi: 10.1038/s41598-021-82513-9.
10
Large-scale comparative review and assessment of computational methods for anti-cancer peptide identification.大规模比较综述与评估抗癌肽鉴定的计算方法。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa312.