基于结构的机器学习策略在抗菌肽发现中的应用。

Structure-aware machine learning strategies for antimicrobial peptide discovery.

机构信息

Department of Biotechnology and Biochemistry, Center for Research and Advanced Studies of the National Polytechnic Institute (CINVESTAV-IPN), Irapuato Unit, 36824, Irapuato, Guanajuato, Mexico.

出版信息

Sci Rep. 2024 May 25;14(1):11995. doi: 10.1038/s41598-024-62419-y.

DOI:10.1038/s41598-024-62419-y

PMID:38796582

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11127937/

Abstract

Machine learning models are revolutionizing our approaches to discovering and designing bioactive peptides. These models often need protein structure awareness, as they heavily rely on sequential data. The models excel at identifying sequences of a particular biological nature or activity, but they frequently fail to comprehend their intricate mechanism(s) of action. To solve two problems at once, we studied the mechanisms of action and structural landscape of antimicrobial peptides as (i) membrane-disrupting peptides, (ii) membrane-penetrating peptides, and (iii) protein-binding peptides. By analyzing critical features such as dipeptides and physicochemical descriptors, we developed models with high accuracy (86-88%) in predicting these categories. However, our initial models (1.0 and 2.0) exhibited a bias towards α-helical and coiled structures, influencing predictions. To address this structural bias, we implemented subset selection and data reduction strategies. The former gave three structure-specific models for peptides likely to fold into α-helices (models 1.1 and 2.1), coils (1.3 and 2.3), or mixed structures (1.4 and 2.4). The latter depleted over-represented structures, leading to structure-agnostic predictors 1.5 and 2.5. Additionally, our research highlights the sensitivity of important features to different structure classes across models.

摘要

机器学习模型正在彻底改变我们发现和设计生物活性肽的方法。这些模型通常需要蛋白质结构意识，因为它们严重依赖于序列数据。这些模型擅长识别具有特定生物学性质或活性的序列，但它们经常无法理解其复杂的作用机制。为了同时解决两个问题，我们研究了抗菌肽的作用机制和结构景观，包括 (i) 破坏膜的肽、(ii) 穿透膜的肽和 (iii) 与蛋白质结合的肽。通过分析二肽和物理化学描述符等关键特征，我们开发了具有高准确性（86-88%）的模型，用于预测这些类别。然而，我们的初始模型（1.0 和 2.0）表现出对α-螺旋和卷曲结构的偏见，这影响了预测。为了解决这种结构偏差，我们实施了子集选择和数据缩减策略。前者为可能折叠成α-螺旋的肽（模型 1.1 和 2.1）、卷曲结构的肽（1.3 和 2.3）或混合结构的肽（1.4 和 2.4）提供了三个结构特异性模型。后者去除了过度代表的结构，导致了无结构预测器 1.5 和 2.5。此外，我们的研究还强调了重要特征对不同结构类别的模型的敏感性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2b08/11127937/f33313bfec78/41598_2024_62419_Fig1_HTML.jpg

相似文献

Structure-aware machine learning strategies for antimicrobial peptide discovery.基于结构的机器学习策略在抗菌肽发现中的应用。

Sci Rep. 2024 May 25;14(1):11995. doi: 10.1038/s41598-024-62419-y.

Machine Learning Prediction of Antimicrobial Peptides.机器学习预测抗菌肽。

Methods Mol Biol. 2022;2405:1-37. doi: 10.1007/978-1-0716-1855-4_1.

Machine learning-enabled discovery and design of membrane-active peptides.基于机器学习的膜活性肽的发现和设计。

Bioorg Med Chem. 2018 Jun 1;26(10):2708-2718. doi: 10.1016/j.bmc.2017.07.012. Epub 2017 Jul 8.

AniAMPpred: artificial intelligence guided discovery of novel antimicrobial peptides in animal kingdom.AniAMPpred：人工智能引导的动物王国中新型抗菌肽的发现。

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab242.

Bacteria-Specific Feature Selection for Enhanced Antimicrobial Peptide Activity Predictions Using Machine-Learning Methods.使用机器学习方法进行细菌特异性特征选择以增强抗菌肽活性预测

J Chem Inf Model. 2023 Mar 27;63(6):1723-1733. doi: 10.1021/acs.jcim.2c01551. Epub 2023 Mar 13.

Feature importance of machine learning prediction models shows structurally active part and important physicochemical features in drug design.机器学习预测模型的特征重要性展示了药物设计中结构活跃部分和重要的物理化学特征。

Drug Metab Pharmacokinet. 2021 Aug;39:100401. doi: 10.1016/j.dmpk.2021.100401. Epub 2021 May 3.

AMPDeep: hemolytic activity prediction of antimicrobial peptides using transfer learning.AMPDeeP：基于迁移学习的抗菌肽溶血活性预测。

BMC Bioinformatics. 2022 Sep 26;23(1):389. doi: 10.1186/s12859-022-04952-z.

Machine learning assisted design of highly active peptides for drug discovery.用于药物发现的高活性肽的机器学习辅助设计。

PLoS Comput Biol. 2015 Apr 7;11(4):e1004074. doi: 10.1371/journal.pcbi.1004074. eCollection 2015 Apr.

Understanding a protein fold: The physics, chemistry, and biology of α-helical coiled coils.理解蛋白质折叠：α-螺旋卷曲螺旋的物理、化学和生物学。

J Biol Chem. 2023 Apr;299(4):104579. doi: 10.1016/j.jbc.2023.104579. Epub 2023 Mar 5.

What can AlphaFold do for antimicrobial amyloids?AlphaFold 能对抗菌性淀粉样蛋白做些什么？

Proteins. 2024 Feb;92(2):265-281. doi: 10.1002/prot.26618. Epub 2023 Oct 19.

引用本文的文献

A unified model of transient poration induced by antimicrobial peptides.抗菌肽诱导瞬时孔形成的统一模型。

Proc Natl Acad Sci U S A. 2025 Sep 2;122(35):e2510294122. doi: 10.1073/pnas.2510294122. Epub 2025 Aug 29.

Antimicrobial peptides: structure, functions and translational applications.抗菌肽：结构、功能及转化应用

Nat Rev Microbiol. 2025 Jul 11. doi: 10.1038/s41579-025-01200-y.

Molecular Modelling in Bioactive Peptide Discovery and Characterisation.生物活性肽发现与表征中的分子建模

Biomolecules. 2025 Apr 3;15(4):524. doi: 10.3390/biom15040524.

Immunomodulation in Non-traditional Therapies for Methicillin-resistant Staphylococcus aureus (MRSA) Management.非传统疗法治疗耐甲氧西林金黄色葡萄球菌（MRSA）管理中的免疫调节。

Curr Microbiol. 2024 Sep 6;81(10):346. doi: 10.1007/s00284-024-03875-7.

Can large language models predict antimicrobial peptide activity and toxicity?大语言模型能预测抗菌肽的活性和毒性吗？

RSC Med Chem. 2024 Apr 23;15(6):2030-2036. doi: 10.1039/d4md00159a. eCollection 2024 Jun 19.

本文引用的文献

Accelerating the Discovery and Design of Antimicrobial Peptides with Artificial Intelligence.利用人工智能加速抗菌肽的发现和设计。

Methods Mol Biol. 2024;2714:329-352. doi: 10.1007/978-1-0716-3441-7_18.

Geometric deep learning as a potential tool for antimicrobial peptide prediction.几何深度学习作为抗菌肽预测的潜在工具。

Front Bioinform. 2023 Jul 13;3:1216362. doi: 10.3389/fbinf.2023.1216362. eCollection 2023.

Recent Advances in Machine Learning-Based Models for Prediction of Antiviral Peptides.基于机器学习的抗病毒肽预测模型的最新进展

Arch Comput Methods Eng. 2023 Apr 29:1-12. doi: 10.1007/s11831-023-09933-w.

Improving de novo protein binder design with deep learning.利用深度学习改进从头设计的蛋白质结合物。

Nat Commun. 2023 May 6;14(1):2625. doi: 10.1038/s41467-023-38328-5.

Deconstructing the Potency and Cell-Line Selectivity of Membranolytic Anticancer Peptides.剖析膜溶抗肿瘤肽的效力和细胞系选择性。

Chembiochem. 2023 Jul 17;24(14):e202300058. doi: 10.1002/cbic.202300058. Epub 2023 Jun 20.

Large language models generate functional protein sequences across diverse families.大型语言模型可生成不同家族的功能性蛋白质序列。

Nat Biotechnol. 2023 Aug;41(8):1099-1106. doi: 10.1038/s41587-022-01618-2. Epub 2023 Jan 26.

Computational and artificial intelligence-based methods for antibody development.基于计算和人工智能的抗体开发方法。

Trends Pharmacol Sci. 2023 Mar;44(3):175-189. doi: 10.1016/j.tips.2022.12.005. Epub 2023 Jan 18.

Machine learning methods for protein-protein binding affinity prediction in protein design.蛋白质设计中用于蛋白质-蛋白质结合亲和力预测的机器学习方法。

Front Bioinform. 2022 Dec 16;2:1065703. doi: 10.3389/fbinf.2022.1065703. eCollection 2022.

Machine Learning Guided Discovery of Non-Hemolytic Membrane Disruptive Anticancer Peptides.机器学习指导下的非溶血膜破坏型抗癌肽的发现。

ChemMedChem. 2022 Sep 5;17(17):e202200291. doi: 10.1002/cmdc.202200291. Epub 2022 Aug 5.

ColabFold: making protein folding accessible to all.ColabFold：让蛋白质折叠变得人人可用。

Nat Methods. 2022 Jun;19(6):679-682. doi: 10.1038/s41592-022-01488-1. Epub 2022 May 30.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于结构的机器学习策略在抗菌肽发现中的应用。

Structure-aware machine learning strategies for antimicrobial peptide discovery.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献