iAMAP-SCM：一种利用二肽估计倾向得分大规模鉴定抗疟肽的新型计算工具。

iAMAP-SCM: A Novel Computational Tool for Large-Scale Identification of Antimalarial Peptides Using Estimated Propensity Scores of Dipeptides.

作者信息

Charoenkwan Phasit, Schaduangrat Nalini, Lio Pietro, Moni Mohammad Ali, Chumnanpuen Pramote, Shoombuatong Watshara

机构信息

Modern Management and Information Technology, College of Arts, Media and Technology, Chiang Mai University, Chiang Mai50200, Thailand.

Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok10700, Thailand.

出版信息

ACS Omega. 2022 Nov 2;7(45):41082-41095. doi: 10.1021/acsomega.2c04465. eCollection 2022 Nov 15.

DOI:10.1021/acsomega.2c04465

PMID:36406571

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9670693/

Abstract

Antimalarial peptides (AMAPs) varying in length, amino acid composition, charge, conformational structure, hydrophobicity, and amphipathicity reflect their diversity in antimalarial mechanisms. Due to the worldwide major health problem concerning antimicrobial resistance, these peptides possess great therapeutic value owing to their low incidences of drug resistance as compared to conventional antibiotics. Although well-known experimental methods are able to precisely determine the antimalarial activity of peptides, these methods are still time-consuming and costly. Thus, machine learning (ML)-based methods that are capable of identifying AMAPs rapidly by using only sequence information would be beneficial for the high-throughput identification of AMAPs. In this study, we propose the first computational model (termed iAMAP-SCM) for the large-scale identification and characterization of peptides with antimalarial activity by using only sequence information. Specifically, we employed an interpretable scoring card method (SCM) to develop iAMAP-SCM and estimate propensities of 20 amino acids and 400 dipeptides to be AMAPs in a supervised manner. Experimental results showed that iAMAP-SCM could achieve a maximum accuracy and Matthew's coefficient correlation of 0.957 and 0.834, respectively, on the independent test dataset. In addition, SCM-derived propensities of 20 amino acids and selected physicochemical properties were used to provide an understanding of the functional mechanisms of AMAPs. Finally, a user-friendly online computational platform of iAMAP-SCM is publicly available at http://pmlabstack.pythonanywhere.com/iAMAP-SCM. The iAMAP-SCM predictor is anticipated to assist experimental scientists in the high-throughput identification of potential AMAP candidates for the treatment of malaria and other clinical applications.

摘要

抗疟肽（AMAPs）在长度、氨基酸组成、电荷、构象结构、疏水性和两亲性方面存在差异，这反映了它们在抗疟机制上的多样性。由于全球范围内与抗菌药物耐药性相关的重大健康问题，与传统抗生素相比，这些肽具有较低的耐药发生率，因此具有很大的治疗价值。尽管已知的实验方法能够精确测定肽的抗疟活性，但这些方法仍然耗时且成本高昂。因此，基于机器学习（ML）的方法能够仅通过序列信息快速识别AMAPs，这将有助于高通量识别AMAPs。在本研究中，我们提出了第一个计算模型（称为iAMAP-SCM），用于仅通过序列信息大规模识别和表征具有抗疟活性的肽。具体而言，我们采用了一种可解释的评分卡方法（SCM）来开发iAMAP-SCM，并以监督的方式估计20种氨基酸和400种二肽成为AMAPs的倾向。实验结果表明，iAMAP-SCM在独立测试数据集上分别能够达到最大准确率0.957和马修斯系数相关性0.834。此外，SCM得出的20种氨基酸的倾向和选定的物理化学性质被用于理解AMAPs的功能机制。最后，一个用户友好的iAMAP-SCM在线计算平台可在http://pmlabstack.pythonanywhere.com/iAMAP-SCM上公开获取。预计iAMAP-SCM预测器将协助实验科学家高通量识别潜在的AMAP候选物，用于治疗疟疾和其他临床应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/93bd/9670693/b22980b7f994/ao2c04465_0002.jpg

相似文献

iAMAP-SCM: A Novel Computational Tool for Large-Scale Identification of Antimalarial Peptides Using Estimated Propensity Scores of Dipeptides.iAMAP-SCM：一种利用二肽估计倾向得分大规模鉴定抗疟肽的新型计算工具。

ACS Omega. 2022 Nov 2;7(45):41082-41095. doi: 10.1021/acsomega.2c04465. eCollection 2022 Nov 15.

iBitter-SCM: Identification and characterization of bitter peptides using a scoring card method with propensity scores of dipeptides.iBitter-SCM：利用二肽倾向评分的评分卡方法鉴定和表征苦味肽。

Genomics. 2020 Jul;112(4):2813-2822. doi: 10.1016/j.ygeno.2020.03.019. Epub 2020 Mar 28.

iUmami-SCM: A Novel Sequence-Based Predictor for Prediction and Analysis of Umami Peptides Using a Scoring Card Method with Propensity Scores of Dipeptides.iUmami-SCM：一种新颖的基于序列的预测器，用于使用基于二肽倾向分数的评分卡方法预测和分析鲜味肽。

J Chem Inf Model. 2020 Dec 28;60(12):6666-6678. doi: 10.1021/acs.jcim.0c00707. Epub 2020 Oct 23.

SCMRSA: a New Approach for Identifying and Analyzing Anti-MRSA Peptides Using Estimated Propensity Scores of Dipeptides.SCMRSA：一种利用二肽估计倾向得分鉴定和分析抗耐甲氧西林金黄色葡萄球菌肽的新方法。

ACS Omega. 2022 Sep 1;7(36):32653-32664. doi: 10.1021/acsomega.2c04305. eCollection 2022 Sep 13.

iAMY-SCM: Improved prediction and analysis of amyloid proteins using a scoring card method with propensity scores of dipeptides.iAMY-SCM：使用具有二肽倾向得分的评分卡方法改进淀粉样蛋白的预测与分析

Genomics. 2021 Jan;113(1 Pt 2):689-698. doi: 10.1016/j.ygeno.2020.09.065. Epub 2020 Oct 2.

iDPPIV-SCM: A Sequence-Based Predictor for Identifying and Analyzing Dipeptidyl Peptidase IV (DPP-IV) Inhibitory Peptides Using a Scoring Card Method.iDPPIV-SCM：一种基于序列的预测器，用于使用评分卡方法识别和分析二肽基肽酶 IV（DPP-IV）抑制肽。

J Proteome Res. 2020 Oct 2;19(10):4125-4136. doi: 10.1021/acs.jproteome.0c00590. Epub 2020 Sep 19.

A novel sequence-based predictor for identifying and characterizing thermophilic proteins using estimated propensity scores of dipeptides.一种新的基于序列的预测器，用于使用二肽的估计倾向分数来识别和描述嗜热蛋白。

Sci Rep. 2021 Dec 10;11(1):23782. doi: 10.1038/s41598-021-03293-w.

Improved prediction and characterization of blood-brain barrier penetrating peptides using estimated propensity scores of dipeptides.使用二肽的预测倾向评分提高血脑屏障穿透肽的预测和表征。

J Comput Aided Mol Des. 2022 Nov;36(11):781-796. doi: 10.1007/s10822-022-00476-z. Epub 2022 Oct 26.

Prediction and analysis of protein solubility using a novel scoring card method with dipeptide composition.利用新型评分卡方法和二肽组成预测和分析蛋白质溶解度。

BMC Bioinformatics. 2012;13 Suppl 17(Suppl 17):S3. doi: 10.1186/1471-2105-13-S17-S3. Epub 2012 Dec 13.

PSRTTCA: A new approach for improving the prediction and characterization of tumor T cell antigens using propensity score representation learning.PSRTTCA：一种使用倾向评分表示学习改进肿瘤T细胞抗原预测和表征的新方法。

Comput Biol Med. 2023 Jan;152:106368. doi: 10.1016/j.compbiomed.2022.106368. Epub 2022 Nov 26.

引用本文的文献

PSR-MAPMS: A new approach for the interpretable prediction of myelin autoantigenic peptides in multiple sclerosis using multi-source propensity scores.PSR-MAPMS：一种使用多源倾向评分对多发性硬化症中髓鞘自身抗原肽进行可解释预测的新方法。

Protein Sci. 2025 Aug;34(8):e70010. doi: 10.1002/pro.70010.

Progress and challenges for the application of machine learning for neglected tropical diseases.机器学习在 neglected tropical diseases 中的应用进展与挑战。（注：“neglected tropical diseases”直译为“被忽视的热带病” ）

F1000Res. 2025 May 20;12:287. doi: 10.12688/f1000research.129064.2. eCollection 2023.

Advancing the Accuracy of Anti-MRSA Peptide Prediction Through Integrating Multi-Source Protein Language Models.通过整合多源蛋白质语言模型提高抗耐甲氧西林金黄色葡萄球菌肽预测的准确性

Interdiscip Sci. 2025 Mar 11. doi: 10.1007/s12539-025-00696-5.

Advancing the accuracy of tyrosinase inhibitory peptides prediction via a multiview feature fusion strategy.通过多视图特征融合策略提高酪氨酸酶抑制肽预测的准确性。

Sci Rep. 2025 Feb 8;15(1):4762. doi: 10.1038/s41598-024-81807-y.

AutoPeptideML: a study on how to build more trustworthy peptide bioactivity predictors.AutoPeptideML：关于如何构建更可信的肽生物活性预测器的研究。

Bioinformatics. 2024 Sep 2;40(9). doi: 10.1093/bioinformatics/btae555.

Tackling the Antimicrobial Resistance "Pandemic" with Machine Learning Tools: A Summary of Available Evidence.使用机器学习工具应对抗微生物药物耐药性“大流行”：现有证据综述

Microorganisms. 2024 Apr 23;12(5):842. doi: 10.3390/microorganisms12050842.

本文引用的文献

Antiplasmodial Cyclodecapeptides from Tyrothricin Share a Target with Chloroquine.来自短杆菌肽的抗疟环十肽与氯喹有共同靶点。

Antibiotics (Basel). 2022 Jun 14;11(6):801. doi: 10.3390/antibiotics11060801.

SCMTHP: A New Approach for Identifying and Characterizing of Tumor-Homing Peptides Using Estimated Propensity Scores of Amino Acids.SCMTHP：一种利用氨基酸估计倾向得分来鉴定和表征肿瘤归巢肽的新方法。

Pharmaceutics. 2022 Jan 4;14(1):122. doi: 10.3390/pharmaceutics14010122.

Sci Rep. 2021 Dec 10;11(1):23782. doi: 10.1038/s41598-021-03293-w.

Spiers Memorial Lecture: Analysis and design of membrane-interactive peptides.斯皮尔斯纪念讲座：膜相互作用肽的分析与设计。

Faraday Discuss. 2021 Dec 24;232(0):9-48. doi: 10.1039/d1fd00061f.

Novel Dipeptides Bearing Sulfonamide as Antimalarial and Antitrypanosomal Agents: Synthesis and Molecular Docking.新型含磺胺的二肽类抗疟和抗锥虫药物的合成及分子对接研究

Med Chem. 2022;18(3):394-405. doi: 10.2174/1573406417666210604101201.

iAMP-CA2L: a new CNN-BiLSTM-SVM classifier based on cellular automata image for identifying antimicrobial peptides and their functional types.iAMP-CA2L：一种基于细胞自动机图像的新型 CNN-BiLSTM-SVM 分类器，用于识别抗菌肽及其功能类型。

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab209.

StackIL6: a stacking ensemble model for improving the prediction of IL-6 inducing peptides.StackIL6：一种用于提高白细胞介素 6 诱导肽预测能力的堆叠集成模型。

Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab172.

BERT4Bitter: a bidirectional encoder representations from transformers (BERT)-based model for improving the prediction of bitter peptides.BERT4Bitter：一种基于变换器双向编码器表征（BERT）的模型，用于改进苦味肽的预测。

Bioinformatics. 2021 Sep 9;37(17):2556-2562. doi: 10.1093/bioinformatics/btab133.

A sequence-based deep learning approach to predict CTCF-mediated chromatin loop.基于序列的深度学习方法预测 CTCF 介导的染色质环。

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbab031.

Improved prediction and characterization of anticancer activities of peptides using a novel flexible scoring card method.利用新型灵活评分卡方法提高肽类抗癌活性的预测和表征。

Sci Rep. 2021 Feb 4;11(1):3017. doi: 10.1038/s41598-021-82513-9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

iAMAP-SCM：一种利用二肽估计倾向得分大规模鉴定抗疟肽的新型计算工具。

iAMAP-SCM: A Novel Computational Tool for Large-Scale Identification of Antimalarial Peptides Using Estimated Propensity Scores of Dipeptides.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献