Biomedical Informatics Research Laboratory, Department of Computer and Information Sciences (DCIS), Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan.
Biomedical Informatics Research Laboratory, Department of Computer and Information Sciences (DCIS), Pakistan Institute of Engineering and Applied Sciences (PIEAS), Nilore, Islamabad, Pakistan.
Comput Biol Med. 2019 Apr;107:172-181. doi: 10.1016/j.compbiomed.2019.02.018. Epub 2019 Feb 25.
Due to increase in antibiotic resistance in recent years, development of efficient and accurate techniques for discovery and design of biologically active peptides such as antimicrobial peptides (AMPs) has become essential. The screening of natural and synthetic AMPs in the wet lab is a challenge due to time and cost involved in such experiments. Bioinformatics methods can be used to speed up discovery and design of antimicrobial peptides by limiting the wet-lab search to promising peptide sequences. However, most such tools are typically limited to the prediction of whether a peptide exhibits antimicrobial activity or not and they do not identify the exact type of the biological activities of these peptides. In this work, we have designed a machine learning based model called AMAP for predicting biological activity of peptides with a specialized focus on antimicrobial activity prediction. AMAP used multi-label classification to predict 14 different types of biological functions of a given peptide sequence with improved accuracy in comparison to existing state of the art techniques. We have performed stringent performance analyses of the proposed method. In addition to cross-validation and performance comparison with existing AMP predictors, AMAP has also been benchmarked on recently published experimentally verified peptides that were not a part of our training set. We have also analyzed features used in this work and our analysis shows that the proposed predictor can generalize well in predicting biological activity of novel peptide sequences. A webserver of the proposed method is available at the URL: http://faculty.pieas.edu.pk/fayyaz/software.html#AMAP.
由于近年来抗生素耐药性的增加,开发高效、准确的技术来发现和设计具有生物活性的肽,如抗菌肽(AMPs),已变得至关重要。由于涉及到时间和成本,在湿实验室中筛选天然和合成的 AMPs 是一个挑战。生物信息学方法可用于通过将湿实验室搜索限制在有前途的肽序列上来加速抗菌肽的发现和设计。然而,大多数此类工具通常仅限于预测肽是否具有抗菌活性,而不能确定这些肽的具体生物活性类型。在这项工作中,我们设计了一种基于机器学习的模型,称为 AMAP,用于预测肽的生物活性,特别关注抗菌活性预测。AMAP 使用多标签分类来预测给定肽序列的 14 种不同类型的生物学功能,与现有最先进的技术相比,准确性得到了提高。我们对所提出的方法进行了严格的性能分析。除了与现有的 AMP 预测器进行交叉验证和性能比较外,AMAP 还在最近发表的实验验证肽上进行了基准测试,这些肽不在我们的训练集中。我们还分析了本工作中使用的特征,我们的分析表明,所提出的预测器可以很好地预测新型肽序列的生物活性。该方法的网络服务器可在以下网址获得:http://faculty.pieas.edu.pk/fayyaz/software.html#AMAP。