Khamis Abdullah M, Essack Magbubah, Gao Xin, Bajic Vladimir B
Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia.
Bioinformatics. 2015 Mar 15;31(6):849-56. doi: 10.1093/bioinformatics/btu738. Epub 2014 Nov 10.
The increased prevalence of multi-drug resistant (MDR) pathogens heightens the need to design new antimicrobial agents. Antimicrobial peptides (AMPs) exhibit broad-spectrum potent activity against MDR pathogens and kills rapidly, thus giving rise to AMPs being recognized as a potential substitute for conventional antibiotics. Designing new AMPs using current in-silico approaches is, however, challenging due to the absence of suitable models, large number of design parameters, testing cycles, production time and cost. To date, AMPs have merely been categorized into families according to their primary sequences, structures and functions. The ability to computationally determine the properties that discriminate AMP families from each other could help in exploring the key characteristics of these families and facilitate the in-silico design of synthetic AMPs.
Here we studied 14 AMP families and sub-families. We selected a specific description of AMP amino acid sequence and identified compositional and physicochemical properties of amino acids that accurately distinguish each AMP family from all other AMPs with an average sensitivity, specificity and precision of 92.88%, 99.86% and 95.96%, respectively. Many of our identified discriminative properties have been shown to be compositional or functional characteristics of the corresponding AMP family in literature. We suggest that these properties could serve as guides for in-silico methods in design of novel synthetic AMPs. The methodology we developed is generic and has a potential to be applied for characterization of any protein family.
多重耐药(MDR)病原体的流行率增加,使得设计新型抗菌剂的需求更为迫切。抗菌肽(AMP)对多重耐药病原体表现出广谱强效活性且杀菌迅速,因此抗菌肽被认为是传统抗生素的潜在替代品。然而,由于缺乏合适的模型、大量的设计参数、测试周期、生产时间和成本,利用当前的计算机模拟方法设计新型抗菌肽具有挑战性。迄今为止,抗菌肽仅根据其一级序列、结构和功能被归类为不同的家族。通过计算确定区分抗菌肽家族的特性,有助于探索这些家族的关键特征,并促进合成抗菌肽的计算机模拟设计。
在此,我们研究了14个抗菌肽家族和亚家族。我们选择了一种对抗菌肽氨基酸序列的特定描述,并确定了氨基酸的组成和理化特性,这些特性能够准确区分每个抗菌肽家族与所有其他抗菌肽,平均灵敏度、特异性和精确度分别为92.88%、99.86%和95.96%。我们确定的许多鉴别特性在文献中已被证明是相应抗菌肽家族的组成或功能特征。我们认为这些特性可以作为计算机模拟方法设计新型合成抗菌肽的指导。我们开发的方法具有通用性,有潜力应用于任何蛋白质家族的表征。