Nair Vinay, Dutta Monalisa, Manian Sowmya S, S Ramya Kumari, Jayaraman Valadi K
Center for Development of Advanced Computing, Pune, India.
Bioinformation. 2013 May 25;9(9):481-4. doi: 10.6026/97320630009481. Print 2013.
Penicillin-Binding Proteins are peptidases that play an important role in cell-wall biogenesis in bacteria and thus maintaining bacterial infections. A wide class of β-lactam drugs are known to act on these proteins and inhibit bacterial infections by disrupting the cell-wall biogenesis pathway. Penicillin-Binding proteins have recently gained importance with the increase in the number of multi-drug resistant bacteria. In this work, we have collected a dataset of over 700 Penicillin-Binding and non-Penicillin Binding Proteins and extracted various sequence-related features. We then created models to classify the proteins into Penicillin-Binding and non-binding using supervised machine learning algorithms such as Support Vector Machines and Random Forest. We obtain a good classification performance for both the models using both the methods.
青霉素结合蛋白是肽酶,在细菌细胞壁生物合成中起重要作用,从而维持细菌感染。已知一大类β-内酰胺药物作用于这些蛋白质,并通过破坏细胞壁生物合成途径来抑制细菌感染。随着多重耐药细菌数量的增加,青霉素结合蛋白最近变得越发重要。在这项工作中,我们收集了一个包含700多种青霉素结合蛋白和非青霉素结合蛋白的数据集,并提取了各种与序列相关的特征。然后,我们使用支持向量机和随机森林等监督机器学习算法创建模型,将蛋白质分类为青霉素结合蛋白和非结合蛋白。我们使用这两种方法对两个模型都获得了良好的分类性能。