Teimouri Hamid, Medvedeva Angela, Kolomeisky Anatoly B
Department of Chemistry, Rice University, Houston, Texas 77005, United States.
Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, United States.
J Chem Inf Model. 2023 Mar 27;63(6):1723-1733. doi: 10.1021/acs.jcim.2c01551. Epub 2023 Mar 13.
There are several classes of short peptide molecules, known as antimicrobial peptides (AMPs), which are produced during the immune responses of living organisms against various infections. In recent years, substantial progress has been achieved in applying machine-learning methods to predict the activities of AMPs against bacteria. In most investigated cases, however, the outcome is not bacterium-specific since the specific features of bacteria, such as chemical composition and structure of membranes, are not considered. To overcome this problem, we developed a new computational approach that allowed us to train several supervised machine-learning models using a specific set of data associated with peptides targeting bacteria. LASSO regression and Support Vector Machine techniques have been utilized to select, among more than 1500 physicochemical descriptors, the most important features that can be used to classify a peptide as antimicrobial or ineffective against . We then performed the classification of active versus inactive AMPs using the Support Vector classifiers, Logistic Regression, and Random Forest methods. This computational study allows us to make recommendations of how to design more efficient antibacterial drug therapies.
有几类短肽分子,称为抗菌肽(AMPs),它们是在生物体针对各种感染的免疫反应过程中产生的。近年来,在应用机器学习方法预测抗菌肽对细菌的活性方面取得了重大进展。然而,在大多数研究案例中,结果并非针对特定细菌,因为未考虑细菌的特定特征,如膜的化学成分和结构。为了克服这个问题,我们开发了一种新的计算方法,使我们能够使用与靶向细菌的肽相关的特定数据集来训练几个有监督的机器学习模型。在1500多个物理化学描述符中,利用套索回归和支持向量机技术选择可用于将肽分类为抗菌肽或无抗菌活性的最重要特征。然后,我们使用支持向量分类器、逻辑回归和随机森林方法对抗菌肽的活性与非活性进行分类。这项计算研究使我们能够就如何设计更有效的抗菌药物疗法提出建议。