State Key Laboratory of Chemical Resource Engineering, Department of Pharmaceutical Engineering, Beijing University of Chemical Technology, P.O. Box 53, 15 BeiSanHuan East Road, Beijing, 100029, People's Republic of China.
Mol Divers. 2017 Aug;21(3):661-675. doi: 10.1007/s11030-017-9743-x. Epub 2017 May 8.
Human microsomal prostaglandin [Formula: see text] synthase (mPGES)-1 is a promising drug target for inflammation and other diseases with inflammatory symptoms. In this work, we built classification models which were able to classify mPGES-1 inhibitors into two groups: highly active inhibitors and weakly active inhibitors. A dataset of 1910 mPGES-1 inhibitors was separated into a training set and a test set by two methods, by a Kohonen's self-organizing map or by random selection. The molecules were represented by different types of fingerprint descriptors including MACCS keys (MACCS), CDK fingerprints, Estate fingerprints, PubChem fingerprints, substructure fingerprints and 2D atom pairs fingerprint. First, we used a support vector machine (SVM) to build twelve models with six types of fingerprints and found that MACCS had some advantage over the other fingerprints in modeling. Next, we used naïve Bayes (NB), random forest (RF) and multilayer perceptron (MLP) methods to build six models with MACCS only and found that models using RF and MLP methods were better than NB. Finally, all the models with MACCS keys were used to make predictions on an external test set of 41 compounds. In summary, the models built with MACCS keys and using SVM, RF and MLP methods show good prediction performance on the test sets and the external test set. Furthermore, we made a structure-activity relationship analysis between mPGES-1 and its inhibitors based on the information gain of fingerprints and could pinpoint some key functional groups for mPGES-1 activity. It was found that highly active inhibitors usually contained an amide group, an aromatic ring or a nitrogen heterocyclic ring, and several heteroatoms substituents such as fluorine and chlorine. The carboxyl group and sulfur atom groups mainly appeared in weakly active inhibitors.
人微粒体前列腺素[公式:见正文]合酶(mPGES-1)是炎症和其他具有炎症症状的疾病的有希望的药物靶标。在这项工作中,我们构建了分类模型,能够将 mPGES-1 抑制剂分为两组:高活性抑制剂和低活性抑制剂。通过两种方法,即 Kohonen 自组织映射或随机选择,将 1910 个 mPGES-1 抑制剂数据集分为训练集和测试集。分子由不同类型的指纹描述符表示,包括 MACCS 键(MACCS)、CDK 指纹、Estate 指纹、PubChem 指纹、子结构指纹和 2D 原子对指纹。首先,我们使用支持向量机(SVM)构建了十二个模型,使用了六种指纹,并发现 MACCS 在建模方面比其他指纹具有一些优势。接下来,我们使用朴素贝叶斯(NB)、随机森林(RF)和多层感知器(MLP)方法构建了仅使用 MACCS 的六个模型,并发现使用 RF 和 MLP 方法的模型优于 NB。最后,使用所有带有 MACCS 键的模型对 41 个化合物的外部测试集进行预测。综上所述,使用 MACCS 键和 SVM、RF 和 MLP 方法构建的模型在测试集和外部测试集上均表现出良好的预测性能。此外,我们还基于指纹的信息增益对 mPGES-1 与其抑制剂之间的构效关系进行了分析,可以确定 mPGES-1 活性的一些关键功能基团。结果发现,高活性抑制剂通常含有酰胺基、芳环或氮杂环,以及几个杂原子取代基,如氟和氯。羧基和硫原子基团主要出现在低活性抑制剂中。