a National Leading Research Laboratory (NLRL) of Molecular Modeling & Drug Design, College of Pharmacy and Graduate School of Pharmaceutical Sciences , Ewha Womans University , Seoul , Republic of Korea.
SAR QSAR Environ Res. 2017 Oct;28(10):863-874. doi: 10.1080/1062936X.2017.1399925.
The cytochrome P450 (CYP) enzyme superfamily is involved in phase I metabolism which chemically modifies a variety of substrates via oxidative reactions to make them more water-soluble and easier to eliminate. Inhibition of these enzymes leads to undesirable effects, including toxic drug accumulations and adverse drug-drug interactions. Hence, it is necessary to develop in silico models that can predict the inhibition potential of compounds for different CYP isoforms. This study focused on five major CYP isoforms, including CYP1A2, 2C9, 2C19, 2D6 and 3A4, that are responsible for more than 90% of the metabolism of clinical drugs. The main aim of this study is to develop a multiple-category classification model (MCM) for the major CYP isoforms using a Laplacian-modified naïve Bayesian method. The dataset composed of more than 4500 compounds was collected from the PubChem Bioassay database. VolSurf+ descriptors and FCFP_8 fingerprint were used as input features to build classification models. The results demonstrated that the developed MCM using Laplacian-modified naïve Bayesian method was successful in classifying inhibitors and non-inhibitors for each CYP isoform. Moreover, the accuracy, sensitivity and specificity values for both training and test sets were above 80% and also yielded satisfactory area under the receiver operating characteristic curve and Matthews correlation coefficient values.
细胞色素 P450(CYP)酶超家族参与 I 相代谢,通过氧化反应对各种底物进行化学修饰,使其更具水溶性,更容易消除。这些酶的抑制会导致不良影响,包括有毒药物的积累和药物相互作用。因此,有必要开发能够预测化合物对不同 CYP 同工酶抑制潜力的计算模型。本研究集中在五个主要的 CYP 同工酶上,包括 CYP1A2、2C9、2C19、2D6 和 3A4,它们负责超过 90%的临床药物代谢。本研究的主要目的是使用拉普拉斯修正朴素贝叶斯方法为主要 CYP 同工酶开发多类别分类模型(MCM)。该数据集由来自 PubChem 生物测定数据库的超过 4500 种化合物组成。VolSurf+描述符和 FCFP_8 指纹用作构建分类模型的输入特征。结果表明,使用拉普拉斯修正朴素贝叶斯方法开发的 MCM 成功地对每个 CYP 同工酶的抑制剂和非抑制剂进行了分类。此外,训练集和测试集的准确性、敏感性和特异性值均高于 80%,并且还产生了令人满意的接收者操作特征曲线和马修斯相关系数值。