Department of Chemical System Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, 113-8656, Japan.
Mol Inform. 2019 Oct;38(10):e1900010. doi: 10.1002/minf.201900010. Epub 2019 Jun 12.
Cytochrome P450 (CYP) is an enzyme family that plays a crucial role in metabolism, mainly metabolizing xenobiotics to produce non-toxic structures, however, some metabolized products can cause hepatotoxicity. Hence, predicting the structures of CYP products is an important task in designing non-hepatotoxic drugs. Here, we have developed novel atomic descriptors to predict the sites of metabolism (SoM) in CYP substrates. We proposed descriptors that describe topological and electrostatic characteristics of CYP substrates using Gasteiger charge. The proposed descriptors were applied to CYP3A4 data analysis as a case study. As a result of the descriptor selection, we obtained a gradient boosting decision tree-based SoM classification model that used 139 existing descriptors and the proposed 45 descriptors, and the model performed well in terms of the Matthews correlation coefficient. We also developed a structure converter to predict CYP products. This converter correctly generated 51 structural formulas of experimentally observed CYP3A4 products according to a manual evaluation.
细胞色素 P450(CYP)是一种酶家族,在代谢中起着至关重要的作用,主要代谢外来物质产生无毒结构,但一些代谢产物可能会引起肝毒性。因此,预测 CYP 产物的结构是设计非肝毒性药物的重要任务。在这里,我们开发了新的原子描述符来预测 CYP 底物的代谢部位(SoM)。我们提出了使用 Gasteiger 电荷描述 CYP 底物拓扑和静电特性的描述符。将提出的描述符应用于 CYP3A4 数据分析作为案例研究。通过描述符选择,我们获得了基于梯度提升决策树的 SoM 分类模型,该模型使用了 139 个现有描述符和 45 个提出的描述符,该模型在 Matthews 相关系数方面表现良好。我们还开发了一种结构转换器来预测 CYP 产物。该转换器根据手动评估,正确生成了 51 种实验观察到的 CYP3A4 产物的结构公式。