Wang Disha, Liu Wenjun, Shen Zihao, Jiang Lei, Wang Jie, Li Shiliang, Li Honglin
Shanghai Key Laboratory of New Drug Design, State Key Laboratory of Bioreactor Engineering, School of Pharmacy, East China University of Science and Technology, Shanghai, China.
Research and Development Department, Jiangzhong Pharmaceutical Co., Ltd., Nanchang, China.
Front Pharmacol. 2020 Jan 30;10:1586. doi: 10.3389/fphar.2019.01586. eCollection 2019.
Drug metabolism research plays a key role in the discovery and development of drugs. Based on the discovery of drug metabolites, new chemical entities can be identified and potential safety hazards caused by reactive or toxic metabolites can be minimized. Nowadays, computational methods are usually complementary tools for experiments. However, current metabolites prediction methods tend to have high false positive rates with low accuracy and are usually only used for specific enzyme systems. In order to overcome this difficulty, a method was developed in this paper by first establishing a database with broad coverage of SMARTS-coded metabolic reaction rule, and then extracting the molecular fingerprints of compounds to construct a classification model based on deep learning algorithms. The metabolic reaction rule database we built can supplement chemically reasonable negative reaction examples. Based on deep learning algorithms, the model could determine which reaction types are more likely to occur than the others. In the test set, our method can achieve the accuracy of 70% (Top-10), which is significantly higher than that of random guess and the rule-based method SyGMa. The results demonstrated that our method has a certain predictive ability and application value.
药物代谢研究在药物的发现和开发中起着关键作用。基于药物代谢物的发现,可以识别新的化学实体,并将由反应性或有毒代谢物引起的潜在安全危害降至最低。如今,计算方法通常是实验的补充工具。然而,当前的代谢物预测方法往往具有较高的假阳性率,准确性较低,并且通常仅用于特定的酶系统。为了克服这一困难,本文开发了一种方法,首先建立一个覆盖范围广泛的SMARTS编码代谢反应规则数据库,然后提取化合物的分子指纹,以构建基于深度学习算法的分类模型。我们建立的代谢反应规则数据库可以补充化学上合理的负反应示例。基于深度学习算法,该模型可以确定哪些反应类型比其他反应类型更有可能发生。在测试集中,我们的方法可以达到70%(前10)的准确率,这明显高于随机猜测和基于规则的方法SyGMa。结果表明,我们的方法具有一定的预测能力和应用价值。