Optibrium Ltd., F5-6 Blenheim House, Cambridge Innovation Park, Denny End Road, Cambridge, CB25 9PB, UK.
The European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, CB10 1SD, UK.
J Comput Aided Mol Des. 2018 Apr;32(4):537-546. doi: 10.1007/s10822-018-0107-0. Epub 2018 Feb 20.
In the development of novel pharmaceuticals, the knowledge of how many, and which, Cytochrome P450 isoforms are involved in the phase I metabolism of a compound is important. Potential problems can arise if a compound is metabolised predominantly by a single isoform in terms of drug-drug interactions or genetic polymorphisms that would lead to variations in exposure in the general population. Combined with models of regioselectivities of metabolism by each isoform, such a model would also aid in the prediction of the metabolites likely to be formed by P450-mediated metabolism. We describe the generation of a multi-class random forest model to predict which, out of a list of the seven leading Cytochrome P450 isoforms, would be the major metabolising isoforms for a novel compound. The model has a 76% success rate with a top-1 criterion and an 88% success rate for a top-2 criterion and shows significant enrichment over randomised models.
在新型药物的开发中,了解有多少种细胞色素 P450 同工酶参与化合物的 I 相代谢,以及是哪一种同工酶参与,这一点很重要。如果一种化合物主要由单一同工酶代谢,那么在药物相互作用或遗传多态性方面可能会出现潜在问题,这会导致普通人群中的暴露量发生变化。如果将该模型与每种同工酶的代谢区域选择性模型相结合,也有助于预测由 P450 介导的代谢可能形成的代谢物。我们描述了一种多类随机森林模型的生成方法,该模型可以预测在一组七种主要细胞色素 P450 同工酶中,哪种同工酶将是新型化合物的主要代谢同工酶。该模型的成功率为 76%,采用第一标准;成功率为 88%,采用第二标准。与随机模型相比,该模型具有显著的富集性。