Ahlberg Ernst, Amberg Alexander, Beilke Lisa D, Bower David, Cross Kevin P, Custer Laura, Ford Kevin A, Van Gompel Jacky, Harvey James, Honma Masamitsu, Jolly Robert, Joossens Elisabeth, Kemper Raymond A, Kenyon Michelle, Kruhlak Naomi, Kuhnke Lara, Leavitt Penny, Naven Russell, Neilan Claire, Quigley Donald P, Shuey Dana, Spirkl Hans-Peter, Stavitskaya Lidiya, Teasdale Andrew, White Angela, Wichard Joerg, Zwickl Craig, Myatt Glenn J
AstraZeneca, Mölndal, Sweden.
Sanofi-Aventis Deutschland GmbH, Frankfurt, Germany.
Regul Toxicol Pharmacol. 2016 Jun;77:1-12. doi: 10.1016/j.yrtph.2016.02.003. Epub 2016 Feb 13.
Statistical-based and expert rule-based models built using public domain mutagenicity knowledge and data are routinely used for computational (Q)SAR assessments of pharmaceutical impurities in line with the approach recommended in the ICH M7 guideline. Knowledge from proprietary corporate mutagenicity databases could be used to increase the predictive performance for selected chemical classes as well as expand the applicability domain of these (Q)SAR models. This paper outlines a mechanism for sharing knowledge without the release of proprietary data. Primary aromatic amine mutagenicity was selected as a case study because this chemical class is often encountered in pharmaceutical impurity analysis and mutagenicity of aromatic amines is currently difficult to predict. As part of this analysis, a series of aromatic amine substructures were defined and the number of mutagenic and non-mutagenic examples for each chemical substructure calculated across a series of public and proprietary mutagenicity databases. This information was pooled across all sources to identify structural classes that activate or deactivate aromatic amine mutagenicity. This structure activity knowledge, in combination with newly released primary aromatic amine data, was incorporated into Leadscope's expert rule-based and statistical-based (Q)SAR models where increased predictive performance was demonstrated.
利用公共领域的致突变性知识和数据构建的基于统计和基于专家规则的模型,通常用于按照ICH M7指南推荐的方法对药物杂质进行计算(定量构效关系)SAR评估。来自企业专有致突变性数据库的知识可用于提高对选定化学类别的预测性能,并扩大这些(定量构效关系)SAR模型的适用范围。本文概述了一种在不泄露专有数据的情况下共享知识的机制。选择伯芳香胺致突变性作为案例研究,因为该化学类别在药物杂质分析中经常遇到,并且目前芳香胺的致突变性难以预测。作为该分析的一部分,定义了一系列芳香胺子结构,并在一系列公共和专有致突变性数据库中计算了每个化学子结构的致突变和非致突变实例数量。汇总所有来源的这些信息,以识别激活或失活芳香胺致突变性的结构类别。这种结构活性知识与新发布的伯芳香胺数据相结合,被纳入Leadscope基于专家规则和基于统计的(定量构效关系)SAR模型中,其中展示了提高的预测性能。