Computational Toxicology, Global Safety Assessment, AstraZeneca R&D, Pepparedsleden 1, 43153 Mölndal, Sweden.
J Chem Inf Model. 2013 Aug 26;53(8):2001-17. doi: 10.1021/ci400281y. Epub 2013 Jul 25.
State-of-the-art quantitative structure-activity relationship (QSAR) models are often based on nonlinear machine learning algorithms, which are difficult to interpret. From a pharmaceutical perspective, QSARs are used to enhance the chemical design process. Ultimately, they should not only provide a prediction but also contribute to a mechanistic understanding and guide modifications to the chemical structure, promoting compounds with desirable biological activity profiles. Global ranking of descriptor importance and inverse QSAR have been used for these purposes. This paper introduces localized heuristic inverse QSAR, which provides an assessment of the relative ability of the descriptors to influence the biological response in an area localized around the predicted compound. The method is based on numerical gradients with parameters optimized using data sets sampled from analytical functions. The heuristic character of the method reduces the computational requirements and makes it applicable not only to fragment based methods but also to QSARs based on bulk descriptors. The application of the method is illustrated on congeneric QSAR data sets, and it is shown that the predicted influential descriptors can be used to guide structural modifications that affect the biological response in the desired direction. The method is implemented into the AZOrange Open Source QSAR package. The current implementation of localized heuristic inverse QSAR is a step toward a generally applicable method for elucidating the structure activity relationship specifically for a congeneric region of chemical space when using QSARs based on bulk properties. Consequently, this method could contribute to accelerating the chemical design process in pharmaceutical projects, as well as provide information that could enhance the mechanistic understanding for individual scaffolds.
最先进的定量构效关系 (QSAR) 模型通常基于非线性机器学习算法,这些算法难以解释。从制药的角度来看,QSAR 用于增强化学设计过程。最终,它们不仅应该提供预测,还应该有助于对机制的理解,并指导对化学结构的修改,促进具有理想生物学活性特征的化合物。全局描述符重要性排名和逆 QSAR 已用于这些目的。本文介绍了局部启发式逆 QSAR,它提供了对描述符在预测化合物周围局部区域影响生物响应的相对能力的评估。该方法基于数值梯度,参数使用从分析函数采样的数据集进行优化。该方法的启发式特性降低了计算要求,使其不仅适用于基于片段的方法,也适用于基于整体描述符的 QSAR。该方法在同类 QSAR 数据集上的应用表明,可以使用预测的有影响的描述符来指导结构修改,从而朝着所需的方向影响生物响应。该方法已被实施到 AZOrange 开源 QSAR 软件包中。局部启发式逆 QSAR 的当前实现是朝着在使用基于整体性质的 QSAR 时阐明特定同类化学空间区域的结构活性关系的通用方法迈出的一步。因此,该方法可以为药物项目中的化学设计过程加速提供帮助,并为个别支架提供增强机制理解的信息。