Ehmki Emanuel S R, Kramer Christian
Chemical Biology/Therapeutic Modalities, F. Hoffmann-La Roche Ltd. , Roche Innovation Center Basel, Grenzacherstrasse 124, 4070 Basel, Switzerland.
J Chem Inf Model. 2017 May 22;57(5):1187-1196. doi: 10.1021/acs.jcim.6b00709. Epub 2017 May 1.
Suggesting novel compounds to be made on the basis of similarity to a previously seen structure-activity relationship (SAR) requires a measure for SAR similarity. While SAR similarity has intuitively been used by medicinal chemists for decades, no systematic comparison of candidate similarity metrics has been published to date. With this publication, we attempt to close that gap by providing a statistical framework that allows comparison of SAR similarity metrics by their ability to rank series that provide the best activity prediction of novel substituents. This prediction is a result of a two-step process that involves (a) judging the similarity between series and (b) transferring the SAR from one series to the other. We tested several SAR similarity metrics and found that a centered RMSD (cRMSD) in combination with a lineaar-regression-based prediction interpolation ranks SAR profiles best. Based on that ranking we can, with a given confidence, suggest novel substituents to be tested. The superiority of the cRMSD can be explained on the basis of experimental uncertainty of affinity data and measured affinity differences. The ability to measure SAR similarity is central to applications like matched molecular series (MMS) analysis, whose applicability depends on whether there is a potential for SAR transferability between series. With the new SAR similarity metric introduced here, we show how MMS can be used in a medicinal chemistry setting as an idea generator and a semiquantitative prediction tool.
基于与先前观察到的构效关系(SAR)的相似性来建议合成新型化合物,需要一种衡量SAR相似性的方法。尽管几十年来药物化学家一直在直观地使用SAR相似性,但迄今为止尚未发表对候选相似性度量的系统比较。通过本论文,我们试图通过提供一个统计框架来填补这一空白,该框架允许根据SAR相似性度量对提供新型取代基最佳活性预测的系列进行排序的能力来进行比较。这种预测是一个两步过程的结果,该过程包括(a)判断系列之间的相似性,以及(b)将SAR从一个系列转移到另一个系列。我们测试了几种SAR相似性度量,发现基于线性回归预测插值的中心均方根偏差(cRMSD)对SAR图谱的排序最佳。基于该排序,我们可以在给定的置信度下建议测试新型取代基。cRMSD的优越性可以基于亲和力数据的实验不确定性和测量的亲和力差异来解释。测量SAR相似性的能力对于诸如匹配分子系列(MMS)分析等应用至关重要,其适用性取决于系列之间是否存在SAR转移性的可能性。通过本文引入的新的SAR相似性度量,我们展示了MMS如何在药物化学环境中用作创意生成器和半定量预测工具。