Novartis Institute of Biomedical Research , Emeryville , California 94608 , United States.
Simulations Plus, Inc ., 42505 10th Street West , Lancaster , California 93534 , United States.
Mol Pharm. 2018 Mar 5;15(3):831-839. doi: 10.1021/acs.molpharmaceut.7b00973. Epub 2018 Feb 1.
When medicinal chemists need to improve oral bioavailability (%F) during lead optimization, they systematically modify compound properties mainly based on their own experience and general rules of thumb. However, at least a dozen properties can influence %F, and the difficulty of multiparameter optimization for such complex nonlinear processes grows combinatorially with the number of variables. Furthermore, strategies can be in conflict. For example, adding a polar or charged group will generally increase solubility but decrease permeability. Identifying the 2 or 3 properties that most influence %F for a given compound series would make %F optimization much more efficient. We previously reported an adaptation of physiologically based pharmacokinetic (PBPK) simulations to predict %F for lead series from purely computational inputs within a 2-fold average error. Here, we run thousands of such simulations to generate a comprehensive "bioavailability landscape" for each series. A key innovation was recognition that the large and variable number of p K's in drug molecules could be replaced by just the two straddling the isoelectric point. Another was use of the ZINC database to cull out chemically inaccessible regions of property space. A quadratic partial least squares regression (PLS) accurately fits a continuous surface to these thousands of bioavailability predictions. The PLS coefficients indicate the globally sensitive compound properties. The PLS surface also displays the %F landscape in these sensitive properties locally around compounds of particular interest. Finally, being quick to calculate, the PLS equation can be combined with models for activity and other properties for multiobjective lead optimization.
当药物化学家在进行先导化合物优化时需要提高口服生物利用度(%F),他们会根据自己的经验和一般经验法则有针对性地对化合物性质进行系统性地修饰。然而,至少有十几个性质会影响%F,对于如此复杂的非线性过程,多参数优化的难度会随着变量数量的增加呈组合式增长。此外,策略之间可能会存在冲突。例如,添加极性或带电基团通常会提高溶解度,但会降低通透性。确定对给定化合物系列影响%F 的 2 或 3 个最重要的性质将使%F 优化效率更高。我们之前曾报道过一种生理相关药代动力学(PBPK)模拟的改编,该模拟可根据纯计算输入在 2 倍平均误差范围内预测先导系列的%F。在这里,我们运行了数千次此类模拟,为每个系列生成了一个全面的“生物利用度图谱”。一项关键创新是认识到药物分子中大量且多变的 pKa 可以用仅跨越等电点的两个 pKa 来代替。另一个创新是使用 ZINC 数据库来剔除性质空间中化学不可及的区域。二次偏最小二乘回归(PLS)准确地将数千个生物利用度预测拟合到一个连续的表面上。PLS 系数表示对化合物性质具有全局敏感性。PLS 表面还显示了这些敏感性质中特定化合物周围的%F 图谱。最后,由于计算速度快,PLS 方程可以与活性和其他性质模型结合,用于多目标先导化合物优化。