CmaxDMPK, LLC, Framingham , Massachusetts 01701, United States.
Alkermes Inc., 852 Winter Street, Waltham, Massachusetts 02451, United States.
Mol Pharm. 2024 Mar 4;21(3):1192-1203. doi: 10.1021/acs.molpharmaceut.3c00812. Epub 2024 Jan 29.
Predicting human clearance with high accuracy from in silico-derived parameters alone is highly desirable, as it is fast, saves in vitro resources, and is animal-sparing. We derived random forest (RF) models from 1340 compounds with human intravenous pharmacokinetic (PK) data, the largest data set publicly available today. To assess the general applicability of the RF models, we systematically removed structural-therapeutic class analogues and other compounds with structural similarity from the training sets. For a quasi-prospective test set of 343 compounds, we show that RF models devoid of structurally similar compounds in the training set predict human clearance with a geometric mean fold error (GMFE) of 3.3. While the observed GMFE illustrates how difficult it is to generate a useful model that is broadly applicable, we posit that our RF models yield a more realistic assessment of how well human clearance can be predicted prospectively. We deployed the conformal prediction formalism to assess the model applicability and to determine the prediction confidence intervals for each prediction. We observed that clearance can be predicted better for renally cleared compounds than for other clearance mechanisms. We show that applying a classification model for predicting renal clearance identifies a subset of compounds for which clearance can be predicted with higher accuracy, yielding a GMFE of 2.3. In addition, our in silico RF human clearance models compared well to models derived from scaling human hepatocytes or preclinical in vivo data.
从计算得出的参数中准确预测人体清除率是非常理想的,因为它速度快、节省了体外资源且减少了动物使用。我们从目前公开的最大数据集(包含 1340 种具有人体静脉药代动力学(PK)数据的化合物)中推导了随机森林(RF)模型。为了评估 RF 模型的普遍适用性,我们系统地从训练集中去除了结构治疗类类似物和其他结构相似的化合物。对于一个由 343 种化合物组成的准前瞻性测试集,我们表明,在训练集中没有结构相似化合物的 RF 模型预测人体清除率的几何均数折叠误差(GMFE)为 3.3。虽然观察到的 GMFE 说明了生成一个广泛适用的有用模型有多么困难,但我们认为我们的 RF 模型可以更真实地评估前瞻性预测人体清除率的能力。我们采用了一致性预测形式来评估模型适用性,并确定每个预测的预测置信区间。我们观察到,对于经肾脏清除的化合物,清除率可以比其他清除机制更好地预测。我们表明,应用用于预测肾脏清除率的分类模型可以识别出一组可以更准确预测清除率的化合物,得到的 GMFE 为 2.3。此外,我们的计算 RF 人体清除率模型与从缩放人体肝细胞或临床前体内数据中得出的模型相比表现良好。