Sahlin U, Jeliazkova N, Öberg T
Centre of Environmental and Climate Research, Lund University, Lund, Sweden phonex:+46 46 222 6831.
Department of Biology and Environmental Science, Linnaeus University, Kalmar, Sweden.
Mol Inform. 2014 Jan;33(1):26-35. doi: 10.1002/minf.201200131. Epub 2013 Oct 7.
Predictive models used in decision making, such as QSARs in chemical regulation or drug discovery, call for evaluated approaches to quantitatively assess associated uncertainty in predictions. Uncertainty in less reliable predictions may be captured by locally varying predictive errors. In the current study, model-based bootstrapping was combined with analogy reasoning to generate predictive distributions varying in magnitude over a model's domain of applicability. A resampling experiment based on PLS regressions on four QSAR data sets demonstrated that predictive errors assessed by k nearest neighbour or weighted PRedicted Error Sum of Squares (PRESS) on samples of external test data or by internal cross-validation improved the performance of the uncertainty assessment. Analogy using similarity defined by Euclidean distances, or differences in standard deviation in perturbed predictions, resulted in better performances than similarity defined by distance to, or density of, the training data. Locally assessed predictive distributions had on average at least as good coverage as Gaussian distribution with variance assessed from the PRESS. An R-code is provided that evaluates performances of the suggested algorithms to assess predictive error based on log likelihood scores and empirical coverage graphs, and which applies these to derive confidence intervals or samples from the predictive distributions of query compounds.
用于决策的预测模型,如化学监管或药物研发中的定量构效关系(QSARs),需要经过评估的方法来定量评估预测中相关的不确定性。可靠性较低的预测中的不确定性可以通过局部变化的预测误差来捕捉。在当前研究中,基于模型的自助法与类比推理相结合,以生成在模型适用范围内大小不同的预测分布。基于四个QSAR数据集的偏最小二乘回归的重采样实验表明,通过外部测试数据样本上的k近邻或加权预测误差平方和(PRESS)或通过内部交叉验证评估的预测误差提高了不确定性评估的性能。使用欧几里得距离定义的相似性或扰动预测中的标准差差异进行类比,其性能优于通过与训练数据的距离或密度定义的相似性。局部评估的预测分布平均至少具有与基于PRESS评估方差的高斯分布一样好的覆盖率。提供了一个R代码,用于评估基于对数似然分数和经验覆盖率图评估预测误差的建议算法的性能,并将其应用于从查询化合物的预测分布中推导置信区间或样本。