Olofsen Erik, Dahan Albert
Department of Anesthesiology, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, Netherlands.
F1000Res. 2013 Mar 4;2:71. doi: 10.12688/f1000research.2-71.v2. eCollection 2013.
Akaike's information theoretic criterion for model discrimination (AIC) is often stated to "overfit", i.e., it selects models with a higher dimension than the dimension of the model that generated the data. However, with experimental pharmacokinetic data it may not be possible to identify the correct model, because of the complexity of the processes governing drug disposition. Instead of trying to find the correct model, a more useful objective might be to minimize the prediction error of drug concentrations in subjects with unknown disposition characteristics. In that case, the AIC might be the selection criterion of choice. We performed Monte Carlo simulations using a model of pharmacokinetic data (a power function of time) with the property that fits with common multi-exponential models can never be perfect - thus resembling the situation with real data. Prespecified models were fitted to simulated data sets, and AIC and AIC c (the criterion with a correction for small sample sizes) values were calculated and averaged. The average predictive performances of the models, quantified using simulated validation sets, were compared to the means of the AICs. The data for fits and validation consisted of 11 concentration measurements each obtained in 5 individuals, with three degrees of interindividual variability in the pharmacokinetic volume of distribution. Mean AIC c corresponded very well, and better than mean AIC, with mean predictive performance. With increasing interindividual variability, there was a trend towards larger optimal models, but with respect to both lowest AIC c and best predictive performance. Furthermore, it was observed that the mean square prediction error itself became less suitable as a validation criterion, and that a predictive performance measure should incorporate interindividual variability. This simulation study showed that, at least in a relatively simple mixed effects modelling context with a set of prespecified models, minimal mean AIC c corresponded to best predictive performance even in the presence of relatively large interindividual variability.
赤池信息准则(AIC)常用于模型判别,常被指出会“过度拟合”,即它选择的模型维度高于生成数据的模型维度。然而,对于实验性药代动力学数据,由于药物处置过程的复杂性,可能无法识别正确的模型。与其试图找到正确的模型,一个更有用的目标可能是将处置特征未知的受试者体内药物浓度的预测误差最小化。在这种情况下,AIC可能是首选的选择标准。我们使用药代动力学数据模型(时间的幂函数)进行了蒙特卡罗模拟,该模型具有与常见多指数模型的拟合永远无法完美的特性——因此类似于实际数据的情况。将预先指定的模型拟合到模拟数据集,并计算和平均AIC和AICc(针对小样本量进行校正的准则)值。使用模拟验证集量化的模型平均预测性能与AIC的均值进行比较。拟合和验证的数据包括在5名个体中每人获得的11个浓度测量值,药代分布容积存在三种个体间变异性程度。平均AICc与平均预测性能非常吻合,且比平均AIC更好。随着个体间变异性的增加,最优模型有变大的趋势,但在最低AICc和最佳预测性能方面都是如此。此外,观察到均方预测误差本身作为验证标准变得不太合适,并且预测性能度量应纳入个体间变异性。这项模拟研究表明,至少在一组预先指定模型的相对简单的混合效应建模背景下,即使存在相对较大的个体间变异性,最小平均AICc也对应最佳预测性能。