White D B, Walawander C A, Liu D Y, Grasela T H
Department of Statistics, State University of New York, Buffalo 14214.
J Pharmacokinet Biopharm. 1992 Jun;20(3):295-313. doi: 10.1007/BF01062529.
In a simulation study of inference on population pharmacokinetic parameters, two methods of performing tests of hypotheses comparing two populations using NONMEM were evaluated. These two methods are the test based upon 95% confidence intervals and the likelihood ratio test. Data were simulated according to a monoexponential model and, in that context, power curves for each test were generated for (i) the ratio of mean clearance and (ii) the ratio of the population standard deviations of clearance. To generate the power curves, a range of these parameters was employed; other pharmacokinetic parameters were selected to reflect the variability typically present in a Phase II clinical trial. For tests comparing the means, the confidence interval tests had approximately the same power as the likelihood ratio tests and were consistently more faithful to the nominal level of significance. For comparison of the standard deviations, and when the volume of information available was relatively small, however, the likelihood ratio test was more able to detect differences between the two groups. These results were then compared to results on parameter estimation in order to gain insight into the question of power. As an example, the nonnormality of estimates of the ratio of standard deviations plays an important role in explaining the low power for the confidence interval tests. We conclude that, except for the situation of modeling standard deviations with only sparse information, NONMEM produces tests of significance that are effective at detecting clinically significant differences between two populations.
在一项关于群体药代动力学参数推断的模拟研究中,评估了使用NONMEM对两个群体进行假设检验的两种方法。这两种方法是基于95%置信区间的检验和似然比检验。数据根据单指数模型进行模拟,在此背景下,针对(i)平均清除率之比和(ii)清除率的群体标准差之比,为每种检验生成了功效曲线。为了生成功效曲线,采用了这些参数的一系列取值;选择其他药代动力学参数以反映II期临床试验中通常存在的变异性。对于比较均值的检验,置信区间检验的功效与似然比检验大致相同,并且始终更符合名义显著性水平。然而,对于标准差的比较,以及当可用信息量相对较小时,似然比检验更能检测出两组之间的差异。然后将这些结果与参数估计结果进行比较,以便深入了解功效问题。例如,标准差之比估计值的非正态性在解释置信区间检验功效较低方面起着重要作用。我们得出结论,除了仅用稀疏信息对标准差进行建模的情况外,NONMEM产生的显著性检验对于检测两个群体之间具有临床意义的差异是有效的。