Jaki Thomas, Su Ting-Li, Kim Minjung, Lee Van Horn M
Lancaster University.
Manchester University.
Commun Stat Simul Comput. 2018;47(4):1028-1038. doi: 10.1080/03610918.2017.1303726. Epub 2017 Jun 23.
Bootstrapping has been used as a diagnostic tool for validating model results for a wide array of statistical models. Here we evaluate the use of the non-parametric bootstrap for model validation in mixture models. We show that the bootstrap is problematic for validating the results of class enumeration and demonstrating the stability of parameter estimates in both finite mixture and regression mixture models. In only 44% of simulations did bootstrapping detect the correct number of classes in at least 90% of the bootstrap samples for a finite mixture model without any model violations. For regression mixture models and cases with violated model assumptions, the performance was even worse. Consequently, we cannot recommend the non-parametric bootstrap for validating mixture models. The cause of the problem is that when resampling is used influential individual observations have a high likelihood of being sampled many times. The presence of multiple replications of even moderately extreme observations is shown to lead to additional latent classes being extracted. To verify that these replications cause the problems we show that leave-k-out cross-validation where sub-samples taken without replacement does not suffer from the same problem.
自举法已被用作一种诊断工具,用于验证各种统计模型的模型结果。在此,我们评估非参数自举法在混合模型中用于模型验证的情况。我们表明,自举法在验证类枚举结果以及证明有限混合模型和回归混合模型中参数估计的稳定性方面存在问题。在没有任何模型违反的情况下,对于有限混合模型,在仅44%的模拟中,自举法在至少90%的自举样本中检测到正确的类数。对于回归混合模型以及模型假设被违反的情况,性能甚至更差。因此,我们不推荐使用非参数自举法来验证混合模型。问题的原因在于,当使用重采样时,有影响力的单个观测值很有可能被多次采样。即使是适度极端观测值的多次重复出现也会导致提取出额外的潜在类。为了验证这些重复会导致问题,我们表明不放回抽取子样本的留一法交叉验证不会出现同样的问题。