School of Population and Public Health, University of British Columbia, Vancouver, BC, Canada.
BMC Public Health. 2010 Nov 18;10:710. doi: 10.1186/1471-2458-10-710.
Computer simulation models are used increasingly to support public health research and policy, but questions about their quality persist. The purpose of this article is to review the principles and methods for validation of population-based disease simulation models.
We developed a comprehensive framework for validating population-based chronic disease simulation models and used this framework in a review of published model validation guidelines. Based on the review, we formulated a set of recommendations for gathering evidence of model credibility.
Evidence of model credibility derives from examining: 1) the process of model development, 2) the performance of a model, and 3) the quality of decisions based on the model. Many important issues in model validation are insufficiently addressed by current guidelines. These issues include a detailed evaluation of different data sources, graphical representation of models, computer programming, model calibration, between-model comparisons, sensitivity analysis, and predictive validity. The role of external data in model validation depends on the purpose of the model (e.g., decision analysis versus prediction). More research is needed on the methods of comparing the quality of decisions based on different models.
As the role of simulation modeling in population health is increasing and models are becoming more complex, there is a need for further improvements in model validation methodology and common standards for evaluating model credibility.
计算机模拟模型越来越多地被用于支持公共卫生研究和政策,但关于其质量的问题仍然存在。本文的目的是回顾基于人群的疾病模拟模型验证的原则和方法。
我们开发了一个综合框架来验证基于人群的慢性疾病模拟模型,并在对已发表的模型验证指南的综述中使用了该框架。基于综述,我们制定了一系列用于收集模型可信度证据的建议。
模型可信度的证据来源于检查:1)模型开发过程,2)模型性能,和 3)基于模型的决策质量。当前指南中许多重要的模型验证问题没有得到充分解决。这些问题包括对不同数据源的详细评估、模型的图形表示、计算机编程、模型校准、模型间比较、敏感性分析和预测有效性。外部数据在模型验证中的作用取决于模型的目的(例如,决策分析与预测)。需要进一步研究基于不同模型的决策质量比较方法。
随着模拟建模在人群健康中的作用不断增加且模型变得越来越复杂,需要进一步改进模型验证方法和评估模型可信度的通用标准。