Ranapurwala Shabbar I, Cavanaugh Joseph E, Young Tracy, Wu Hongqian, Peek-Asa Corinne, Ramirez Marizen R
1Injury Prevention Research and Department of Epidemiology, University of North Carolina at Chapel Hill, 137 E Franklin St, Suite 500, CB# 7505, Chapel Hill, NC 27599 USA.
2Department of Epidemiology, College of Public Health, University of Iowa, Iowa City, IA USA.
Inj Epidemiol. 2019 Jun 17;6:31. doi: 10.1186/s40621-019-0208-9. eCollection 2019.
The goal of predictive modelling is to identify the likelihood of future events, such as the predictive modelling used in climate science to forecast weather patterns and significant weather occurrences. In public health, increasingly sophisticated predictive models are used to predict health events in patients and to screen high risk individuals, such as for cardiovascular disease and breast cancer. Although causal modelling is frequently used in epidemiology to identify risk factors, predictive modelling provides highly useful information for individual risk prediction and for informing courses of treatment. Such predictive knowledge is often of great utility to physicians, counsellors, health education specialists, policymakers or other professionals, who may then advice course correction or interventions to prevent adverse health outcomes from occurring. In this manuscript, we use an example dataset that documents farm vehicle crashes and conventional statistical methods to forecast the risk of an injury or death in a farm vehicle crash for a specific individual or a scenario.
Using data from 7094 farm crashes that occurred between 2005 and 2010 in nine mid-western states, we demonstrate and discuss predictive model fitting approaches, model validation techniques using external datasets, and the calculation and interpretation of predicted probabilities. We then developed two automated risk prediction tools using readily available software packages. We discuss best practices and common limitations associated with predictive models built from observational datasets.
Predictive analysis offers tools that could aid the decision making of policymakers, physicians, and environmental health practitioners to improve public health.
预测建模的目标是识别未来事件的可能性,例如气候科学中用于预测天气模式和重大天气事件的预测建模。在公共卫生领域,越来越复杂的预测模型被用于预测患者的健康事件并筛查高危个体,如心血管疾病和乳腺癌。虽然因果建模在流行病学中经常用于识别风险因素,但预测建模为个体风险预测和治疗方案提供了非常有用的信息。这种预测性知识通常对医生、顾问、健康教育专家、政策制定者或其他专业人员非常有用,他们随后可以建议进行纠正或干预,以防止不良健康结果的发生。在本手稿中,我们使用一个记录农用车辆碰撞的示例数据集和传统统计方法,来预测特定个体或场景下农用车辆碰撞中受伤或死亡的风险。
利用2005年至2010年期间在美国中西部九个州发生的7094起农用车辆碰撞事故的数据,我们展示并讨论了预测模型拟合方法、使用外部数据集的模型验证技术,以及预测概率的计算和解释。然后,我们使用现成的软件包开发了两种自动风险预测工具。我们讨论了与基于观测数据集构建的预测模型相关的最佳实践和常见局限性。
预测分析提供了有助于政策制定者、医生和环境卫生从业者进行决策以改善公共卫生的工具。