Turner Emily, Brown Gavin, Medina-Ariza Juanjo
University of Manchester UK University of Manchester, UK.
University of Seville Seville Spain University of Seville, Spain.
Psychosoc Interv. 2022 Jul 20;31(3):145-157. doi: 10.5093/pi2022a11. eCollection 2022 Jul.
Domestic abuse victim risk assessment is crucial for providing victims with the correct level of support. However, it has been shown that the approach currently taken by most UK police forces, the Domestic Abuse, Stalking, and Honour Based Violence (DASH) risk assessment, is not identifying the most vulnerable victims. Instead, we tested several machine learning algorithms and propose a predictive model, using logistic regression with elastic net as the best performing, that incorporates information readily available in police databases, and census-area-level statistics. We used data from a large UK police force including 350,000 domestic abuse incidents. Our models made significant improvement upon the predictive capacity of DASH, both for intimate partner violence (IPV; AUC = .748) and other forms of domestic abuse (non-IPV; AUC = .763). The most influential variables in the model were of the categories criminal history and domestic abuse history, particularly time since the last incident. We show that the DASH questions contributed almost nothing to the predictive performance. We also provide an overview of model fairness performance for ethnic and socioeconomic subgroups of the data sample. Although there were disparities between ethnic and demographic subgroups, everyone benefited from the increased accuracy of model-based predictions when compared with officer risk predictions.
家庭虐待受害者风险评估对于为受害者提供适当水平的支持至关重要。然而,研究表明,英国大多数警察部队目前采用的方法,即家庭虐待、跟踪骚扰和基于荣誉的暴力行为(DASH)风险评估,未能识别出最脆弱的受害者。相反,我们测试了几种机器学习算法,并提出了一个预测模型,该模型使用弹性网络逻辑回归作为表现最佳的模型,纳入了警方数据库中 readily available 的信息以及人口普查区域层面的统计数据。我们使用了来自英国一支大型警察部队的数据,其中包括35万起家庭虐待事件。我们的模型在亲密伴侣暴力(IPV;AUC = 0.748)和其他形式的家庭虐待(非IPV;AUC = 0.763)的预测能力方面比DASH有了显著提高。模型中最具影响力的变量属于犯罪历史和家庭虐待历史类别,特别是自上一次事件以来的时间。我们表明,DASH问题对预测性能几乎没有贡献。我们还概述了数据样本中种族和社会经济亚组的模型公平性表现。尽管种族和人口亚组之间存在差异,但与警官风险预测相比,每个人都从基于模型的预测准确性提高中受益。