Violence Prevention Research Program, Department of Emergency Medicine, University of California, Davis, USA.
California Firearm Violence Research Center, Davis, USA.
J Urban Health. 2024 Oct;101(5):955-967. doi: 10.1007/s11524-024-00909-0. Epub 2024 Sep 5.
Gun-related crime continues to be an urgent public health and safety problem in cities across the US. A key question is: how are firearms diverted from the legal retail market into the hands of gun offenders? With close to 8 million legal firearm transaction records in California (2010-2020) linked to over 380,000 records of recovered crime guns (2010-2021), we employ supervised machine learning to predict which firearms are used in crimes shortly after purchase. Specifically, using random forest (RF) with stratified under-sampling, we predict any crime gun recovery within a year (0.2% of transactions) and violent crime gun recovery within a year (0.03% of transactions). We also identify the purchaser, firearm, and dealer characteristics most predictive of this short time-to-crime gun recovery using SHapley Additive exPlanations and mean decrease in accuracy variable importance measures. Overall, our models show good discrimination, and we are able to identify firearms at extreme risk for diversion into criminal hands. The test set AUC is 0.85 for both models. For the model predicting any recovery, a default threshold of 0.50 results in a sensitivity of 0.63 and a specificity of 0.88. Among transactions identified as extremely risky, e.g., transactions with a score of 0.98 and above, 74% (35/47 in the test data) are recovered within a year. The most important predictive features include purchaser age and caliber size. This study suggests the potential utility of transaction records combined with machine learning to identify firearms at the highest risk for diversion and criminal use soon after purchase.
枪支相关犯罪在美国各大城市仍是一个紧迫的公共卫生和安全问题。一个关键问题是:枪支是如何从合法的零售市场流入枪支罪犯手中的?在加利福尼亚州(2010-2020 年),有近 800 万份合法枪支交易记录与 38 万多份追回的犯罪枪支记录(2010-2021 年)相关联,我们利用监督机器学习来预测哪些枪支在购买后不久就会用于犯罪。具体来说,我们使用随机森林(RF)进行分层欠采样,来预测一年内任何犯罪枪支回收(交易的 0.2%)和一年内暴力犯罪枪支回收(交易的 0.03%)。我们还使用 SHapley Additive exPlanations 和平均减少准确性变量重要性度量来识别最能预测这种短时间内犯罪枪支回收的购买者、枪支和经销商特征。总的来说,我们的模型显示出良好的区分能力,并且能够识别出极有可能流入犯罪分子手中的枪支。两个模型的测试集 AUC 均为 0.85。对于预测任何回收的模型,默认阈值为 0.50,灵敏度为 0.63,特异性为 0.88。在被确定为风险极高的交易中,例如,得分在 0.98 及以上的交易,74%(测试数据中的 35/47)在一年内被追回。最重要的预测特征包括购买者年龄和口径大小。本研究表明,交易记录与机器学习相结合,具有识别购买后不久最有可能被转移和用于犯罪的枪支的潜力。