Verrey Jacob, Neyroud Peter, Sherman Lawrence, Ariel Barak
Institute of Criminology, University of Cambridge, Sidgwick Ave, Cambridge, CB3 9DA UK.
Benchmark Cambridge Ltd., Rectory Lane, Somersham, PE28 3EL UK.
Neural Comput Appl. 2025;37(26):21607-21657. doi: 10.1007/s00521-025-11478-x. Epub 2025 Aug 1.
This investigation explores whether machine learning can predict recidivism while addressing societal biases. To investigate this, we obtained conviction data on 346,685 records from the UK's Police National Computer (PNC), spanning January 1, 2000, to February 3, 2006 (His Majesty's Inspectorate of Constabulary, Use of the Police National Computer: An inspection of the ACRO Criminal Records Office. His Majesty's Inspectorate of Constabulary, Birmingham, https://assets-hmicfrs.justiceinspectorates.gov.uk/uploads/police-national-computer-use-acro-criminal-records-office.pdf, 2017). We generate twelve machine learning models, six forecasting general recidivism and six forecasting violent recidivism over a 3-year follow-up period, evaluated via fivefold cross-validation. Our best-performing models outperform the existing state of the art, achieving area under the curve (AUC) scores of 0.8660 and 0.8375 for general and violent recidivism, respectively. Next, we construct a fairness scale that communicates the semantic and technical trade-offs associated with debiasing a criminal justice forecasting model. We use this scale to debias our best-performing models. Results indicate that both models satisfy all five fairness definitions: the metrics measuring these definitions (recall, precision, positive rate, and error balance) each fall within one percentage point across demographic groups. Deployment recommendations and implications are discussed. These include recommended safeguards against false positives, an explication of how these models address societal biases, and a case study illustrating how these models can improve existing criminal justice practices. That is, these models may help police identify fewer people in a way less impacted by structural bias while still reducing crime. A randomized controlled trial is proposed to test this case study, and further research directions are explored.
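For concreteness, the sketch below shows one way to reproduce the evaluation loop the abstract describes: fivefold cross-validated AUC together with the per-demographic spread of recall, precision, positive rate, and error balance. It is a minimal illustration under stated assumptions, not the authors' pipeline: the classifier choice (scikit-learn's GradientBoostingClassifier), the 0.5 decision threshold, the variable names (X, y, group), and the use of false-positive rate as a stand-in for "error balance" are all assumptions introduced here.

import numpy as np
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import precision_score, recall_score, roc_auc_score
from sklearn.model_selection import StratifiedKFold

def evaluate(X, y, group, n_splits=5, threshold=0.5, seed=0):
    """Mean cross-validated AUC plus per-metric spread across groups.

    X: feature matrix; y: binary 3-year recidivism labels;
    group: demographic label per record. All names are placeholders.
    """
    skf = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    aucs = []
    y_pred = np.zeros_like(y)
    for tr, te in skf.split(X, y):
        clf = GradientBoostingClassifier().fit(X[tr], y[tr])
        proba = clf.predict_proba(X[te])[:, 1]
        aucs.append(roc_auc_score(y[te], proba))
        y_pred[te] = (proba >= threshold).astype(int)  # out-of-fold labels

    # Per-group metrics: recall, precision, positive rate, and
    # false-positive rate (one reading of "error balance").
    per_group = {}
    for g in np.unique(group):
        m = group == g
        neg = y[m] == 0
        per_group[g] = {
            "recall": recall_score(y[m], y_pred[m], zero_division=0),
            "precision": precision_score(y[m], y_pred[m], zero_division=0),
            "positive_rate": float(y_pred[m].mean()),
            "fpr": float(y_pred[m][neg].mean()) if neg.any() else float("nan"),
        }

    # Fairness check: the spread (max - min) of each metric across groups;
    # the paper reports spreads within one percentage point after debiasing.
    spreads = {
        k: max(v[k] for v in per_group.values())
           - min(v[k] for v in per_group.values())
        for k in ("recall", "precision", "positive_rate", "fpr")
    }
    return float(np.mean(aucs)), spreads

A fairness audit under this reading would call evaluate on the debiased model's data and check that every value in spreads is below 0.01.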
The online version contains supplementary material available at 10.1007/s00521-025-11478-x.