• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用二项和生存结局的登记数据优化犯罪再犯预测模型的预测性能。

Optimizing predictive performance of criminal recidivism models using registration data with binary and survival outcomes.

机构信息

Research and Documentation Centre (WODC), Ministry of Justice and Security, The Hague, Zuid-Holland, the Netherlands.

Department of Social Sciences, Utrecht University, Utrecht, Utrecht, the Netherlands.

出版信息

PLoS One. 2019 Mar 8;14(3):e0213245. doi: 10.1371/journal.pone.0213245. eCollection 2019.

DOI:10.1371/journal.pone.0213245
PMID:30849094
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6407787/
Abstract

In a recidivism prediction context, there is no consensus on which modeling strategy should be followed for obtaining an optimal prediction model. In previous papers, a range of statistical and machine learning techniques were benchmarked on recidivism data with a binary outcome. However, two important tree ensemble methods, namely gradient boosting and random forests were not extensively evaluated. In this paper, we further explore the modeling potential of these techniques in the binary outcome criminal prediction context. Additionally, we explore the predictive potential of classical statistical and machine learning methods for censored time-to-event data. A range of statistical manually specified statistical and (semi-)automatic machine learning models is fitted on Dutch recidivism data, both for the binary outcome case and censored outcome case. To enhance generalizability of results, the same models are applied to two historical American data sets, the North Carolina prison data. For all datasets, (semi-) automatic modeling in the binary case seems to provide no improvement over an appropriately manually specified traditional statistical model. There is however evidence of slightly improved performance of gradient boosting in survival data. Results on the reconviction data from two sources suggest that both statistical and machine learning should be tried out for obtaining an optimal model. Even if a flexible black-box model does not improve upon the predictions of a manually specified model, it can serve as a test whether important interactions are missing or other misspecification of the model are present and can thus provide more security in the modeling process.

摘要

在累犯预测的背景下,对于应该采用哪种建模策略来获得最优的预测模型,尚未达成共识。在之前的论文中,已经对基于二分类结局的累犯数据使用了一系列统计学和机器学习技术进行了基准测试。然而,两种重要的树集成方法,即梯度提升和随机森林,尚未得到广泛评估。在本文中,我们进一步探索了这些技术在二分类结局的犯罪预测背景下的建模潜力。此外,我们还探索了经典统计学和机器学习方法在删失时间事件数据中的预测潜力。在荷兰累犯数据中,针对二分类结局和删失结局情况,拟合了一系列统计学上手动指定的统计模型和(半)自动机器学习模型。为了提高结果的可推广性,我们将相同的模型应用于两个历史上的美国数据集,即北卡罗来纳州监狱数据。对于所有数据集,(半)自动建模在二分类情况下似乎并没有优于适当手动指定的传统统计模型。然而,在生存数据中,梯度提升的性能略有提高。来自两个来源的再犯数据的结果表明,应该尝试使用统计学和机器学习来获得最优模型。即使灵活的黑盒模型不能提高手动指定模型的预测效果,它也可以作为检验是否缺少重要交互作用或模型存在其他误设定的一种手段,从而为建模过程提供更多的保障。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/17da72abce48/pone.0213245.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/27e7cc7c0f83/pone.0213245.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/5a038d6072a9/pone.0213245.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/f287227c09ca/pone.0213245.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/35455e6ec2ed/pone.0213245.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/4345ec474089/pone.0213245.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/88b063ea644a/pone.0213245.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/6f93bd87633e/pone.0213245.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/4d91b682bcfa/pone.0213245.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/0ec50d484394/pone.0213245.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/91392c9e0951/pone.0213245.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/0a81900e5931/pone.0213245.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/9bc082c58f25/pone.0213245.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/48231c3c5c6b/pone.0213245.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/0393d0e93b05/pone.0213245.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/fbb0423b3f6f/pone.0213245.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/bb74c641d198/pone.0213245.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/5a67e394af85/pone.0213245.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/262325393b93/pone.0213245.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/3cdaaa33735b/pone.0213245.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/17da72abce48/pone.0213245.g020.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/27e7cc7c0f83/pone.0213245.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/5a038d6072a9/pone.0213245.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/f287227c09ca/pone.0213245.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/35455e6ec2ed/pone.0213245.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/4345ec474089/pone.0213245.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/88b063ea644a/pone.0213245.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/6f93bd87633e/pone.0213245.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/4d91b682bcfa/pone.0213245.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/0ec50d484394/pone.0213245.g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/91392c9e0951/pone.0213245.g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/0a81900e5931/pone.0213245.g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/9bc082c58f25/pone.0213245.g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/48231c3c5c6b/pone.0213245.g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/0393d0e93b05/pone.0213245.g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/fbb0423b3f6f/pone.0213245.g015.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/bb74c641d198/pone.0213245.g016.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/5a67e394af85/pone.0213245.g017.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/262325393b93/pone.0213245.g018.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/3cdaaa33735b/pone.0213245.g019.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b17/6407787/17da72abce48/pone.0213245.g020.jpg

相似文献

1
Optimizing predictive performance of criminal recidivism models using registration data with binary and survival outcomes.利用二项和生存结局的登记数据优化犯罪再犯预测模型的预测性能。
PLoS One. 2019 Mar 8;14(3):e0213245. doi: 10.1371/journal.pone.0213245. eCollection 2019.
2
Can Predictive Modeling Tools Identify Patients at High Risk of Prolonged Opioid Use After ACL Reconstruction?预测模型工具能否识别 ACL 重建术后阿片类药物使用时间延长的高风险患者?
Clin Orthop Relat Res. 2020 Jul;478(7):0-1618. doi: 10.1097/CORR.0000000000001251.
3
Machine Learning and Criminal Justice: A Systematic Review of Advanced Methodology for Recidivism Risk Prediction.机器学习与刑事司法:累犯风险预测先进方法的系统综述。
Int J Environ Res Public Health. 2022 Aug 25;19(17):10594. doi: 10.3390/ijerph191710594.
4
Survival prediction models: an introduction to discrete-time modeling.生存预测模型:离散时间建模简介。
BMC Med Res Methodol. 2022 Jul 26;22(1):207. doi: 10.1186/s12874-022-01679-6.
5
Predicting Sexual Recidivism.预测性再犯
Sex Abuse. 2020 Jun;32(4):375-399. doi: 10.1177/1079063219852944. Epub 2019 Jun 6.
6
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
7
Relational caring and contact after treatment. An evaluation study on criminal recidivism.治疗后的关系性关怀与接触。一项关于犯罪再犯的评估研究。
Int J Law Psychiatry. 2018 Sep-Oct;60:45-50. doi: 10.1016/j.ijlp.2018.07.011. Epub 2018 Jul 25.
8
A machine learning approach to small area estimation: predicting the health, housing and well-being of the population of Netherlands.一种用于小区域估计的机器学习方法:预测荷兰人口的健康、住房和福祉。
Int J Health Geogr. 2022 Jun 6;21(1):4. doi: 10.1186/s12942-022-00304-5.
9
Development, Validation, and Evaluation of a Simple Machine Learning Model to Predict Cirrhosis Mortality.开发、验证和评估一种简单的机器学习模型以预测肝硬化死亡率。
JAMA Netw Open. 2020 Nov 2;3(11):e2023780. doi: 10.1001/jamanetworkopen.2020.23780.
10
Predicting hospitalization following psychiatric crisis care using machine learning.运用机器学习预测精神科危机护理后的住院情况。
BMC Med Inform Decis Mak. 2020 Dec 10;20(1):332. doi: 10.1186/s12911-020-01361-1.

引用本文的文献

1
Editorial: Applications of artificial intelligence in forensic mental health: opportunities and challenges.社论:人工智能在法医精神健康中的应用:机遇与挑战
Front Psychiatry. 2024 Jun 12;15:1435219. doi: 10.3389/fpsyt.2024.1435219. eCollection 2024.
2
Beyond Discrimination: Generative AI Applications and Ethical Challenges in Forensic Psychiatry.超越歧视:生成式人工智能在法医精神病学中的应用及伦理挑战
Front Psychiatry. 2024 Mar 8;15:1346059. doi: 10.3389/fpsyt.2024.1346059. eCollection 2024.
3
Boosting ridge for the extreme learning machine globally optimised for classification and regression problems.

本文引用的文献

1
A general approach for predicting the behavior of the Supreme Court of the United States.一种预测美国最高法院行为的通用方法。
PLoS One. 2017 Apr 12;12(4):e0174698. doi: 10.1371/journal.pone.0174698. eCollection 2017.
2
Combining parametric, semi-parametric, and non-parametric survival models with stacked survival models.将参数化、半参数化和非参数化生存模型与堆叠生存模型相结合。
Biostatistics. 2015 Jul;16(3):537-49. doi: 10.1093/biostatistics/kxv001. Epub 2015 Feb 5.
3
Predicting juvenile offending: a comparison of data mining methods.
用于针对分类和回归问题进行全局优化的极限学习机的增强岭算法。
Sci Rep. 2023 Jul 21;13(1):11809. doi: 10.1038/s41598-023-38948-3.
4
A Bayesian Mixture Cure Rate Model for Estimating Short-Term and Long-Term Recidivism.一种用于估计短期和长期再犯率的贝叶斯混合治愈率模型。
Entropy (Basel). 2022 Dec 28;25(1):56. doi: 10.3390/e25010056.
5
Machine Learning and Criminal Justice: A Systematic Review of Advanced Methodology for Recidivism Risk Prediction.机器学习与刑事司法:累犯风险预测先进方法的系统综述。
Int J Environ Res Public Health. 2022 Aug 25;19(17):10594. doi: 10.3390/ijerph191710594.
6
A model for predicting court decisions on child custody.预测子女抚养权法院判决的模型。
PLoS One. 2021 Oct 21;16(10):e0258993. doi: 10.1371/journal.pone.0258993. eCollection 2021.
7
Integrating gut microbiome and host immune markers to understand the pathogenesis of infection.整合肠道微生物组和宿主免疫标志物以了解感染的发病机制。
Gut Microbes. 2021 Jan-Dec;13(1):1-18. doi: 10.1080/19490976.2021.1935186.
8
Assessing Risk Among Correctional Community Probation Populations: Predicting Reoffense With Mobile Neurocognitive Assessment Software.评估惩教社区缓刑人群的风险:使用移动神经认知评估软件预测再次犯罪
Front Psychol. 2020 Jan 24;10:2926. doi: 10.3389/fpsyg.2019.02926. eCollection 2019.
预测青少年犯罪:数据挖掘方法的比较
Int J Offender Ther Comp Criminol. 2013 Feb;57(2):191-207. doi: 10.1177/0306624X11431132. Epub 2011 Dec 12.
4
Support vector methods for survival analysis: a comparison between ranking and regression approaches.支持向量方法在生存分析中的应用:排序和回归方法的比较。
Artif Intell Med. 2011 Oct;53(2):107-18. doi: 10.1016/j.artmed.2011.06.006. Epub 2011 Aug 6.
5
Risk factors for overall recidivism and severity of recidivism in serious juvenile offenders.严重少年犯总体再犯及再犯严重程度的风险因素。
Int J Offender Ther Comp Criminol. 2011 Feb;55(1):118-35. doi: 10.1177/0306624X09356683. Epub 2010 Feb 24.
6
The performance of risk prediction models.风险预测模型的性能。
Biom J. 2008 Aug;50(4):457-79. doi: 10.1002/bimj.200810443.
7
Consistent estimation of the expected Brier score in general survival models with right-censored event times.在具有右删失事件时间的一般生存模型中对预期Brier评分进行一致估计。
Biom J. 2006 Dec;48(6):1029-40. doi: 10.1002/bimj.200610301.
8
Model-based boosting in high dimensions.高维空间中基于模型的提升算法
Bioinformatics. 2006 Nov 15;22(22):2828-9. doi: 10.1093/bioinformatics/btl462. Epub 2006 Aug 29.
9
Survival ensembles.生存集成法。
Biostatistics. 2006 Jul;7(3):355-73. doi: 10.1093/biostatistics/kxj011. Epub 2005 Dec 12.
10
Improvement of breast cancer relapse prediction in high risk intervals using artificial neural networks.使用人工神经网络改善高危间期乳腺癌复发预测
Breast Cancer Res Treat. 2005 Dec;94(3):265-72. doi: 10.1007/s10549-005-9013-y.