• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于文本的冒犯分类的分层方法。

Hierarchical approaches to Text-based Offense Classification.

机构信息

University of Michigan, Ann Arbor, MI, USA.

Measures for Justice, Rochester, NY, USA.

出版信息

Sci Adv. 2023 Mar 3;9(9):eabq8123. doi: 10.1126/sciadv.abq8123.

DOI:10.1126/sciadv.abq8123
PMID:36867702
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9984170/
Abstract

Researchers working with administrative crime data often must classify offense narratives into a common scheme for analysis purposes. No comprehensive standard currently exists, nor is there a mapping tool to transform raw descriptions into offense types. This paper introduces a new schema, the Uniform Crime Classification Standard (UCCS), and the Text-based Offense Classification (TOC) tool to address these shortcomings. The UCCS schema draws from existing efforts, aiming to better reflect offense severity and improve type disambiguation. The TOC tool is a machine learning algorithm that uses a hierarchical, multilayer perceptron classification framework, built on 313,209 hand-coded offense descriptions from 24 states, to translate raw descriptions into UCCS codes. We test how variations in data processing and modeling approaches affect recall, precision, and F1 scores to assess their relative influence on model performance. The code scheme and classification tool are collaborations between Measures for Justice and the Criminal Justice Administrative Records System.

摘要

研究人员在处理行政犯罪数据时,经常需要将犯罪叙述分类为通用方案,以便进行分析。目前没有全面的标准,也没有将原始描述转换为犯罪类型的映射工具。本文介绍了一种新的方案,即统一犯罪分类标准(UCCS)和基于文本的犯罪分类(TOC)工具,以解决这些问题。UCCS 方案借鉴了现有成果,旨在更好地反映犯罪严重程度并提高类型的明确性。TOC 工具是一种机器学习算法,它使用分层多层感知器分类框架,基于来自 24 个州的 313,209 个手工编码的犯罪描述,将原始描述转换为 UCCS 代码。我们测试了数据处理和建模方法的变化如何影响召回率、精度和 F1 分数,以评估它们对模型性能的相对影响。该代码方案和分类工具是 Measures for Justice 与刑事司法行政记录系统之间的合作成果。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/f34a8837c30e/sciadv.abq8123-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/fa4eb42a1ae9/sciadv.abq8123-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/ec69fbffca40/sciadv.abq8123-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/bd021937be83/sciadv.abq8123-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/610487202b7c/sciadv.abq8123-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/f0c10465f1fa/sciadv.abq8123-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/0218453333ef/sciadv.abq8123-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/f34a8837c30e/sciadv.abq8123-f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/fa4eb42a1ae9/sciadv.abq8123-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/ec69fbffca40/sciadv.abq8123-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/bd021937be83/sciadv.abq8123-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/610487202b7c/sciadv.abq8123-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/f0c10465f1fa/sciadv.abq8123-f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/0218453333ef/sciadv.abq8123-f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d0e/9984170/f34a8837c30e/sciadv.abq8123-f7.jpg

相似文献

1
Hierarchical approaches to Text-based Offense Classification.基于文本的冒犯分类的分层方法。
Sci Adv. 2023 Mar 3;9(9):eabq8123. doi: 10.1126/sciadv.abq8123.
2
Utilizing Text Mining, Data Linkage and Deep Learning in Police and Health Records to Predict Future Offenses in Family and Domestic Violence.利用警方和健康记录中的文本挖掘、数据关联和深度学习来预测家庭及家庭暴力中的未来犯罪行为。
Front Digit Health. 2021 Feb 17;3:602683. doi: 10.3389/fdgth.2021.602683. eCollection 2021.
3
Predicting offenses among individuals with psychiatric disorders - A machine learning approach.预测精神障碍个体的犯罪行为——一种机器学习方法。
J Psychiatr Res. 2021 Jun;138:146-154. doi: 10.1016/j.jpsychires.2021.03.026. Epub 2021 Mar 29.
4
Offense Narrative Roles of Turkish Offenders.土耳其罪犯的犯罪叙事角色。
Int J Offender Ther Comp Criminol. 2022 Sep;66(12):1237-1262. doi: 10.1177/0306624X211010285. Epub 2021 Apr 30.
5
Criminal justice measures for economic data harmonization in substance use disorder research.物质使用障碍研究中经济数据协调的刑事司法措施。
Health Justice. 2018 Sep 21;6(1):17. doi: 10.1186/s40352-018-0073-6.
6
Predicting COVID-19 Symptoms From Free Text in Medical Records Using Artificial Intelligence: Feasibility Study.使用人工智能从医疗记录中的自由文本预测新冠病毒疾病症状:可行性研究
JMIR Med Inform. 2022 Apr 27;10(4):e37771. doi: 10.2196/37771.
7
Supervised Machine Learning Algorithms Can Classify Open-Text Feedback of Doctor Performance With Human-Level Accuracy.监督式机器学习算法能够以人类水平的准确率对医生表现的开放式文本反馈进行分类。
J Med Internet Res. 2017 Mar 15;19(3):e65. doi: 10.2196/jmir.6533.
8
Rape Perpetrators on Trial: The Effect of Sexual Assault-Related Schemas on Attributions of Blame.受审的强奸犯:性侵犯相关图式对罪责归因的影响。
J Interpers Violence. 2019 Jan;34(2):310-336. doi: 10.1177/0886260516640777. Epub 2016 Mar 28.
9
The reliability and validity of the rating scale of criminal responsibility for mentally disordered offenders.精神障碍患者刑事责任评定量表的信度和效度。
Forensic Sci Int. 2014 Mar;236:146-50. doi: 10.1016/j.forsciint.2013.12.018. Epub 2014 Jan 13.
10
Natural Language Processing for Imaging Protocol Assignment: Machine Learning for Multiclass Classification of Abdominal CT Protocols Using Indication Text Data.基于自然语言处理的成像协议分配:使用指示文本数据进行多类分类的腹部 CT 协议的机器学习。
J Digit Imaging. 2022 Oct;35(5):1120-1130. doi: 10.1007/s10278-022-00633-8. Epub 2022 Jun 2.

引用本文的文献

1
Adverse Childhood Experiences: Increased Likelihood Of Socioeconomic Disadvantages For Young Adults.童年不良经历:年轻成年人面临社会经济劣势的可能性增加。
Health Aff (Millwood). 2025 Jan;44(1):108-116. doi: 10.1377/hlthaff.2024.00827.

本文引用的文献

1
The Criminal Justice Administrative Records System: A next-generation research data platform.刑事司法行政记录系统:下一代研究数据平台。
Sci Data. 2022 Sep 12;9(1):562. doi: 10.1038/s41597-022-01620-y.