Suppr超能文献

性侵犯案件中采集和检测样本数量的成本效益分析

A cost-effectiveness analysis of the number of samples to collect and test from a sexual assault.

机构信息

Graduate School of Business, Stanford University, Stanford, CA 94305.

Criminalistics Laboratory, San Francisco Police Department, San Francisco, CA 94158.

出版信息

Proc Natl Acad Sci U S A. 2020 Jun 16;117(24):13421-13427. doi: 10.1073/pnas.2001103117. Epub 2020 Jun 1.

Abstract

Although the backlog of untested sexual assault kits in the United States is starting to be addressed, many municipalities are opting for selective testing of samples within a kit, where only the most probative samples are tested. We use data from the San Francisco Police Department Criminalistics Laboratory, which tests all samples but also collects information on the samples flagged by sexual assault forensic examiners as most probative, to build a standard machine learning model that predicts (based on covariates gleaned from sexual assault kit questionnaires) which samples are most probative. This model is embedded within an optimization framework that selects which samples to test from each kit to maximize the Combined DNA Index System (CODIS) yield (i.e., the number of kits that generate at least one DNA profile for the criminal DNA database) subject to a budget constraint. Our analysis predicts that, relative to a policy that tests only the samples deemed probative by the sexual assault forensic examiners, the proposed policy increases the CODIS yield by 45.4% without increasing the cost. Full testing of all samples has a slightly lower cost-effectiveness than the selective policy based on forensic examiners, but more than doubles the yield. In over half of the sexual assaults, a sample was not collected during the forensic medical exam from the body location deemed most probative by the machine learning model. Our results suggest that electronic forensic records coupled with machine learning and optimization models could enhance the effectiveness of criminal investigations of sexual assaults.

摘要

尽管美国未检测的性侵犯工具包积压问题开始得到解决,但许多市政当局选择对工具包内的样本进行选择性检测,只检测最具证明力的样本。我们使用了来自旧金山警察局犯罪学实验室的数据,该实验室测试所有样本,但也收集了性侵犯法医检查人员标记为最具证明力的样本的信息,以构建一个标准的机器学习模型,该模型可以预测(根据从性侵犯工具包问卷中收集的协变量)哪些样本最具证明力。该模型嵌入在一个优化框架中,该框架从每个工具包中选择要测试的样本,以最大限度地提高联合 DNA 索引系统(CODIS)的产量(即,生成至少一个犯罪 DNA 数据库 DNA 图谱的工具包数量),同时受到预算限制。我们的分析预测,与仅测试性侵犯法医认为具有证明力的样本的政策相比,拟议的政策在不增加成本的情况下将 CODIS 的产量提高了 45.4%。对所有样本进行全面测试的成本效益略低于基于法医检查的选择性政策,但产量增加了一倍多。在一半以上的性侵犯案件中,在机器学习模型认为最具证明力的身体部位没有收集到法医检查的样本。我们的结果表明,电子法医记录加上机器学习和优化模型可以提高性侵犯犯罪调查的有效性。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验