• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

风险评分、标签偏差以及除了厨房水槽之外的一切。

Risk scores, label bias, and everything but the kitchen sink.

作者信息

Zanger-Tishler Michael, Nyarko Julian, Goel Sharad

机构信息

Sociology and Social Policy, Harvard University, Cambridge, MA 02138, USA.

Stanford Law School, Stanford, CA 94305, USA.

出版信息

Sci Adv. 2024 Mar 29;10(13):eadi8411. doi: 10.1126/sciadv.adi8411.

DOI:10.1126/sciadv.adi8411
PMID:38552013
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10980258/
Abstract

In designing risk assessment algorithms, many scholars promote a "kitchen sink" approach, reasoning that more information yields more accurate predictions. We show, however, that this rationale often fails when algorithms are trained to predict a proxy of the true outcome, for instance, predicting arrest as a proxy for criminal behavior. With this "label bias," one should exclude a feature if its correlation with the proxy and its correlation with the true outcome have opposite signs, conditional on the other model features. This criterion is often satisfied when a feature is weakly correlated with the true outcome, and, additionally, that feature and the true outcome are both direct causes of the proxy outcome. For example, criminal behavior and geography may be weakly correlated and, due to patterns of police deployment, direct causes of one's arrest record-suggesting that excluding geography in criminal risk assessment will weaken an algorithm's performance in predicting arrest but will improve its capacity to predict actual crime.

摘要

在设计风险评估算法时,许多学者推崇一种“一锅烩”的方法,理由是更多信息能带来更准确的预测。然而,我们发现,当算法被训练用于预测真实结果的替代指标时,例如将被捕作为犯罪行为的替代指标进行预测,这种基本原理往往会失效。存在这种“标签偏差”时,如果一个特征与替代指标的相关性以及与真实结果的相关性具有相反的符号,且以其他模型特征为条件,那么就应该排除该特征。当一个特征与真实结果弱相关,并且该特征和真实结果都是替代结果的直接原因时,这个标准通常会得到满足。例如,犯罪行为和地理位置可能弱相关,并且由于警察部署模式的原因,它们都是一个人被捕记录的直接原因——这表明在犯罪风险评估中排除地理位置会削弱算法预测被捕的性能,但会提高其预测实际犯罪的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/250de10e5f92/sciadv.adi8411-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/45e27b6166b1/sciadv.adi8411-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/b747077b6282/sciadv.adi8411-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/61caf87bae9b/sciadv.adi8411-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/250de10e5f92/sciadv.adi8411-f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/45e27b6166b1/sciadv.adi8411-f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/b747077b6282/sciadv.adi8411-f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/61caf87bae9b/sciadv.adi8411-f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6bcd/10980258/250de10e5f92/sciadv.adi8411-f4.jpg

相似文献

1
Risk scores, label bias, and everything but the kitchen sink.风险评分、标签偏差以及除了厨房水槽之外的一切。
Sci Adv. 2024 Mar 29;10(13):eadi8411. doi: 10.1126/sciadv.adi8411.
2
Cohort bias in predictive risk assessments of future criminal justice system involvement.队列偏差对未来刑事司法系统参与的预测性风险评估的影响。
Proc Natl Acad Sci U S A. 2023 Jun 6;120(23):e2301990120. doi: 10.1073/pnas.2301990120. Epub 2023 May 30.
3
Assessment and statistical modeling of the relationship between remotely sensed aerosol optical depth and PM2.5 in the eastern United States.美国东部地区遥感气溶胶光学厚度与PM2.5之间关系的评估及统计建模
Res Rep Health Eff Inst. 2012 May(167):5-83; discussion 85-91.
4
Police-initiated diversion for youth to prevent future delinquent behavior: a systematic review.警方发起的青少年分流措施以预防未来的犯罪行为:一项系统综述。
Campbell Syst Rev. 2018 Jun 1;14(1):1-88. doi: 10.4073/csr.2018.5. eCollection 2018.
5
Dynamics of crime activities in the network of city community areas.城市社区区域网络中犯罪活动的动态
Appl Netw Sci. 2019;4(1):127. doi: 10.1007/s41109-019-0239-8. Epub 2019 Dec 26.
6
"I Suck at Everything": Crime, Arrest, and the Generality of Failure.“我事事皆糟”:犯罪、被捕与失败的普遍性
Deviant Behav. 2016;37(8):837-851. doi: 10.1080/01639625.2016.1147809. Epub 2016 Mar 22.
7
Assessing the Proactive and Reactive Dimensions of Criminal Thought Process: Divergent Patterns of Correlation With Variable- and Person-Level Measures of Criminal Risk and Future Outcome.评估犯罪思维过程的主动性和反应性维度:与犯罪风险和未来结果的变量和个体水平测量的相关模式的差异。
J Pers Assess. 2020 Mar-Apr;102(2):223-230. doi: 10.1080/00223891.2018.1508469. Epub 2018 Sep 21.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Pre-arrest diversion to addiction treatment by law enforcement: protocol for the community-level policing initiative to reduce addiction-related harm, including crime.执法部门将逮捕前人员转介至成瘾治疗:减少成瘾相关危害(包括犯罪)的社区层面治安举措方案。
Health Justice. 2021 Mar 10;9(1):9. doi: 10.1186/s40352-021-00134-w.
10
A Game Theory Approach for Assessment of Risk and Deployment of Police Patrols in Response to Criminal Activity in San Francisco.一种用于评估旧金山应对犯罪活动的风险和部署警察巡逻的博弈论方法。
Risk Anal. 2020 Mar;40(3):534-549. doi: 10.1111/risa.13411. Epub 2019 Oct 1.

引用本文的文献

1
Navigating Fairness in AI-based Prediction Models: Theoretical Constructs and Practical Applications.在基于人工智能的预测模型中把握公平性:理论构建与实际应用
medRxiv. 2025 Mar 24:2025.03.24.25324500. doi: 10.1101/2025.03.24.25324500.
2
Guidance for unbiased predictive information for healthcare decision-making and equity (GUIDE): considerations when race may be a prognostic factor.医疗保健决策与公平性的无偏预测信息指南(GUIDE):种族可能成为预后因素时的考量
NPJ Digit Med. 2024 Oct 19;7(1):290. doi: 10.1038/s41746-024-01245-y.
3
Performance of Machine Learning Suicide Risk Models in an American Indian Population.

本文引用的文献

1
Designing equitable algorithms.设计公平的算法。
Nat Comput Sci. 2023 Jul;3(7):601-610. doi: 10.1038/s43588-023-00485-4. Epub 2023 Jul 24.
2
Using measures of race to make clinical predictions: Decision making, patient health, and fairness.种族指标在临床预测中的应用:决策、患者健康和公平性。
Proc Natl Acad Sci U S A. 2023 Aug 29;120(35):e2303370120. doi: 10.1073/pnas.2303370120. Epub 2023 Aug 22.
3
Measuring racial and ethnic disparities in traffic enforcement with large-scale telematics data.利用大规模远程信息处理数据衡量交通执法中的种族和族裔差异。
机器学习自杀风险模型在美国印第安人群体中的表现。
JAMA Netw Open. 2024 Oct 1;7(10):e2439269. doi: 10.1001/jamanetworkopen.2024.39269.
4
Race adjustments in clinical algorithms can help correct for racial disparities in data quality.临床算法中的种族调整可以帮助纠正数据质量方面的种族差异。
Proc Natl Acad Sci U S A. 2024 Aug 20;121(34):e2402267121. doi: 10.1073/pnas.2402267121. Epub 2024 Aug 13.
PNAS Nexus. 2022 Jul 30;1(4):pgac144. doi: 10.1093/pnasnexus/pgac144. eCollection 2022 Sep.
4
Patient-centered appraisal of race-free clinical risk assessment.以患者为中心的无种族临床风险评估评估
Health Econ. 2022 Oct;31(10):2109-2114. doi: 10.1002/hec.4569. Epub 2022 Jul 5.
5
An algorithmic approach to reducing unexplained pain disparities in underserved populations.一种减少服务不足人群中不明原因疼痛差异的算法方法。
Nat Med. 2021 Jan;27(1):136-140. doi: 10.1038/s41591-020-01192-7. Epub 2021 Jan 13.
6
The limits of human predictions of recidivism.人类对累犯预测的局限性。
Sci Adv. 2020 Feb 14;6(7):eaaz0652. doi: 10.1126/sciadv.aaz0652. eCollection 2020 Feb.
7
Dissecting racial bias in an algorithm used to manage the health of populations.剖析用于管理人群健康的算法中的种族偏见。
Science. 2019 Oct 25;366(6464):447-453. doi: 10.1126/science.aax2342.
8
The accuracy, fairness, and limits of predicting recidivism.预测累犯的准确性、公正性和局限性。
Sci Adv. 2018 Jan 17;4(1):eaao5580. doi: 10.1126/sciadv.aao5580. eCollection 2018 Jan.
9
Predictive Analytics for City Agencies: Lessons from Children's Services.面向城市机构的预测分析:儿童服务的经验教训。
Big Data. 2017 Sep;5(3):189-196. doi: 10.1089/big.2016.0052. Epub 2017 Aug 22.
10
Fair Prediction with Disparate Impact: A Study of Bias in Recidivism Prediction Instruments.公平预测与差异影响:累犯预测工具中的偏见研究。
Big Data. 2017 Jun;5(2):153-163. doi: 10.1089/big.2016.0047.