Cynthia Rudin
Duke University.
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
Black box machine learning models are currently being used for high stakes decision-making throughout society, causing problems throughout healthcare, criminal justice, and in other domains. People have hoped that creating methods for explaining these black box models will alleviate some of these problems, but trying to explain black box models, rather than creating models that are interpretable in the first place, is likely to perpetuate bad practices and can potentially cause catastrophic harm to society. There is a way forward - it is to design models that are inherently interpretable. This manuscript clarifies the chasm between explaining black boxes and using inherently interpretable models, outlines several key reasons why explainable black boxes should be avoided in high-stakes decisions, identifies challenges to interpretable machine learning, and provides several example applications where interpretable models could potentially replace black box models in criminal justice, healthcare, and computer vision.
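The contrast the abstract draws can be made concrete with a minimal sketch (not from the paper; the dataset, model choices, and hyperparameters below are illustrative assumptions): a post-hoc "explanation" is typically a surrogate model fit to mimic a black box's outputs and may be unfaithful to how the black box actually decides, whereas an inherently interpretable model is itself the decision-making model.

# Illustrative sketch only; assumes scikit-learn and a toy dataset.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_breast_cancer(return_X_y=True)

# Black box: accurate but opaque.
black_box = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

# Post-hoc "explanation": a surrogate tree trained to mimic the black box's
# predictions; it approximates the black box rather than being the model itself.
surrogate = DecisionTreeClassifier(max_depth=3, random_state=0)
surrogate.fit(X, black_box.predict(X))

# Inherently interpretable alternative: a small tree trained directly on the
# labels, so the model people inspect is the model actually making decisions.
interpretable = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)
print(export_text(interpretable))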