Linardatos Pantelis, Papastefanopoulos Vasilis, Kotsiantis Sotiris
Department of Mathematics, University of Patras, 26504 Patras, Greece.
Entropy (Basel). 2020 Dec 25;23(1):18. doi: 10.3390/e23010018.
Recent advances in artificial intelligence (AI) have led to its widespread industrial adoption, with machine learning systems demonstrating superhuman performance in a significant number of tasks. However, this surge in performance has often been achieved through increased model complexity, turning such systems into "black box" approaches and causing uncertainty regarding the way they operate and, ultimately, the way they come to decisions. This opacity has hindered the adoption of machine learning systems in sensitive yet critical domains, such as healthcare, where their value could be immense. As a result, scientific interest in the field of Explainable Artificial Intelligence (XAI), which is concerned with developing new methods to explain and interpret machine learning models, has been strongly reignited in recent years. This study focuses on machine learning interpretability methods; more specifically, it presents a literature review and taxonomy of these methods, along with links to their programming implementations, in the hope that this survey will serve as a reference point for both theorists and practitioners.
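To illustrate the kind of model-agnostic interpretability method this survey catalogues, the sketch below implements permutation feature importance: shuffle one feature at a time and measure the resulting drop in accuracy to estimate how much the model relies on that feature. The data, the `model_predict` function, and the helper names are hypothetical, introduced here purely for illustration; they do not come from the survey itself.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: the label depends only on feature 0, not on feature 1.
X = rng.normal(size=(500, 2))
y = (X[:, 0] > 0).astype(int)

def model_predict(X):
    # A fixed stand-in "model" that uses feature 0 only.
    return (X[:, 0] > 0).astype(int)

def permutation_importance(X, y, predict, n_repeats=10):
    """Mean accuracy drop when each feature is independently shuffled."""
    base_acc = np.mean(predict(X) == y)
    importances = []
    for j in range(X.shape[1]):
        drops = []
        for _ in range(n_repeats):
            Xp = X.copy()
            Xp[:, j] = rng.permutation(Xp[:, j])  # break feature j's link to y
            drops.append(base_acc - np.mean(predict(Xp) == y))
        importances.append(float(np.mean(drops)))
    return importances

imp = permutation_importance(X, y, model_predict)
print(imp)  # feature 0 gets a large importance; feature 1 stays at zero
```

Because the stand-in model ignores feature 1 entirely, permuting it cannot change any prediction, so its importance is exactly zero, while permuting feature 0 reduces accuracy to roughly chance level. Production implementations of this idea exist, for example, in scikit-learn's `sklearn.inspection.permutation_importance`.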