Unmasking Clever Hans predictors and assessing what machines really learn.

Affiliations

Department of Video Coding & Analytics, Fraunhofer Heinrich Hertz Institute, Einsteinufer 37, 10587, Berlin, Germany.

Department of Electrical Engineering and Computer Science, Technische Universität Berlin, Marchstr. 23, 10587, Berlin, Germany.

Publication information

Nat Commun. 2019 Mar 11;10(1):1096. doi: 10.1038/s41467-019-08987-4.

DOI:10.1038/s41467-019-08987-4

PMID:30858366

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6411769/

Abstract

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly intelligent behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.
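As a rough illustration of the kind of analysis the abstract calls Spectral Relevance Analysis, the sketch below clusters per-input relevance heatmaps with a basic spectral method and uses the eigengap to suggest how many distinct prediction strategies are present. It is a minimal sketch under stated assumptions, not the authors' pipeline: the heatmaps are synthetic, the Gaussian affinity, the `sigma` value, the eigengap heuristic, and the function name `spectral_relevance_sketch` are all illustrative choices, and real heatmaps would come from whatever explanation method is applied to the model.

```python
import numpy as np


def spectral_relevance_sketch(heatmaps, sigma=1.0, max_clusters=5):
    """Spectral analysis of relevance heatmaps (illustrative, hypothetical helper).

    heatmaps: array of shape (n_samples, height, width).
    Returns the eigengap-based cluster count, the Laplacian eigenvalues,
    and the corresponding eigenvectors (columns).
    """
    X = heatmaps.reshape(len(heatmaps), -1).astype(float)

    # Pairwise squared Euclidean distances between flattened heatmaps.
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (X @ X.T), 0.0)

    # Gaussian affinity matrix (assumed choice), no self-loops.
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)

    # Symmetric normalized graph Laplacian: I - D^{-1/2} W D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    L = np.eye(len(X)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]

    # eigh returns eigenvalues in ascending order for symmetric matrices.
    eigvals, eigvecs = np.linalg.eigh(L)

    # Eigengap heuristic: the largest jump among the smallest eigenvalues
    # suggests the number of well-separated clusters.
    gaps = np.diff(eigvals[: max_clusters + 1])
    k = int(np.argmax(gaps)) + 1
    return k, eigvals, eigvecs


# Synthetic stand-in data: six heatmaps that focus on a bottom-left corner
# (for example, an image artifact) and six that focus on the image center.
rng = np.random.default_rng(0)
corner = np.zeros((8, 8))
corner[6:, :2] = 1.0
center = np.zeros((8, 8))
center[3:5, 3:5] = 1.0
heatmaps = np.stack([corner] * 6 + [center] * 6)
heatmaps = heatmaps + 0.01 * rng.standard_normal(heatmaps.shape)

k, eigvals, eigvecs = spectral_relevance_sketch(heatmaps)

# For the two-cluster case only, the sign of the second eigenvector
# gives a simple split; larger k would need k-means on the embedding.
labels = (eigvecs[:, 1] > 0).astype(int)
print("suggested number of strategies:", k)
print("two-way split:", labels)
```

On this toy input the eigengap should point to two groups, one per relevance pattern. A cluster whose heatmaps concentrate on something unrelated to the object of interest is the kind of candidate "Clever Hans" strategy that would then be inspected by hand, which is where the "semi-automated" part comes in.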

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e01d/6411769/780ce214a78e/41467_2019_8987_Fig1_HTML.jpg
