

Enhancing Bottleneck Concept Learning in Image Classification.

Authors

Cheng Xingfu, Niu Zhaofeng, Jiang Zhouqiang, Li Liangzhi

Affiliations

Computer Science Department, Qufu Normal University, Rizhao 276826, China.

Osaka University, Osaka 565-0871, Japan.

Publication

Sensors (Basel). 2025 Apr 10;25(8):2398. doi: 10.3390/s25082398.

Abstract

Deep neural networks (DNNs) have demonstrated exceptional performance in image classification. However, their "black-box" nature raises concerns about trust and transparency, particularly in high-stakes fields such as healthcare and autonomous systems. While explainable AI (XAI) methods attempt to address these concerns through feature- or concept-based explanations, existing approaches are often limited by the need for manually defined concepts, overly abstract granularity, or misalignment with human semantics. This paper introduces the Enhanced Bottleneck Concept Learner (E-BotCL), a self-supervised framework that autonomously discovers task-relevant, interpretable semantic concepts via a dual-path contrastive learning strategy and multi-task regularization. By combining contrastive learning to build robust concept prototypes, attention mechanisms for spatial localization, and feature aggregation to activate concepts, E-BotCL enables end-to-end concept learning and classification without requiring human supervision. Experiments conducted on the CUB200 and ImageNet datasets demonstrated that E-BotCL significantly enhanced interpretability while maintaining classification accuracy. Specifically, two interpretability metrics, the Concept Discovery Rate (CDR) and Concept Consistency (CC), improved by 0.6104 and 0.4486, respectively. This work advances the balance between model performance and transparency, offering a scalable solution for interpretable decision-making in complex vision tasks.
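The abstract does not include code, but its architectural description (concept prototypes, attention for spatial localization, feature aggregation into concept activations, and classification from the concept vector alone) can be illustrated with a short sketch. The following is a minimal, hypothetical PyTorch example of such a bottleneck concept layer; the class name, similarity scaling, sigmoid gating, and aggregation rule are assumptions for illustration, not the authors' E-BotCL implementation, and the dual-path contrastive learning and multi-task regularization losses are omitted.

```python
# Minimal illustrative sketch (NOT the authors' released code) of a bottleneck
# concept layer: learned concept prototypes attend over spatial backbone
# features, attention is aggregated into per-image concept activations, and a
# linear head classifies from the concept vector only.
import torch
import torch.nn as nn


class ConceptBottleneck(nn.Module):
    def __init__(self, feat_dim: int, num_concepts: int, num_classes: int):
        super().__init__()
        # Learnable concept prototypes, one feat_dim-dimensional vector per concept.
        self.prototypes = nn.Parameter(torch.randn(num_concepts, feat_dim) * 0.02)
        # The classifier sees only the k-dimensional concept activation vector.
        self.classifier = nn.Linear(num_concepts, num_classes)

    def forward(self, feats: torch.Tensor):
        # feats: (B, d, H, W) spatial feature map from any CNN/ViT backbone.
        b, d, h, w = feats.shape
        x = feats.flatten(2).transpose(1, 2)            # (B, HW, d)
        # Similarity of every spatial position to every concept prototype.
        sim = x @ self.prototypes.t() / d ** 0.5        # (B, HW, k)
        # Spatial attention per concept: where in the image the concept responds.
        attn = torch.softmax(sim, dim=1)                # (B, HW, k)
        # Aggregate attention-weighted evidence into one activation per concept.
        concept_act = (attn * torch.sigmoid(sim)).sum(dim=1)   # (B, k)
        logits = self.classifier(concept_act)
        attn_maps = attn.transpose(1, 2).reshape(b, -1, h, w)  # (B, k, H, W)
        return logits, concept_act, attn_maps


if __name__ == "__main__":
    # Random features stand in for a backbone output (e.g., 512-dim, 7x7 grid).
    model = ConceptBottleneck(feat_dim=512, num_concepts=20, num_classes=200)
    fake_feats = torch.randn(4, 512, 7, 7)
    logits, concepts, attn_maps = model(fake_feats)
    print(logits.shape, concepts.shape, attn_maps.shape)
    # torch.Size([4, 200]) torch.Size([4, 20]) torch.Size([4, 20, 7, 7])
```

The key design point this sketch reflects is that classification depends only on the low-dimensional concept activations, so each prediction can be traced back to per-concept attention maps over the image.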


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f3ca/12031560/8057715c2c00/sensors-25-02398-g001.jpg
