Benchmarking Interpretability in Healthcare Using Pattern Discovery and Disentanglement.

Authors

Zhou Pei-Yuan, Takeuchi Amane, Martinez-Lopez Fernando, Ehghaghi Malikeh, Wong Andrew K C, Lee En-Shiun Annie

Affiliations

System Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada.

Department of Computer Science, University of Toronto, Toronto, ON M5S 1A1, Canada.

Publication information

Bioengineering (Basel). 2025 Mar 18;12(3):308. doi: 10.3390/bioengineering12030308.

DOI: 10.3390/bioengineering12030308
PMID: 40150773
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11939797/
Abstract

The healthcare industry seeks to integrate AI into clinical applications, yet understanding AI decision making remains a challenge for healthcare practitioners because these systems often function as black boxes. Our work benchmarks the Pattern Discovery and Disentanglement (PDD) system's unsupervised learning algorithm, which provides interpretable outputs and clustering results from clinical notes to aid decision making. Using the MIMIC-IV dataset, we process free-text clinical notes and ICD-9 codes with Term Frequency-Inverse Document Frequency and Topic Modeling. The PDD algorithm discretizes numerical features into event-based features, discovers association patterns from a disentangled statistical feature value association space, and clusters clinical records. The output is an interpretable knowledge base linking knowledge, patterns, and data to support decision making. Despite being unsupervised, PDD demonstrated performance comparable to supervised deep learning models, validating its clustering ability and knowledge representation. We benchmark interpretability techniques (Feature Permutation, Gradient SHAP, and Integrated Gradients) on the best-performing models (in terms of F1, ROC AUC, balanced accuracy, etc.), evaluating them on sufficiency, comprehensiveness, and sensitivity metrics. Our findings highlight the limitations of feature importance ranking and post hoc analysis for clinical diagnosis. Meanwhile, PDD's global interpretability effectively compensates for these issues, helping healthcare practitioners understand the decision-making process and providing suggestive clusters of diseases to assist their diagnosis.
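
The preprocessing steps named in the abstract (TF-IDF weighting of note text, then discretizing numerical features into event-based features) can be illustrated with a minimal sketch. This is not the PDD implementation. The toy notes, the smoothed IDF form ln((1+N)/(1+df)) + 1, the equal-frequency binning rule, the bin labels, and the function names tf_idf, equal_frequency_bins, and to_events are all assumptions made for illustration; the abstract does not specify any of them.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Assumption: weight = (term count / doc length) * (ln((1+N)/(1+df)) + 1).
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total)
            * (math.log((1 + n_docs) / (1 + df[term])) + 1.0)
            for term, count in counts.items()
        })
    return weights


def equal_frequency_bins(values, n_bins):
    """Assign each value a bin index in 0..n_bins-1 using rank-based cut points.

    Equal values always share a bin, so heavy ties can make bins uneven.
    """
    ordered = sorted(values)
    n = len(ordered)
    cuts = [ordered[(k * n) // n_bins] for k in range(1, n_bins)]
    # A value's bin index is the number of cut points it reaches or exceeds.
    return [sum(v >= c for c in cuts) for v in values]


def to_events(name, values, labels=("low", "high")):
    """Turn a numeric feature column into categorical 'name=label' events."""
    bins = equal_frequency_bins(values, n_bins=len(labels))
    return [f"{name}={labels[b]}" for b in bins]


# Hypothetical toy notes, not drawn from MIMIC-IV.
notes = [
    "chest pain and shortness of breath",
    "chest pain radiating to left arm",
    "fever cough and shortness of breath",
    "fever and productive cough",
]
weights = tf_idf(notes)
# One numeric feature column: the TF-IDF weight of "chest" in each note.
chest_column = [w.get("chest", 0.0) for w in weights]
events = to_events("chest", chest_column)
print(events)
```

In this sketch each clinical record ends up described by categorical events such as "chest=high", which is the kind of input an association-pattern method could consume. How PDD itself chooses bins and builds its association space is described in the cited PDD papers, not here.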

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35bc/11939797/7ec8323744b6/bioengineering-12-00308-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35bc/11939797/29854c50a913/bioengineering-12-00308-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35bc/11939797/64787a7338de/bioengineering-12-00308-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/35bc/11939797/6becff9af1ab/bioengineering-12-00308-g003.jpg

Similar articles

1
Benchmarking Interpretability in Healthcare Using Pattern Discovery and Disentanglement.
Bioengineering (Basel). 2025 Mar 18;12(3):308. doi: 10.3390/bioengineering12030308.
2
An Unsupervised Error Detection Methodology for Detecting Mislabels in Healthcare Analytics.
Bioengineering (Basel). 2024 Jul 31;11(8):770. doi: 10.3390/bioengineering11080770.
3
Interpretability and fairness evaluation of deep learning models on MIMIC-IV dataset.
Sci Rep. 2022 May 3;12(1):7166. doi: 10.1038/s41598-022-11012-2.
4
Theory and rationale of interpretable all-in-one pattern discovery and disentanglement system.
NPJ Digit Med. 2023 May 22;6(1):92. doi: 10.1038/s41746-023-00816-9.
5
Responsible AI for cardiovascular disease detection: Towards a privacy-preserving and interpretable model.
Comput Methods Programs Biomed. 2024 Sep;254:108289. doi: 10.1016/j.cmpb.2024.108289. Epub 2024 Jun 17.
6
Explanation and prediction of clinical data with imbalanced class distribution based on pattern discovery and disentanglement.
BMC Med Inform Decis Mak. 2021 Jan 9;21(1):16. doi: 10.1186/s12911-020-01356-y.
7
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.
BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.
8
DeepXplainer: An interpretable deep learning based approach for lung cancer detection using explainable artificial intelligence.
Comput Methods Programs Biomed. 2024 Jan;243:107879. doi: 10.1016/j.cmpb.2023.107879. Epub 2023 Oct 24.
9
Non-invasive Prediction of Lymph Node Metastasis in NSCLC Using Clinical, Radiomics, and Deep Learning Features From 18F-FDG PET/CT Based on Interpretable Machine Learning.
Acad Radiol. 2025 Mar;32(3):1645-1655. doi: 10.1016/j.acra.2024.11.037. Epub 2024 Dec 10.
10
Topic Modeling for Interpretable Text Classification From EHRs.
Front Big Data. 2022 May 4;5:846930. doi: 10.3389/fdata.2022.846930. eCollection 2022.
