

Opening the black box of artificial intelligence for clinical decision support: A study predicting stroke outcome.

Affiliations

Charité Lab for Artificial Intelligence in Medicine-CLAIM, Charité - Universitätsmedizin Berlin, Berlin, Germany.

Centre for Stroke Research Berlin, Charité - Universitätsmedizin Berlin, Berlin, Germany.

Publication information

PLoS One. 2020 Apr 6;15(4):e0231166. doi: 10.1371/journal.pone.0231166. eCollection 2020.

Abstract

State-of-the-art machine learning (ML) artificial intelligence methods are increasingly leveraged in clinical predictive modeling to provide clinical decision support systems to physicians. Modern ML approaches such as artificial neural networks (ANNs) and tree boosting often perform better than more traditional methods like logistic regression. On the other hand, these modern methods yield a limited understanding of the resulting predictions. However, in the medical domain, understanding of applied models is essential, in particular, when informing clinical decision support. Thus, in recent years, interpretability methods for modern ML methods have emerged to potentially allow explainable predictions paired with high performance. To our knowledge, we present in this work the first explainability comparison of two modern ML methods, tree boosting and multilayer perceptrons (MLPs), to traditional logistic regression methods using a stroke outcome prediction paradigm. Here, we used clinical features to predict a dichotomized 90 days post-stroke modified Rankin Scale (mRS) score. For interpretability, we evaluated clinical features' importance with regard to predictions using deep Taylor decomposition for MLP, Shapley values for tree boosting and model coefficients for logistic regression. With regard to performance as measured by Area under the Curve (AUC) values on the test dataset, all models performed comparably: Logistic regression AUCs were 0.83, 0.83, 0.81 for three different regularization schemes; tree boosting AUC was 0.81; MLP AUC was 0.83. Importantly, the interpretability analysis demonstrated consistent results across models by rating age and stroke severity consecutively amongst the most important predictive features. For less important features, some differences were observed between the methods. Our analysis suggests that modern machine learning methods can provide explainability which is compatible with domain knowledge interpretation and traditional method rankings. Future work should focus on replication of these findings in other datasets and further testing of different explainability methods.
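The abstract describes the modeling setup only at a high level. As a minimal, hypothetical sketch of such a comparison, the Python snippet below pairs an L2-regularized logistic regression (coefficients as importances) with gradient-boosted trees explained via TreeSHAP Shapley values, and compares test-set AUCs. The synthetic data, feature names, and hyperparameters are assumptions for illustration; this is not the authors' pipeline, and the deep Taylor decomposition analysis of the MLP is omitted.

import numpy as np
import shap
import xgboost as xgb
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for a clinical feature matrix and a dichotomized 90-day
# mRS target; the study used real registry data, not this simulation.
rng = np.random.default_rng(0)
n = 500
feature_names = ["age", "nihss_admission", "glucose", "systolic_bp"]  # assumed
X = rng.normal(size=(n, len(feature_names)))
y = (X[:, 0] + X[:, 1] + rng.normal(size=n) > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

# Logistic regression (L2-regularized): absolute standardized coefficients
# serve as the feature-importance ranking.
scaler = StandardScaler().fit(X_train)
logreg = LogisticRegression(penalty="l2", max_iter=1000)
logreg.fit(scaler.transform(X_train), y_train)
auc_lr = roc_auc_score(y_test, logreg.predict_proba(scaler.transform(X_test))[:, 1])
coef_importance = np.abs(logreg.coef_[0])

# Gradient-boosted trees: TreeSHAP Shapley values give per-sample feature
# attributions; the mean absolute SHAP value ranks global importance.
booster = xgb.XGBClassifier(n_estimators=200, max_depth=3, eval_metric="logloss")
booster.fit(X_train, y_train)
auc_xgb = roc_auc_score(y_test, booster.predict_proba(X_test)[:, 1])
shap_values = shap.TreeExplainer(booster).shap_values(X_test)
shap_importance = np.abs(shap_values).mean(axis=0)

print(f"logistic regression AUC: {auc_lr:.2f} | tree boosting AUC: {auc_xgb:.2f}")
for name, c, s in zip(feature_names, coef_importance, shap_importance):
    print(f"{name:>16}  |coef|={c:.3f}  mean|SHAP|={s:.3f}")

In this kind of setup, agreement between the coefficient ranking and the SHAP ranking for the top features would mirror the study's finding that age and stroke severity are rated most important across models.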


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0f5c/7135268/99c4adcd6b67/pone.0231166.g001.jpg
