Suppr超能文献

评估用于预测老年精神病患者谵妄的机器学习方法的性能和可解释性。

Evaluating Performance and Interpretability of Machine Learning Methods for Predicting Delirium in Gerontopsychiatric Patients.

作者信息

Netzer Michael, Hackl Werner O, Schaller Michael, Alber Lisa, Marksteiner Josef, Ammenwerth Elske

机构信息

Institute of Medical Informatics, Private University for Health Sciences, Medical Informatics and Technology, Hall in Tirol, Austria.

Nursing Management Board, Tyrolean Federal Hospitals, Innsbruck, Austria.

出版信息

Stud Health Technol Inform. 2020 Jun 23;271:121-128. doi: 10.3233/SHTI200087.

Abstract

Delirium is an acute mental disturbance that particularly occurs during hospital stay. Current clinical assessment instruments include the Delirium Observation Screening Scale (DOSS) or the Confusion Assessment Method (CAM). The aim of this work is to analyze the performance of machine learning approaches to detect delirium based on DOSS and CAM information obtained from two geropsychiatric wards in Tyrol. From a machine learning perspective, the questions of these two assessment instruments represent the features and the ICD 10 diagnoses of delirium (yes/no) is the corresponding class variable. We compare seven popular classification methods and analyze the performance and interpretability of the learning models. As our dataset is highly imbalanced, we also evaluate the effect of common sampling methods including down- and up-sampling methods as well as hybrid methods. Our results indicate a high predictive ability of advanced methods such as Random Forest that can handle even unbalanced datasets. Overall, combining a good performance of a prediction model with the ability of users to understand the prediction is challenging. However, for clinical application in fully electronic settings, a good performance seems to be more important than an easy interpretation of the prediction by the user. On the other hand, explanations of decisions are often needed to assess other criteria such as safety.

摘要

谵妄是一种急性精神障碍,尤其在住院期间发生。当前的临床评估工具包括谵妄观察筛查量表(DOSS)或混乱评估方法(CAM)。这项工作的目的是分析基于从蒂罗尔州两个老年精神科病房获得的DOSS和CAM信息,使用机器学习方法检测谵妄的性能。从机器学习的角度来看,这两种评估工具的问题代表特征,而谵妄的ICD - 10诊断(是/否)是相应的类别变量。我们比较了七种流行的分类方法,并分析了学习模型的性能和可解释性。由于我们的数据集高度不平衡,我们还评估了常见采样方法的效果,包括下采样和上采样方法以及混合方法。我们的结果表明,诸如随机森林等先进方法具有很高的预测能力,甚至可以处理不平衡的数据集。总体而言,将预测模型的良好性能与用户理解预测的能力相结合具有挑战性。然而,对于全电子环境中的临床应用,良好的性能似乎比用户对预测的易于解释更为重要。另一方面,通常需要决策解释来评估其他标准,如安全性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验