Suppr超能文献

临床事件类型的自动分类

Automated Classification of Clinical Incident Types.

作者信息

Gupta Jaiprakash, Koprinska Irena, Patrick Jon

机构信息

School of Information Technologies, University of Sydney, Australia.

出版信息

Stud Health Technol Inform. 2015;214:87-93.

Abstract

We consider the task of automatic classification of clinical incident reports using machine learning methods. Our data consists of 5448 clinical incident reports collected from the Incident Information Management System used by 7 hospitals in the state of New South Wales in Australia. We evaluate the performance of four classification algorithms: decision tree, naïve Bayes, multinomial naïve Bayes and support vector machine. We initially consider 13 classes (incident types) that were then reduced to 12, and show that it is possible to build accurate classifiers. The most accurate classifier was the multinomial naïve Bayes achieving accuracy of 80.44% and AUC of 0.91. We also investigate the effect of class labelling by an ordinary clinician and an expert, and show that when the data is labelled by an expert the classification performance of all classifiers improves. We found that again the best classifier was multinomial naïve Bayes achieving accuracy of 81.32% and AUC of 0.97. Our results show that some classes in the Incident Information Management System such as Primary Care are not distinct and their removal can improve performance; some other classes such as Aggression Victim are easier to classify than others such as Behavior and Human Performance. In summary, we show that the classification performance can be improved by expert class labelling of the training data, removing classes that are not well defined and selecting appropriate machine learning classifiers.

摘要

我们考虑使用机器学习方法对临床事件报告进行自动分类的任务。我们的数据由从澳大利亚新南威尔士州7家医院使用的事件信息管理系统收集的5448份临床事件报告组成。我们评估了四种分类算法的性能:决策树、朴素贝叶斯、多项式朴素贝叶斯和支持向量机。我们最初考虑了13个类别(事件类型),然后减少到12个,并表明可以构建准确的分类器。最准确的分类器是多项式朴素贝叶斯,准确率达到80.44%,AUC为0.91。我们还研究了普通临床医生和专家进行类别标注的效果,结果表明,当数据由专家标注时,所有分类器的分类性能都会提高。我们发现,最好的分类器仍然是多项式朴素贝叶斯,准确率达到81.32%,AUC为0.97。我们的结果表明,事件信息管理系统中的一些类别,如初级护理,并不清晰,去除这些类别可以提高性能;其他一些类别,如攻击受害者,比行为和人员绩效等其他类别更容易分类。总之,我们表明,通过对训练数据进行专家类别标注、去除定义不明确的类别以及选择合适的机器学习分类器,可以提高分类性能。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验