Suppr超能文献

构建用于从养老机构用药事件报告中提取多个事件因素的多标签分类器:自然语言处理方法

Construction of a Multi-Label Classifier for Extracting Multiple Incident Factors From Medication Incident Reports in Residential Care Facilities: Natural Language Processing Approach.

作者信息

Kizaki Hayato, Satoh Hiroki, Ebara Sayaka, Watabe Satoshi, Sawada Yasufumi, Imai Shungo, Hori Satoko

机构信息

Division of Drug Informatics, Keio University Faculty of Pharmacy, Tokyo, Japan.

Graduate School of Pharmaceutical Sciences, The University of Tokyo, Tokyo, Japan.

出版信息

JMIR Med Inform. 2024 Jul 23;12:e58141. doi: 10.2196/58141.

Abstract

BACKGROUND

Medication safety in residential care facilities is a critical concern, particularly when nonmedical staff provide medication assistance. The complex nature of medication-related incidents in these settings, coupled with the psychological impact on health care providers, underscores the need for effective incident analysis and preventive strategies. A thorough understanding of the root causes, typically through incident-report analysis, is essential for mitigating medication-related incidents.

OBJECTIVE

We aimed to develop and evaluate a multilabel classifier using natural language processing to identify factors contributing to medication-related incidents using incident report descriptions from residential care facilities, with a focus on incidents involving nonmedical staff.

METHODS

We analyzed 2143 incident reports, comprising 7121 sentences, from residential care facilities in Japan between April 1, 2015, and March 31, 2016. The incident factors were annotated using sentences based on an established organizational factor model and previous research findings. The following 9 factors were defined: procedure adherence, medicine, resident, resident family, nonmedical staff, medical staff, team, environment, and organizational management. To assess the label criteria, 2 researchers with relevant medical knowledge annotated a subset of 50 reports; the interannotator agreement was measured using Cohen κ. The entire data set was subsequently annotated by 1 researcher. Multiple labels were assigned to each sentence. A multilabel classifier was developed using deep learning models, including 2 Bidirectional Encoder Representations From Transformers (BERT)-type models (Tohoku-BERT and a University of Tokyo Hospital BERT pretrained with Japanese clinical text: UTH-BERT) and an Efficiently Learning Encoder That Classifies Token Replacements Accurately (ELECTRA), pretrained on Japanese text. Both sentence- and report-level training were performed; the performance was evaluated by the F-score and exact match accuracy through 5-fold cross-validation.

RESULTS

Among all 7121 sentences, 1167, 694, 2455, 23, 1905, 46, 195, 1104, and 195 included "procedure adherence," "medicine," "resident," "resident family," "nonmedical staff," "medical staff," "team," "environment," and "organizational management," respectively. Owing to limited labels, "resident family" and "medical staff" were omitted from the model development process. The interannotator agreement values were higher than 0.6 for each label. A total of 10, 278, and 1855 reports contained no, 1, and multiple labels, respectively. The models trained using the report data outperformed those trained using sentences, with macro F-scores of 0.744, 0.675, and 0.735 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. The report-trained models also demonstrated better exact match accuracy, with 0.411, 0.389, and 0.399 for Tohoku-BERT, UTH-BERT, and ELECTRA, respectively. Notably, the accuracy was consistent even when the analysis was confined to reports containing multiple labels.

CONCLUSIONS

The multilabel classifier developed in our study demonstrated potential for identifying various factors associated with medication-related incidents using incident reports from residential care facilities. Thus, this classifier can facilitate prompt analysis of incident factors, thereby contributing to risk management and the development of preventive strategies.

摘要

背景

在寄宿护理机构中,用药安全是一个至关重要的问题,尤其是当非医疗人员提供用药协助时。这些环境中与用药相关事件的复杂性,再加上对医护人员的心理影响,凸显了进行有效事件分析和预防策略的必要性。通常通过事件报告分析来深入了解根本原因,这对于减少与用药相关的事件至关重要。

目的

我们旨在开发并评估一种使用自然语言处理的多标签分类器,以利用寄宿护理机构的事件报告描述来识别导致用药相关事件的因素,重点关注涉及非医疗人员的事件。

方法

我们分析了2015年4月1日至2016年3月31日期间日本寄宿护理机构的2143份事件报告,共7121个句子。根据既定的组织因素模型和先前的研究结果,使用句子对事件因素进行注释。定义了以下9个因素:程序依从性、药物、居民、居民家属、非医疗人员、医疗人员、团队、环境和组织管理。为了评估标签标准,2名具有相关医学知识的研究人员对50份报告的子集进行了注释;使用科恩κ系数测量注释者间的一致性。随后由1名研究人员对整个数据集进行注释。每个句子被赋予多个标签。使用深度学习模型开发了一个多标签分类器,包括2种基于变换器的双向编码器表示(BERT)模型(东北BERT和东京大学医院用日本临床文本预训练的BERT:UTH-BERT)以及一个在日语文本上预训练的准确分类令牌替换的高效学习编码器(ELECTRA)。进行了句子级和报告级的训练;通过5折交叉验证,使用F分数和精确匹配准确率评估性能。

结果

在所有7121个句子中,分别有1167、694、2455、23、1905、46、195、1104和195个句子包含“程序依从性”“药物”“居民”“居民家属”“非医疗人员”“医疗人员”“团队”“环境”和“组织管理”。由于标签有限,“居民家属”和“医疗人员”被排除在模型开发过程之外。每个标签的注释者间一致性值均高于0.6。分别有10份、278份和1855份报告没有标签、有1个标签和有多个标签。使用报告数据训练的模型优于使用句子训练的模型,东北BERT、UTH-BERT和ELECTRA的宏观F分数分别为0.744、0.675和0.735。报告训练的模型在精确匹配准确率方面也表现更好,东北BERT、UTH-BERT和ELECTRA的精确匹配准确率分别为0.411、0.389和0.399。值得注意的是,即使仅分析包含多个标签的报告,准确率也是一致的。

结论

我们研究中开发的多标签分类器显示出利用寄宿护理机构的事件报告识别与用药相关事件的各种因素的潜力。因此,该分类器有助于迅速分析事件因素,从而有助于风险管理和预防策略的制定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9645/11303886/6e373ec86b33/medinform_v12i1e58141_fig1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验