• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

开发一个分析流程,使用优化的预测算法对患者安全事件报告进行分类。

Developing an Analytical Pipeline to Classify Patient Safety Event Reports Using Optimized Predictive Algorithms.

机构信息

Partnership for Health IT Patient Safety, ECRI, Plymouth Meeting, Pennsylvania, United States.

出版信息

Methods Inf Med. 2021 Dec;60(5-06):147-161. doi: 10.1055/s-0041-1735620. Epub 2021 Oct 31.

DOI:10.1055/s-0041-1735620
PMID:34719010
Abstract

BACKGROUND

Patient safety event reports provide valuable insight into systemic safety issues but deriving insights from these reports requires computational tools to efficiently parse through large volumes of qualitative data. Natural language processing (NLP) combined with predictive learning provides an automated approach to evaluating these data and supporting the work of patient safety analysts.

OBJECTIVES

The objective of this study was to use NLP and machine learning techniques to develop a generalizable, scalable, and reliable approach to classifying event reports for the purpose of driving improvements in the safety and quality of patient care.

METHODS

Datasets for 14 different labels (themes) were vectorized using a bag-of-words, , or document embeddings approach and then applied to a series of classification algorithms via a hyperparameter grid search to derive an optimized model. Reports were also analyzed for terms strongly associated with each theme using an adjusted F-score calculation.

RESULTS

F score for each optimized model ranged from 0.951 ("Fall") to 0.544 ("Environment"). The bag-of-words approach proved optimal for 12 of 14 labels, and the naïve Bayes algorithm performed best for nine labels. Linear support vector machine was demonstrated as optimal for three labels and XGBoost for four of the 14 labels. Labels with more distinctly associated terms performed better than less distinct themes, as shown by a Pearson's correlation coefficient of 0.634.

CONCLUSIONS

We were able to demonstrate an analytical pipeline that broadly applies NLP and predictive modeling to categorize patient safety reports from multiple facilities. This pipeline allows analysts to more rapidly identify and structure information contained in patient safety data, which can enhance the evaluation and the use of this information over time.

摘要

背景

患者安全事件报告为系统性安全问题提供了有价值的见解,但要从这些报告中获得见解,需要计算工具来高效地分析大量定性数据。自然语言处理 (NLP) 与预测学习相结合,为评估这些数据并支持患者安全分析师的工作提供了一种自动化方法。

目的

本研究的目的是使用 NLP 和机器学习技术开发一种通用、可扩展和可靠的方法来对事件报告进行分类,以提高患者护理的安全性和质量。

方法

使用词袋、或文档嵌入方法对 14 个不同标签(主题)的数据集进行矢量化,然后通过超参数网格搜索将其应用于一系列分类算法,以得出优化模型。还使用调整后的 F 分数计算分析了与每个主题强烈相关的报告。

结果

每个优化模型的 F 分数范围从 0.951(“跌倒”)到 0.544(“环境”)。对于 14 个标签中的 12 个,词袋方法被证明是最优的,朴素贝叶斯算法在 9 个标签中表现最好。线性支持向量机被证明对于 3 个标签是最优的,XGBoost 对于 14 个标签中的 4 个是最优的。与不太明显的主题相比,具有更明显关联术语的标签表现更好,Pearson 相关系数为 0.634。

结论

我们能够展示一个广泛应用 NLP 和预测建模来对来自多个设施的患者安全报告进行分类的分析管道。该管道允许分析师更快速地识别和构建患者安全数据中包含的信息,从而随着时间的推移增强对该信息的评估和使用。

相似文献

1
Developing an Analytical Pipeline to Classify Patient Safety Event Reports Using Optimized Predictive Algorithms.开发一个分析流程,使用优化的预测算法对患者安全事件报告进行分类。
Methods Inf Med. 2021 Dec;60(5-06):147-161. doi: 10.1055/s-0041-1735620. Epub 2021 Oct 31.
2
Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.老年人日常对话中的社会怀旧:使用自然语言处理和机器学习的自动检测。
J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.
3
A comparison of rule-based and machine learning approaches for classifying patient portal messages.基于规则和机器学习方法在患者门户消息分类中的比较。
Int J Med Inform. 2017 Sep;105:110-120. doi: 10.1016/j.ijmedinf.2017.06.004. Epub 2017 Jun 23.
4
Natural language processing and machine learning approaches for food categorization and nutrition quality prediction compared with traditional methods.与传统方法相比,用于食品分类和营养质量预测的自然语言处理和机器学习方法。
Am J Clin Nutr. 2023 Mar;117(3):553-563. doi: 10.1016/j.ajcnut.2022.11.022. Epub 2022 Dec 23.
5
Integrating Natural Language Processing and Machine Learning Algorithms to Categorize Oncologic Response in Radiology Reports.将自然语言处理和机器学习算法集成到放射学报告中的肿瘤反应分类中。
J Digit Imaging. 2018 Apr;31(2):178-184. doi: 10.1007/s10278-017-0027-x.
6
Automated Classification of Free-Text Radiology Reports: Using Different Feature Extraction Methods to Identify Fractures of the Distal Fibula.自动化自由文本放射学报告分类:使用不同的特征提取方法识别腓骨远端骨折。
Rofo. 2023 Aug;195(8):713-719. doi: 10.1055/a-2061-6562. Epub 2023 May 9.
7
A natural language processing approach to categorise contributing factors from patient safety event reports.一种自然语言处理方法,用于对患者安全事件报告中的促成因素进行分类。
BMJ Health Care Inform. 2023 May;30(1). doi: 10.1136/bmjhci-2022-100731.
8
Natural language processing of head CT reports to identify intracranial mass effect: CTIME algorithm.通过头部CT报告的自然语言处理识别颅内占位效应:CTIME算法
Am J Emerg Med. 2022 Jan;51:388-392. doi: 10.1016/j.ajem.2021.11.001. Epub 2021 Nov 9.
9
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
10
Development of machine learning and natural language processing algorithms for preoperative prediction and automated identification of intraoperative vascular injury in anterior lumbar spine surgery.开发机器学习和自然语言处理算法,用于在前路腰椎手术中进行术前预测和术中血管损伤的自动识别。
Spine J. 2021 Oct;21(10):1635-1642. doi: 10.1016/j.spinee.2020.04.001. Epub 2020 Apr 12.