• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

挖掘临床记录中与跌倒相关的信息:基于规则和基于新颖词嵌入的机器学习方法的比较。

Mining fall-related information in clinical notes: Comparison of rule-based and novel word embedding-based machine learning approaches.

作者信息

Topaz Maxim, Murga Ludmila, Gaddis Katherine M, McDonald Margaret V, Bar-Bachar Ofrit, Goldberg Yoav, Bowles Kathryn H

机构信息

School of Nursing & Data Science Institute, Columbia University, New York, NY, USA; The Visiting Nurse Service of New York, New York, NY, USA.

Cheryl Spencer Department of Nursing, University of Haifa, Haifa, Israel.

出版信息

J Biomed Inform. 2019 Feb;90:103103. doi: 10.1016/j.jbi.2019.103103. Epub 2019 Jan 9.

DOI:10.1016/j.jbi.2019.103103
PMID:30639392
Abstract

BACKGROUND

Natural language processing (NLP) of health-related data is still an expertise demanding, and resource expensive process. We created a novel, open source rapid clinical text mining system called NimbleMiner. NimbleMiner combines several machine learning techniques (word embedding models and positive only labels learning) to facilitate the process in which a human rapidly performs text mining of clinical narratives, while being aided by the machine learning components.

OBJECTIVE

This manuscript describes the general system architecture and user Interface and presents results of a case study aimed at classifying fall-related information (including fall history, fall prevention interventions, and fall risk) in homecare visit notes.

METHODS

We extracted a corpus of homecare visit notes (n = 1,149,586) for 89,459 patients from a large US-based homecare agency. We used a gold standard testing dataset of 750 notes annotated by two human reviewers to compare the NimbleMiner's ability to classify documents regarding whether they contain fall-related information with a previously developed rule-based NLP system.

RESULTS

NimbleMiner outperformed the rule-based system in almost all domains. The overall F- score was 85.8% compared to 81% by the rule based-system with the best performance for identifying general fall history (F = 89% vs. F = 85.1% rule-based), followed by fall risk (F = 87% vs. F = 78.7% rule-based), fall prevention interventions (F = 88.1% vs. F = 78.2% rule-based) and fall within 2 days of the note date (F = 83.1% vs. F = 80.6% rule-based). The rule-based system achieved slightly better performance for fall within 2 weeks of the note date (F = 81.9% vs. F = 84% rule-based).

DISCUSSION & CONCLUSIONS: NimbleMiner outperformed other systems aimed at fall information classification, including our previously developed rule-based approach. These promising results indicate that clinical text mining can be implemented without the need for large labeled datasets necessary for other types of machine learning. This is critical for domains with little NLP developments, like nursing or allied health professions.

摘要

背景

对健康相关数据进行自然语言处理(NLP)仍然是一个需要专业知识且资源消耗大的过程。我们创建了一个名为NimbleMiner的新型开源快速临床文本挖掘系统。NimbleMiner结合了多种机器学习技术(词嵌入模型和仅正向标签学习),以促进人类在机器学习组件辅助下快速对临床叙述进行文本挖掘的过程。

目的

本文描述了该系统的总体架构和用户界面,并展示了一个案例研究的结果,该研究旨在对家庭护理访视记录中的跌倒相关信息(包括跌倒史、跌倒预防干预措施和跌倒风险)进行分类。

方法

我们从美国一家大型家庭护理机构提取了89459名患者的家庭护理访视记录语料库(n = 1149586)。我们使用了由两名人类审阅者标注的750条记录的金标准测试数据集,将NimbleMiner对文档是否包含跌倒相关信息进行分类的能力与之前开发的基于规则的NLP系统进行比较。

结果

NimbleMiner在几乎所有领域的表现都优于基于规则的系统。总体F值为85.8%,而基于规则的系统为81%,在识别一般跌倒史方面表现最佳(F = 89% 对比基于规则的F = 85.1%),其次是跌倒风险(F = 87% 对比基于规则的F = 78.7%)、跌倒预防干预措施(F = 88.1% 对比基于规则的F = 78.2%)以及记录日期后2天内的跌倒情况(F = 83.1% 对比基于规则的F = 80.6%)。基于规则的系统在记录日期后2周内的跌倒情况方面表现略好(F = 81.9% 对比基于规则的F = 84%)。

讨论与结论

NimbleMiner在跌倒信息分类方面的表现优于其他系统,包括我们之前开发的基于规则的方法。这些令人鼓舞的结果表明,临床文本挖掘无需其他类型机器学习所需的大量标注数据集即可实现。这对于像护理或相关健康专业等NLP发展较少的领域至关重要。

相似文献

1
Mining fall-related information in clinical notes: Comparison of rule-based and novel word embedding-based machine learning approaches.挖掘临床记录中与跌倒相关的信息:基于规则和基于新颖词嵌入的机器学习方法的比较。
J Biomed Inform. 2019 Feb;90:103103. doi: 10.1016/j.jbi.2019.103103. Epub 2019 Jan 9.
2
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
3
NimbleMiner: An Open-Source Nursing-Sensitive Natural Language Processing System Based on Word Embedding.NimbleMiner:一种基于词嵌入的开源护理敏感自然语言处理系统。
Comput Inform Nurs. 2019 Nov;37(11):583-590. doi: 10.1097/CIN.0000000000000557.
4
Extracting Alcohol and Substance Abuse Status from Clinical Notes: The Added Value of Nursing Data.从临床记录中提取酒精和药物滥用状况:护理数据的附加价值。
Stud Health Technol Inform. 2019 Aug 21;264:1056-1060. doi: 10.3233/SHTI190386.
5
Identifying Diabetes in Clinical Notes in Hebrew: A Novel Text Classification Approach Based on Word Embedding.从希伯来语临床记录中识别糖尿病:一种基于词嵌入的新型文本分类方法。
Stud Health Technol Inform. 2019 Aug 21;264:393-397. doi: 10.3233/SHTI190250.
6
Extraction of sleep information from clinical notes of Alzheimer's disease patients using natural language processing.使用自然语言处理从阿尔茨海默病患者的临床记录中提取睡眠信息。
J Am Med Inform Assoc. 2024 Oct 1;31(10):2217-2227. doi: 10.1093/jamia/ocae177.
7
Speculation detection for Chinese clinical notes: Impacts of word segmentation and embedding models.中文临床笔记中的推测检测:分词和嵌入模型的影响
J Biomed Inform. 2016 Apr;60:334-41. doi: 10.1016/j.jbi.2016.02.011. Epub 2016 Feb 26.
8
NimbleMiner: A Novel Multi-Lingual Text Mining Application.NimbleMiner:一种新型多语言文本挖掘应用程序。
Stud Health Technol Inform. 2019 Aug 21;264:1608-1609. doi: 10.3233/SHTI190558.
9
Detecting negation and scope in Chinese clinical notes using character and word embedding.使用字符和词嵌入检测中文临床记录中的否定和范围
Comput Methods Programs Biomed. 2017 Mar;140:53-59. doi: 10.1016/j.cmpb.2016.11.009. Epub 2016 Nov 23.
10
General Symptom Extraction from VA Electronic Medical Notes.从退伍军人事务部电子病历中提取一般症状
Stud Health Technol Inform. 2017;245:356-360.

引用本文的文献

1
Development and Validation of a Rule-Based Natural Language Processing Algorithm to Identify Falls in Inpatient Records of Older Adults: Retrospective Analysis.用于识别老年人住院记录中跌倒事件的基于规则的自然语言处理算法的开发与验证:回顾性分析
JMIR Aging. 2025 Jul 8;8:e65195. doi: 10.2196/65195.
2
Enhanced effective convolutional attention network with squeeze-and-excitation inception module for multi-label clinical document classification.基于挤压激励 inception 模块的增强型有效卷积注意力网络用于多标签临床文档分类
Sci Rep. 2025 May 16;15(1):16988. doi: 10.1038/s41598-025-98719-0.
3
Examining the Role of AI in Changing the Role of Nurses in Patient Care: Systematic Review.
审视人工智能在改变护士在患者护理中角色方面的作用:系统综述
JMIR Nurs. 2025 Feb 19;8:e63335. doi: 10.2196/63335.
4
Psychometrics of the Attitude Scale towards the use of Artificial Intelligence Technologies in Nursing.护理中使用人工智能技术态度量表的心理测量学
BMC Nurs. 2025 Feb 10;24(1):151. doi: 10.1186/s12912-025-02732-7.
5
Co-producing a safe mobility and falls informatics platform to drive meaningful quality improvement in the hospital setting: a mixed-methods protocol for the study.共同打造一个安全移动与跌倒信息学平台,以推动医院环境中有意义的质量改进:该研究的混合方法方案
BMJ Open. 2025 Feb 3;15(2):e082053. doi: 10.1136/bmjopen-2023-082053.
6
Improving Surgical Outcomes for Older Adults with Adoption of Technological Advances in Comprehensive Geriatric Assessment.通过采用综合老年评估中的技术进步改善老年人的手术结局
Semin Colon Rectal Surg. 2024 Dec;35(4). doi: 10.1016/j.scrs.2024.101060. Epub 2024 Nov 6.
7
The use of natural language processing for the identification of ageing syndromes including sarcopenia, frailty and falls in electronic healthcare records: a systematic review.利用自然语言处理技术在电子医疗记录中识别包括肌肉减少症、虚弱和跌倒在内的老年综合征:系统评价。
Age Ageing. 2024 Jul 2;53(7). doi: 10.1093/ageing/afae135.
8
Advancing equity in breast cancer care: natural language processing for analysing treatment outcomes in under-represented populations.推进乳腺癌护理中的公平性:自然语言处理分析代表性不足人群的治疗结果。
BMJ Health Care Inform. 2024 Jul 1;31(1):e100966. doi: 10.1136/bmjhci-2023-100966.
9
Using Natural Language Processing to Identify Home Health Care Patients at Risk for Diagnosis of Alzheimer's Disease and Related Dementias.利用自然语言处理识别有阿尔茨海默病和相关痴呆症诊断风险的家庭保健患者。
J Appl Gerontol. 2024 Oct;43(10):1461-1472. doi: 10.1177/07334648241242321. Epub 2024 Mar 31.
10
Machine Learning Accelerates De Novo Design of Antimicrobial Peptides.机器学习加速抗菌肽的从头设计。
Interdiscip Sci. 2024 Jun;16(2):392-403. doi: 10.1007/s12539-024-00612-3. Epub 2024 Feb 28.