A comprehensive study of mobility functioning information in clinical notes: Entity hierarchy, corpus annotation, and sequence labeling.

Author information

Oklahoma State University, Stillwater, OK, United States.

National Institutes of Health Clinical Center, Bethesda, MD, United States.

Publication information

Int J Med Inform. 2021 Mar;147:104351. doi: 10.1016/j.ijmedinf.2020.104351. Epub 2020 Dec 24.

Abstract

BACKGROUND

Secondary use of Electronic Health Records (EHRs) has mostly focused on health conditions (diseases and drugs). Function is an important health indicator in addition to morbidity and mortality. Nevertheless, function has been overlooked in assessing patients' health status. The World Health Organization (WHO)'s International Classification of Functioning, Disability and Health (ICF) is considered the international standard for describing and coding function and health states. We present the first comprehensive analysis and identification of functioning concepts in the Mobility domain of the ICF.

RESULTS

Using physical therapy notes at the National Institutes of Health's Clinical Center, we induced a hierarchical order of mobility-related entities including 5 entity types, 3 relations, 8 attributes, and 33 attribute values. Two domain experts manually curated a gold standard corpus of 14,281 nested entity mentions from 400 clinical notes. Inter-annotator agreement (IAA) under exact matching averaged 92.3% F1-score on mention text spans and 96.6% Cohen's kappa on attribute assignments. A high-performance Ensemble machine learning model for named entity recognition (NER) was trained and evaluated using the gold standard corpus. The average F1-score on exact entity matching of our Ensemble method (84.90%) outperformed popular NER methods: Conditional Random Field (80.4%), Recurrent Neural Network (81.82%), and Bidirectional Encoder Representations from Transformers (82.33%).
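The two agreement measures named above can be illustrated with a minimal sketch. It is not the authors' evaluation code: the function names, the (start, end, type) span representation, and the toy labels are all assumptions made for illustration. Exact matching here counts a span as agreed only when offsets and type are identical, and Cohen's kappa corrects observed label agreement for chance agreement.

```python
from collections import Counter


def exact_match_f1(spans_a, spans_b):
    """F1 over exactly matching (start, end, type) spans from two annotators.

    Hypothetical helper: annotator A is treated as the reference, B as the
    prediction. With exact matching the score is the same if they are swapped.
    """
    a, b = set(spans_a), set(spans_b)
    if not a and not b:
        return 1.0  # both annotators marked nothing: treat as full agreement
    matched = len(a & b)
    precision = matched / len(b) if b else 0.0
    recall = matched / len(a) if a else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and equal length")
    n = len(labels_a)
    observed = sum(x == y for x, y in zip(labels_a, labels_b)) / n
    count_a, count_b = Counter(labels_a), Counter(labels_b)
    # Chance agreement: product of each annotator's marginal label rates.
    expected = sum(count_a[k] * count_b[k] for k in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used a single identical label throughout
    return (observed - expected) / (1 - expected)


# Toy data (invented): nested spans are allowed because each span is
# compared independently of the others.
spans_a = [(0, 12, "A"), (0, 30, "B"), (35, 48, "C")]
spans_b = [(0, 12, "A"), (0, 30, "B"), (35, 45, "C")]  # last span differs
span_f1 = exact_match_f1(spans_a, spans_b)

attr_a = ["x", "x", "y", "y"]
attr_b = ["x", "x", "y", "x"]
attr_kappa = cohen_kappa(attr_a, attr_b)
```

Because spans are compared as whole tuples, a boundary disagreement of a single character counts as a full miss, which is why exact-match figures are usually a conservative estimate of agreement.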

CONCLUSIONS

The results of this study show that mobility functioning information can be reliably captured from clinical notes once adequate resources are provided for sequence labeling methods. We expect that functioning concepts in other domains of the ICF can be identified in similar fashion.

