Evaluating the state of the art in disorder recognition and normalization of the clinical narrative.

Authors

Pradhan Sameer, Elhadad Noémie, South Brett R, Martinez David, Christensen Lee, Vogel Amy, Suominen Hanna, Chapman Wendy W, Savova Guergana

Affiliations

Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA.

Columbia University, New York, New York, USA.

Publication information

J Am Med Inform Assoc. 2015 Jan;22(1):143-54. doi: 10.1136/amiajnl-2013-002544. Epub 2014 Aug 21.

DOI:10.1136/amiajnl-2013-002544

PMID:25147248

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4433360/

Abstract

OBJECTIVE

The ShARe/CLEF eHealth 2013 Evaluation Lab Task 1 was organized to evaluate the state of the art on clinical text in (i) disorder mention identification/recognition based on the Unified Medical Language System (UMLS) definition (Task 1a) and (ii) disorder mention normalization to an ontology (Task 1b). No such community evaluation had previously been carried out. Task 1a received 22 system submissions, and Task 1b received 17. Most of the systems employed a combination of rules and machine learners.

MATERIALS AND METHODS

We used a subset of the Shared Annotated Resources (ShARe) corpus of annotated clinical text: 199 clinical notes for training and 99 for testing (roughly 180K words in total). We provided the community with the annotated gold standard training documents to build systems to identify and normalize disorder mentions. The systems were tested on a held-out gold standard test set to measure their performance.

RESULTS

For Task 1a, the best-performing system achieved an F1 score of 0.75 (0.80 precision; 0.71 recall). For Task 1b, another system performed best with an accuracy of 0.59.
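As a quick consistency check on the Task 1a figures, F1 is the harmonic mean of precision and recall. The sketch below is illustrative only: the function name `f1_score` is an assumption, and because the reported precision and recall are already rounded to two decimals, the result can only be expected to match the reported F1 after rounding.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        # Avoid division by zero when a system returns nothing correct.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Reported Task 1a values for the best-performing system (rounded in the abstract).
best_f1 = f1_score(precision=0.80, recall=0.71)
print(round(best_f1, 2))
```

Rounded to two decimals, the result agrees with the reported F1 of 0.75.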

DISCUSSION

Most of the participating systems used a hybrid approach by supplementing machine-learning algorithms with features generated by rules and gazetteers created from the training data and from external resources.
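The hybrid idea described above can be sketched roughly as follows: a gazetteer is built from annotated training mentions, and gazetteer membership is added as one feature among others for a machine-learned sequence tagger. This is a minimal illustration, not any participant's actual system; the function names, the feature set, and the example mentions are all hypothetical and are not drawn from the ShARe corpus.

```python
from typing import Dict, List, Set


def build_gazetteer(training_mentions: List[str]) -> Set[str]:
    """Collect lowercased tokens that occur inside annotated disorder mentions."""
    gazetteer: Set[str] = set()
    for mention in training_mentions:
        gazetteer.update(mention.lower().split())
    return gazetteer


def token_features(tokens: List[str], gazetteer: Set[str]) -> List[Dict[str, object]]:
    """Build one feature dict per token, mixing surface and gazetteer features."""
    features = []
    for i, token in enumerate(tokens):
        lower = token.lower()
        features.append({
            "lower": lower,
            "is_capitalized": token[:1].isupper(),
            "suffix3": lower[-3:],
            # Gazetteer-derived features that supplement the learner.
            "in_gazetteer": lower in gazetteer,
            "prev_in_gazetteer": i > 0 and tokens[i - 1].lower() in gazetteer,
        })
    return features


# Hypothetical training mentions and sentence, for illustration only.
mentions = ["atrial fibrillation", "pneumonia"]
gazetteer = build_gazetteer(mentions)
feats = token_features("Patient has atrial fibrillation".split(), gazetteer)
for f in feats:
    print(f["lower"], f["in_gazetteer"])
```

Feature dicts of this shape could then be passed to a sequence classifier; the choice of learner is left open here because the abstract does not specify one.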

CONCLUSIONS

The task of disorder normalization is more challenging than that of identification. The ShARe corpus is available to the community as a reference standard for future studies.
