Parsing clinical text: how good are the state-of-the-art parsers?

Authors

Jiang Min, Huang Yang, Fan Jung-wei, Tang Buzhou, Denny Josh, Xu Hua

Publication information

BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S2. doi: 10.1186/1472-6947-15-S1-S2. Epub 2015 May 20.

DOI:10.1186/1472-6947-15-S1-S2

PMID:26045009

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4460747/

Abstract

BACKGROUND

Parsing, which generates a syntactic structure of a sentence (a parse tree), is a critical component of natural language processing (NLP) research in any domain including medicine. Although parsers developed in the general English domain, such as the Stanford parser, have been applied to clinical text, there are no formal evaluations and comparisons of their performance in the medical domain.

METHODS

In this study, we investigated the performance of three state-of-the-art parsers (the Stanford parser, the Bikel parser, and the Charniak parser) using the following two datasets: (1) a Treebank containing 1,100 sentences that were randomly selected from progress notes used in the 2010 i2b2 NLP challenge and manually annotated according to a Penn Treebank-based guideline; and (2) the MiPACQ Treebank, which was developed from pathology notes and clinical notes and contains 13,091 sentences. We conducted three experiments on both datasets. First, we measured the performance of the three parsers on the clinical Treebanks with their default settings. Then we re-trained the parsers using the clinical Treebanks and evaluated their performance using 10-fold cross validation. Finally, we re-trained the parsers by combining the clinical Treebanks with the Penn Treebank.
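
The 10-fold cross validation step can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `k_fold_splits`, the fixed random seed, and the placeholder sentence strings are assumptions made for the example, and the actual fold assignment used in the study is not described in the abstract.

```python
import random


def k_fold_splits(items, k=10, seed=0):
    """Yield (train, test) lists for k-fold cross validation.

    Hypothetical helper: items are shuffled once, then dealt into k folds;
    each fold serves as the held-out test set exactly once.
    """
    indices = list(range(len(items)))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for fold in folds:
        held_out = set(fold)
        train = [items[j] for j in indices if j not in held_out]
        test = [items[j] for j in fold]
        yield train, test


# Placeholder stand-ins for the 1,100 annotated progress-note sentences.
sentences = [f"sentence_{n}" for n in range(1100)]
splits = list(k_fold_splits(sentences))
```

With 1,100 sentences, each of the 10 rounds would re-train a parser on 990 sentences and score it on the remaining 110, so every sentence is used for testing exactly once.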

RESULTS

Our results showed that the original parsers achieved lower performance on clinical text (Bracketing F-measure in the range of 66.6%-70.3%) than on general English text. After re-training on the clinical Treebanks, all parsers achieved better performance; the Stanford parser performed best, reaching a Bracketing F-measure of 73.68% on progress notes and 83.72% on the MiPACQ corpus using 10-fold cross validation. When the clinical Treebanks were combined with the Penn Treebank, the Charniak parser achieved the highest Bracketing F-measure on progress notes (73.53%) and the Stanford parser reached the highest F-measure on the MiPACQ corpus (84.15%).
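
For readers unfamiliar with the metric, a Bracketing F-measure compares the labeled constituents of a predicted parse tree against those of the gold-standard tree. The sketch below is a simplified illustration under stated assumptions, not the evaluation tool used in the study (the abstract does not name one): the helper names `labeled_spans` and `bracketing_prf` are hypothetical, every bracket is assumed to carry a label, part-of-speech brackets are skipped, and punctuation is not treated specially.

```python
from collections import Counter


def labeled_spans(tree):
    """Count (label, start, end) constituents in a Penn Treebank-style string.

    Assumes every bracket has a label. Brackets that directly wrap a single
    word (part-of-speech tags) are not counted as constituents.
    """
    tokens = tree.replace("(", " ( ").replace(")", " ) ").split()
    spans = Counter()
    stack = []  # entries: [label, index of first word, contains a bracket?]
    word_index = 0
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok == "(":
            if stack:
                stack[-1][2] = True
            stack.append([tokens[i + 1], word_index, False])
            i += 2  # skip the bracket and its label
        elif tok == ")":
            label, start, has_child = stack.pop()
            if has_child:
                spans[(label, start, word_index)] += 1
            i += 1
        else:
            word_index += 1  # a word of the sentence
            i += 1
    return spans


def bracketing_prf(gold_tree, predicted_tree):
    """Return (precision, recall, F-measure) over labeled brackets."""
    gold = labeled_spans(gold_tree)
    pred = labeled_spans(predicted_tree)
    matched = sum((gold & pred).values())
    precision = matched / sum(pred.values()) if pred else 0.0
    recall = matched / sum(gold.values()) if gold else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Invented example sentence; not taken from either Treebank.
gold = "(S (NP (NN patient)) (VP (VBZ denies) (NP (NN chest) (NN pain))))"
pred = "(S (NP (NN patient) (NNS denies)) (NP (NN chest) (NN pain)))"
precision, recall, f_measure = bracketing_prf(gold, pred)
```

In this invented example the predicted tree shares two of its three brackets with the four gold brackets, giving a precision of 2/3, a recall of 1/2, and an F-measure of 4/7 (about 57%).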

CONCLUSIONS

Our study demonstrates that re-training on clinical Treebanks is critical for improving the performance of general English parsers on clinical text, and that combining clinical and open-domain corpora might achieve optimal performance for parsing clinical text.
