Suppr超能文献

解析临床文本:最先进的解析器有多出色?

Parsing clinical text: how good are the state-of-the-art parsers?

作者信息

Jiang Min, Huang Yang, Fan Jung-wei, Tang Buzhou, Denny Josh, Xu Hua

出版信息

BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S2. doi: 10.1186/1472-6947-15-S1-S2. Epub 2015 May 20.

Abstract

BACKGROUND

Parsing, which generates a syntactic structure of a sentence (a parse tree), is a critical component of natural language processing (NLP) research in any domain including medicine. Although parsers developed in the general English domain, such as the Stanford parser, have been applied to clinical text, there are no formal evaluations and comparisons of their performance in the medical domain.

METHODS

In this study, we investigated the performance of three state-of-the-art parsers: the Stanford parser, the Bikel parser, and the Charniak parser, using following two datasets: (1) A Treebank containing 1,100 sentences that were randomly selected from progress notes used in the 2010 i2b2 NLP challenge and manually annotated according to a Penn Treebank based guideline; and (2) the MiPACQ Treebank, which is developed based on pathology notes and clinical notes, containing 13,091 sentences. We conducted three experiments on both datasets. First, we measured the performance of the three state-of-the-art parsers on the clinical Treebanks with their default settings. Then we re-trained the parsers using the clinical Treebanks and evaluated their performance using the 10-fold cross validation method. Finally we re-trained the parsers by combining the clinical Treebanks with the Penn Treebank.

RESULTS

Our results showed that the original parsers achieved lower performance in clinical text (Bracketing F-measure in the range of 66.6%-70.3%) compared to general English text. After retraining on the clinical Treebank, all parsers achieved better performance, with the best performance from the Stanford parser that reached the highest Bracketing F-measure of 73.68% on progress notes and 83.72% on the MiPACQ corpus using 10-fold cross validation. When the combined clinical Treebanks and Penn Treebank was used, of the three parsers, the Charniak parser achieved the highest Bracketing F-measure of 73.53% on progress notes and the Stanford parser reached the highest F-measure of 84.15% on the MiPACQ corpus.

CONCLUSIONS

Our study demonstrates that re-training using clinical Treebanks is critical for improving general English parsers' performance on clinical text, and combining clinical and open domain corpora might achieve optimal performance for parsing clinical text.

摘要

背景

句法分析用于生成句子的句法结构(句法剖析树),是包括医学领域在内的任何领域的自然语言处理(NLP)研究的关键组成部分。尽管在通用英语领域开发的句法分析器,如斯坦福句法分析器,已应用于临床文本,但尚无对其在医学领域性能的正式评估和比较。

方法

在本研究中,我们使用以下两个数据集研究了三种最先进的句法分析器的性能:斯坦福句法分析器、比克尔句法分析器和查尔尼亚克句法分析器:(1)一个树库,包含从2010年i2b2 NLP挑战赛中使用的病程记录中随机选择的1100个句子,并根据基于宾州树库的指南进行了人工标注;(2)MiPACQ树库,它基于病理记录和临床记录开发,包含13091个句子。我们在这两个数据集上进行了三项实验。首先,我们在默认设置下测量了三种最先进的句法分析器在临床树库上的性能。然后,我们使用临床树库对句法分析器进行重新训练,并使用10折交叉验证方法评估它们的性能。最后,我们通过将临床树库与宾州树库相结合来重新训练句法分析器。

结果

我们的结果表明,与通用英语文本相比,原始句法分析器在临床文本中的性能较低(括号F值在66.6%-70.3%之间)。在临床树库上重新训练后,所有句法分析器的性能都有所提高,其中斯坦福句法分析器表现最佳,在病程记录上使用10折交叉验证达到了最高的括号F值73.68%,在MiPACQ语料库上达到了83.72%。当使用临床树库和宾州树库的组合时,在三种句法分析器中,查尔尼亚克句法分析器在病程记录上达到了最高的括号F值73.53%,斯坦福句法分析器在MiPACQ语料库上达到了最高的F值84.15%。

结论

我们的研究表明,使用临床树库进行重新训练对于提高通用英语句法分析器在临床文本上的性能至关重要,并且将临床语料库和开放领域语料库相结合可能会在句法分析临床文本时实现最佳性能。

相似文献

1
Parsing clinical text: how good are the state-of-the-art parsers?解析临床文本:最先进的解析器有多出色?
BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S2. doi: 10.1186/1472-6947-15-S1-S2. Epub 2015 May 20.
3
Domain adaption of parsing for operative notes.手术记录解析的领域适应
J Biomed Inform. 2015 Apr;54:1-9. doi: 10.1016/j.jbi.2015.01.016. Epub 2015 Feb 7.
5
Porting a lexicalized-grammar parser to the biomedical domain.将词汇语法分析器移植到生物医学领域。
J Biomed Inform. 2009 Oct;42(5):852-65. doi: 10.1016/j.jbi.2008.12.004. Epub 2008 Dec 25.
6
Cross-Lingual Universal Dependency Parsing Only From One Monolingual Treebank.
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):13393-13407. doi: 10.1109/TPAMI.2023.3291388. Epub 2023 Oct 3.
8
Statistical parsing of varieties of clinical Finnish.芬兰语临床变体的统计句法分析。
Artif Intell Med. 2014 Jul;61(3):131-6. doi: 10.1016/j.artmed.2014.02.002. Epub 2014 Mar 5.

引用本文的文献

本文引用的文献

2
Towards comprehensive syntactic and semantic annotations of the clinical narrative.朝着临床叙述的全面句法和语义标注努力。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):922-30. doi: 10.1136/amiajnl-2012-001317. Epub 2013 Jan 25.
10
"Understanding" medical school curriculum content using KnowledgeMap.使用知识图谱“理解”医学院课程内容。
J Am Med Inform Assoc. 2003 Jul-Aug;10(4):351-62. doi: 10.1197/jamia.M1176. Epub 2003 Mar 28.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验