Patterson Olga, Igo Sean, Hurdle John F
Department of Biomedical Informatics, University of Utah, Salt Lake City, UT.
AMIA Annu Symp Proc. 2010 Nov 13;2010:612-6.
Natural language processing of clinical notes is challenging due to a high degree of semantic ambiguity. Previous research has uncovered ways to improve disambiguation accuracy using manually created rules of semantic sentence structure. However, applying a natural language processing system in a new clinical domain using this method is very labor intensive. This paper presents an automatic method of developing such disambiguation rules for a wide range of clinical domains. Our rules are based on the co-occurrence patterns of semantic types of terms unambiguously mapped to UMLS concepts by MetaMap. These patterns are combined into a sublanguage semantic schema that can be used by an existing natural language processing system such as MetaMap. The differences of co-occurrence patterns across clinical notes of different domains are presented here as evidence of clinical sublanguages.
由于语义歧义程度高,临床记录的自然语言处理具有挑战性。先前的研究已经发现了一些方法,可使用人工创建的语义句子结构规则来提高消歧准确性。然而,使用这种方法在新的临床领域应用自然语言处理系统非常耗费人力。本文提出了一种针对广泛临床领域开发此类消歧规则的自动方法。我们的规则基于通过MetaMap明确映射到UMLS概念的术语语义类型的共现模式。这些模式被组合成一个子语言语义模式,可被诸如MetaMap之类的现有自然语言处理系统使用。不同领域临床记录中共现模式的差异在此作为临床子语言的证据呈现。