Lu Chris J, Tormey Destinee, McCreedy Lynn, Browne Allen C
National Library of Medicine, Bethesda, MD, USA.
Stud Health Technol Inform. 2017;245:501-505.
Concept mapping is an important task in natural language processing (NLP) for bioinformatics. The UMLS Metathesaurus provides a rich synonym thesaurus and is a popular resource for concept mapping. Query expansion that substitutes subterms with their synonyms is an effective technique for increasing recall in UMLS concept mapping. Synonyms used to substitute subterms are called element synonyms. The completeness and quality of both the element synonyms and the UMLS synonym thesaurus are key to success in such applications. The Lexical Systems Group (LSG) has developed a new system for element synonym acquisition, based on enhanced requirements and a new design for better performance. The results show: 1) a 36.71-fold growth of synonyms in the Lexicon (lexSynonym) in the 2017 release; and 2) improved recall and F1 in concept mapping, with similar precision, when lexSynonym.2017 is used as the element synonyms, owing to its broader coverage and better quality.
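The subterm substitution described above can be illustrated with a minimal sketch. It is not the LSG implementation: the synonym table, the function name `expand_query`, and the example term are all hypothetical, and the sketch only handles single-word subterms split on whitespace.

```python
from itertools import product

# Hypothetical element synonyms for illustration only;
# these entries are NOT taken from lexSynonym.
ELEMENT_SYNONYMS = {
    "kidney": ["renal"],
    "stone": ["calculus"],
}


def expand_query(term, synonyms):
    """Return every variant of `term` formed by substituting subterms.

    Each whitespace-separated word is treated as a subterm. A word may be
    kept as-is or replaced by any of its element synonyms.
    """
    words = term.lower().split()
    # For each word: the word itself, followed by its element synonyms.
    choices = [[word] + synonyms.get(word, []) for word in words]
    # The Cartesian product enumerates all combinations of substitutions.
    return [" ".join(combo) for combo in product(*choices)]


variants = expand_query("kidney stone", ELEMENT_SYNONYMS)
for variant in variants:
    print(variant)
# prints:
# kidney stone
# kidney calculus
# renal stone
# renal calculus
```

Each expanded variant would then be looked up against the UMLS synonym thesaurus. A term that has no direct match may still map to a concept through one of its variants, which is how the technique raises recall. The number of variants grows multiplicatively with the number of synonyms per subterm, which is one reason the quality of the element synonyms matters for precision.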