Patel Chintan O, Cimino James J
Department of Biomedical Informatics, Columbia University, New York, NY, USA.
AMIA Annu Symp Proc. 2007 Oct 11;2007:588-92.
Biomedical terminologies often contain composite concepts that cannot be translated into single unique synonymous concepts in a target controlled terminology. Such composite concepts need to be decomposed into sets of component concepts present in the target terminology that can serve as the proxy for applications in information retrieval, decision support or data analysis. Towards this goal, we use a "clustering coefficient" over the UMLS Metathesaurus to traverse the closely clustered neighbors of the composite source concept to generate a ranked list of possible component concepts. Using the MeSH Associated Expression mappings as the gold-standard, we show that the proposed approach generates relevant component concepts as compared to existing semantic locality based methods. The topological connectivity of the concepts in the UMLS Metathesaurus is a useful feature that can be coupled with existing lexical and semantic locality based approaches towards terminology translation.
生物医学术语通常包含复合概念,这些复合概念无法在目标受控术语中翻译为单一独特的同义概念。此类复合概念需要分解为目标术语中存在的组成概念集,这些组成概念集可作为信息检索、决策支持或数据分析应用的代理。为了实现这一目标,我们使用统一医学语言系统(UMLS)元词表上的“聚类系数”来遍历复合源概念的紧密聚类邻居,以生成可能的组成概念的排序列表。以医学主题词表(MeSH)关联表达式映射作为金标准,我们表明,与现有的基于语义局部性的方法相比,所提出的方法能够生成相关的组成概念。UMLS元词表中概念的拓扑连通性是一个有用的特征,可以与现有的基于词汇和语义局部性的术语翻译方法相结合。