Nelson S J, Fuller L F, Erlbaum M S, Tuttle M S, Sherertz D D, Olson N E
Department of Medicine, Medical College of Georgia, Augusta.
Proc Annu Symp Comput Appl Med Care. 1992:649-53.
Meta-1.1, the UMLS metathesaurus, represents medical knowledge in the forms of names of concepts and links between those concepts. The representations of the semantic neighborhood of a concept can be thought of as dimensions of the property of semantic locality and include term information (broader, narrower, or otherwise related), the contextual information (parent-child, siblings in a hierarchy), the semantic types, and the co-occurrence data (links discovered empirically from concepts used to index the medical literature.) The degree of redundancy of each of these dimensions was investigated by reviewing the extent of multiple presentations of concepts which appear as related to a given concept. The degree of overlap was surprisingly small. While the co-occurrence data finds some of the links represented by other dimensions, those links are but minute fractions of the vast amount of co-occurrence derived links. Because parent-child relationships are often subsumptive (or categorical) in nature, it might be expected that siblings usually share the same semantic types. While true in the aggregate, the wide variance in percent of types shared may reflect the intended usages of the source vocabularies. Noun phrases were extracted from the definitions of 40 concepts in Meta-1 in order to assess systematically the coverage of important concepts by Meta-1, and to assess whether the links between these definitional concepts, which may have special value, and the concept being defined were indeed present. Out of 161 of these definitional concepts, 29 were not represented in Meta-1, and 37 of those represented in Meta-1 had no direct link to the concept they were defining.(ABSTRACT TRUNCATED AT 250 WORDS)
元数据1.1(UMLS元词表)以概念名称及这些概念之间的链接形式呈现医学知识。概念语义邻域的表示可被视为语义局部性属性的维度,包括术语信息( broader、narrower或以其他方式相关)、上下文信息(层次结构中的父子、兄弟关系)、语义类型以及共现数据(从用于索引医学文献的概念中通过实证发现的链接)。通过审查与给定概念相关的概念多次呈现的程度,对这些维度中每个维度的冗余程度进行了研究。重叠程度惊人地小。虽然共现数据发现了其他维度所表示的一些链接,但这些链接只是大量共现衍生链接中的极小部分。由于父子关系通常本质上是包含性的(或分类性的),可能会预期兄弟通常共享相同的语义类型。总体上确实如此,但共享类型百分比的广泛差异可能反映了源词汇表的预期用法。从元数据1中40个概念的定义中提取名词短语,以便系统评估元数据1对重要概念的覆盖范围,并评估这些定义性概念(可能具有特殊价值)与被定义概念之间的链接是否确实存在。在这些定义性概念中,有161个,其中29个未在元数据1中表示,在元数据1中表示的那些概念中有37个与它们所定义的概念没有直接链接。(摘要截断于250字)