Zhang Li, Perl Yehoshua, Halper Michael, Geller James, Cimino James J
CS Department, NJIT, University Heights, Newark, NJ 07102, USA.
J Am Med Inform Assoc. 2004 May-Jun;11(3):195-206. doi: 10.1197/jamia.M1269. Epub 2004 Feb 5.
The Unified Medical Language System's (UMLS's) Semantic Network's (SN's) two-tree structure is restrictive because it does not allow a semantic type to be a specialization of several other semantic types. In this article, the SN is expanded into a multiple subsumption structure with a directed acyclic graph (DAG) IS-A hierarchy, allowing a semantic type to have multiple parents. New viable IS-A links are added as warranted.
Two methodologies are presented to identify and add new viable IS-A links. The first methodology is based on imposing the characteristic of connectivity on a previously presented partition of the SN. Four transformations are provided to find viable IS-A links in the process of converting the partition's disconnected groups into connected ones. The second methodology identifies new IS-A links through a string matching process involving names and definitions of various semantic types in the SN. A domain expert is needed to review all the results to determine the validity of the new IS-A links.
Nineteen new IS-A links are added to the SN, and four new semantic types are also created to support the multiple subsumption framework. The resulting network, called the Enriched Semantic Network (ESN), exhibits a DAG-structured hierarchy. A partition of the ESN containing 19 connected groups is also derived.
The ESN is an expanded abstraction of the UMLS compared with the original SN. Its multiple subsumption hierarchy can accommodate semantic types with multiple parents. Its representation thus provides direct access to a broader range of subsumption knowledge.
统一医学语言系统(UMLS)的语义网络(SN)的双树结构具有局限性,因为它不允许一个语义类型成为其他几个语义类型的特化。在本文中,SN被扩展为具有有向无环图(DAG)“是一个”层次结构的多重包含结构,允许一个语义类型有多个父类型。根据需要添加新的可行“是一个”链接。
提出了两种方法来识别和添加新的可行“是一个”链接。第一种方法基于在之前呈现的SN分区上施加连通性特征。提供了四种转换,以便在将分区的不连通组转换为连通组的过程中找到可行的“是一个”链接。第二种方法通过涉及SN中各种语义类型的名称和定义的字符串匹配过程来识别新的“是一个”链接。需要领域专家审查所有结果,以确定新“是一个”链接的有效性。
向SN添加了19个新的“是一个”链接,还创建了四个新的语义类型以支持多重包含框架。由此产生的网络称为丰富语义网络(ESN),呈现出DAG结构的层次结构。还得出了一个包含19个连通组的ESN分区。
与原始SN相比,ESN是UMLS的扩展抽象。其多重包含层次结构可以容纳具有多个父类型的语义类型。因此,它的表示提供了对更广泛的包含知识的直接访问。