Schulz Stefan, Beisswanger Elena, van den Hoek László, Bodenreider Olivier, van Mulligen Erik M
Institute of Medical Biometry and Medical Informatics, University Medical Center Freiburg, Freiburg, Germany.
Bioinformatics. 2009 Jun 15;25(12):i69-76. doi: 10.1093/bioinformatics/btp194.
For many years, the Unified Medical Language System (UMLS) semantic network (SN) has been used as an upper-level semantic framework for the categorization of terms from terminological resources in biomedicine. BioTop has recently been developed as an upper-level ontology for the biomedical domain. In contrast to the SN, it is founded upon strict ontological principles, using OWL DL as a formal representation language, which has become standard in the semantic Web. In order to make logic-based reasoning available for the resources annotated or categorized with the SN, a mapping ontology was developed aligning the SN with BioTop.
The theoretical foundations and the practical realization of the alignment are being described, with a focus on the design decisions taken, the problems encountered and the adaptations of BioTop that became necessary. For evaluation purposes, UMLS concept pairs obtained from MEDLINE abstracts by a named entity recognition system were tested for possible semantic relationships. Furthermore, all semantic-type combinations that occur in the UMLS Metathesaurus were checked for satisfiability.
The effort-intensive alignment process required major design changes and enhancements of BioTop and brought up several design errors that could be fixed. A comparison between a human curator and the ontology yielded only a low agreement. Ontology reasoning was also used to successfully identify 133 inconsistent semantic-type combinations.
BioTop, the OWL DL representation of the UMLS SN, and the mapping ontology are available at http://www.purl.org/biotop/.
多年来,统一医学语言系统(UMLS)语义网络(SN)一直被用作生物医学术语资源中术语分类的上层语义框架。BioTop最近被开发为生物医学领域的上层本体。与SN不同,它基于严格的本体原则,使用OWL DL作为形式表示语言,这已成为语义网的标准。为了使基于逻辑的推理可用于用SN注释或分类的资源,开发了一种映射本体,将SN与BioTop对齐。
描述了对齐的理论基础和实际实现,重点关注所做的设计决策、遇到的问题以及对BioTop进行的必要调整。为了进行评估,对通过命名实体识别系统从MEDLINE摘要中获得的UMLS概念对测试可能的语义关系。此外,检查了UMLS元词表中出现的所有语义类型组合的可满足性。
耗费精力的对齐过程需要对BioTop进行重大设计更改和增强,并发现了几个可以修复的设计错误。人工编目员与本体之间的比较结果一致性较低。本体推理还成功识别了133个不一致的语义类型组合。
BioTop、UMLS SN的OWL DL表示以及映射本体可在http://www.purl.org/biotop/获取。