Department of Biomedical Informatics, University of Pittsburgh School of Medicine, PA 15232, USA.
J Biomed Inform. 2011 Feb;44(1):163-79. doi: 10.1016/j.jbi.2010.07.006. Epub 2010 Jul 18.
While the biomedical informatics community widely acknowledges the utility of domain ontologies, there remain many barriers to their effective use. One important requirement of domain ontologies is that they must achieve a high degree of coverage of the domain concepts and concept relationships. However, the development of these ontologies is typically a manual, time-consuming, and often error-prone process. Limited resources result in missing concepts and relationships as well as difficulty in updating the ontology as knowledge changes. Methodologies developed in the fields of Natural Language Processing, information extraction, information retrieval and machine learning provide techniques for automating the enrichment of an ontology from free-text documents. In this article, we review existing methodologies and developed systems, and discuss how existing methods can benefit the development of biomedical ontologies.
虽然生物医学信息学领域广泛承认领域本体的实用性,但在有效使用它们方面仍然存在许多障碍。领域本体的一个重要要求是,它们必须实现对领域概念和概念关系的高度覆盖。然而,这些本体的开发通常是一个手动的、耗时的且经常容易出错的过程。有限的资源导致缺少概念和关系,并且难以随着知识的变化更新本体。自然语言处理、信息提取、信息检索和机器学习等领域中开发的方法提供了从自由文本文档自动丰富本体的技术。在本文中,我们回顾了现有的方法和开发的系统,并讨论了现有方法如何有益于生物医学本体的开发。