School of Computer Engineering, Nanyang Technological University, Block N4, 02a-32, Nanyang Avenue, Singapore 639798, Singapore.
J Biomed Inform. 2012 Apr;45(2):337-49. doi: 10.1016/j.jbi.2011.11.010. Epub 2011 Dec 2.
The biomedical sciences are one of the few domains in which ontologies are widely developed to facilitate information retrieval and knowledge sharing. However, applications that use different ontologies still cannot share knowledge without explicit references between overlapping concepts. Ontology alignment is the task of identifying such equivalence relations between concepts across ontologies. Its application to the biomedical domain should address two open issues: (1) determining the equivalence of concept pairs that have overlapping terms in their names, and (2) the high run-time required to align the large ontologies that are typical of the biomedical domain. To address them, we present a novel approach, named the Biomedical Ontologies Alignment Technique (BOAT), which is state-of-the-art in terms of F-measure, precision and speed. A key feature of BOAT is that it considers the informativeness of each component word in the concept labels, which has a significant impact on biomedical ontologies, resulting in a 12.2% increase in F-measure. Another important feature of BOAT is that it selects for comparison only concept pairs that show a high likelihood of equivalence, based on the similarity of their annotations. BOAT's F-measure of 0.88 for the alignment of the mouse and human anatomy ontologies is on par with that of another state-of-the-art matcher, AgreementMaker, and BOAT takes less time to run.
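The two ideas named in the abstract, weighting label words by informativeness and comparing only likely-equivalent pairs, can be illustrated with a minimal Python sketch. This is not BOAT's actual method, which the abstract does not specify. It assumes an IDF-style weight as the informativeness measure and uses shared informative words as a stand-in for annotation similarity. All function names and the `min_weight` parameter are hypothetical.

```python
import math
from collections import Counter


def tokenize(label):
    """Split a concept label into lowercase words."""
    return label.lower().replace("_", " ").split()


def word_weights(labels):
    """Assumed informativeness measure: IDF-style, so rarer words weigh more."""
    n = len(labels)
    doc_freq = Counter(w for label in labels for w in set(tokenize(label)))
    return {w: math.log(1 + n / count) for w, count in doc_freq.items()}


def weighted_similarity(label_a, label_b, weights):
    """Weighted Jaccard overlap: shared word weight over total word weight."""
    words_a, words_b = set(tokenize(label_a)), set(tokenize(label_b))
    total = sum(weights.get(w, 0.0) for w in words_a | words_b)
    if total == 0.0:
        return 0.0
    shared = sum(weights.get(w, 0.0) for w in words_a & words_b)
    return shared / total


def candidate_pairs(source_labels, target_labels, weights, min_weight):
    """Yield only pairs that share a word with weight >= min_weight.

    This avoids scoring every source label against every target label.
    """
    index = {}
    for target in target_labels:
        for w in set(tokenize(target)):
            if weights.get(w, 0.0) >= min_weight:
                index.setdefault(w, set()).add(target)
    for source in source_labels:
        matches = set()
        for w in set(tokenize(source)):
            matches |= index.get(w, set())
        for target in sorted(matches):
            yield source, target
```

With weights computed over both ontologies' labels, a common word such as "left" contributes less to the score than a rarer word such as "kidney", so "left kidney" scores closer to "kidney" than to "left lung" even though each pair shares exactly one word. The inverted index in `candidate_pairs` is one simple way to shrink the comparison set; the threshold would need tuning on real data.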