Kamdar Maulik R, Tudorache Tania, Musen Mark A
Stanford Center for Biomedical Informatics Research, Department of Medicine, Stanford University.
CEUR Workshop Proc. 2015 Jul;1515. Epub 2015 Nov 18.
We investigate the current extent of term reuse and overlap among biomedical ontologies. We use the corpus of biomedical ontologies stored in the BioPortal repository, and analyze three types of reuse constructs: (a) explicit term reuse, (b) reuse, and (c) Concept Unique Identifier (CUI) reuse. While there is a term label similarity of approximately 14.4% of the total terms, we observed that most ontologies reuse considerably fewer than 5% of their terms from a concise set of a few core ontologies. We developed an interactive visualization to explore reuse dependencies among biomedical ontologies. Moreover, we identified a set of patterns that indicate ontology developers did intend to reuse terms from other ontologies, but they were using different and sometimes incorrect representations. Our results suggest the value of semi-automated tools that augment term reuse in the ontology engineering process through personalized recommendations.
我们研究了生物医学本体中术语重用和重叠的当前程度。我们使用存储在BioPortal知识库中的生物医学本体语料库,并分析三种类型的重用结构:(a)显式术语重用,(b)重用,以及(c)概念唯一标识符(CUI)重用。虽然总术语中约14.4%存在术语标签相似性,但我们观察到,大多数本体从少数几个核心本体的精简集中重用的术语远少于5%。我们开发了一种交互式可视化工具,以探索生物医学本体之间的重用依赖关系。此外,我们识别出了一组模式,这些模式表明本体开发者确实打算重用其他本体中的术语,但他们使用的是不同的、有时甚至是不正确的表示形式。我们的结果表明了半自动工具在本体工程过程中通过个性化推荐增强术语重用的价值。