Burek Patryk, Loebe Frank, Herre Heinrich
Institute of Medical Informatics, Statistics and Epidemiology, University of Leipzig, Haertelstrasse 16-18, Leipzig, 04107, Germany.
Computer Science Institute, University of Leipzig, Augustusplatz 10, Leipzig, 04109, Germany.
J Biomed Semantics. 2017 Oct 4;8(1):48. doi: 10.1186/s13326-017-0152-y.
Gene Ontology (GO) is the largest resource for cataloging gene products. This resource grows steadily and, naturally, this growth raises issues regarding the structure of the ontology. Moreover, modeling and refactoring large ontologies such as GO is generally far from being simple, as a whole as well as when focusing on certain aspects or fragments. It seems that human-friendly graphical modeling languages such as the Unified Modeling Language (UML) could be helpful in connection with these tasks.
We investigate the use of UML for making the structural organization of the Molecular Function Ontology (MFO), a sub-ontology of GO, more explicit. More precisely, we present a UML dialect, called the Function Modeling Language (FueL), which is suited for capturing functions in an ontologically founded way. FueL is equipped, among other features, with language elements that arise from studying patterns of subsumption between functions. We show how to use this UML dialect for capturing the structure of molecular functions. Furthermore, we propose and discuss some refactoring options concerning fragments of MFO.
FueL enables the systematic, graphical representation of functions and their interrelations, including making information explicit that is currently either implicit in MFO or is mainly captured in textual descriptions. Moreover, the considered subsumption patterns lend themselves to the methodical analysis of refactoring options with respect to MFO. On this basis we argue that the approach can increase the comprehensibility of the structure of MFO for humans and can support communication, for example, during revision and further development.
基因本体论(Gene Ontology,GO)是用于编目基因产物的最大资源。该资源稳步增长,这种增长自然会引发有关本体结构的问题。此外,对诸如GO这样的大型本体进行建模和重构,总体而言以及专注于某些方面或片段时,通常都绝非易事。似乎诸如统一建模语言(Unified Modeling Language,UML)之类的对人类友好的图形建模语言在处理这些任务时会有所帮助。
我们研究了使用UML使GO的子本体分子功能本体论(Molecular Function Ontology,MFO)的结构组织更加明确。更确切地说,我们提出了一种UML方言,称为功能建模语言(Function Modeling Language,FueL),它适合以本体论的方式捕获功能。FueL除其他功能外,还配备了从研究功能之间的包含模式中产生的语言元素。我们展示了如何使用这种UML方言来捕获分子功能的结构。此外,我们提出并讨论了一些有关MFO片段的重构选项。
FueL能够对功能及其相互关系进行系统的图形表示,包括使当前在MFO中隐含或主要在文本描述中捕获的信息变得明确。此外,所考虑的包含模式有助于对MFO的重构选项进行有条理的分析。在此基础上,我们认为该方法可以提高MFO结构对人类的可理解性,并可以支持交流,例如在修订和进一步开发期间。