Gryk Michael R, Ludäscher Bertram
University of Illinois, Urbana-Champaign, Champaign IL 61820, USA.
UCONN Health, Farmington, CT 06030, USA.
Transform Digit Worlds (2018). 2018 Mar;10766:620-625. doi: 10.1007/978-3-319-78105-1_70. Epub 2018 Mar 15.
Two barriers to computational reproducibility are the ability to record the critical metadata required for rerunning a computation, as well as translating the semantics of the metadata so that alternate approaches can easily be configured for verifying computational reproducibility. We are addressing this problem in the context of biomolecular NMR computational analysis by developing a series of linked ontologies which define the semantics of the various software tools used by researchers for data transformation and analysis. Building from a core ontology representing the primary observational data of NMR, the linked data approach allows for the translation of metadata in order to configure alternate software approaches for given computational tasks. In this paper we illustrate the utility of this with a small sample of the core ontology as well as tool-specific semantics for two third-party software tools. This approach to semantic mediation will help support an automated approach to validating the reliability of computation in which the same processing workflow is implemented with different software tools. In addition, the detailed semantics of both the data and the processing functionalities will provide a method for software tool classification.
计算可重复性面临两个障碍,一是记录重新运行计算所需的关键元数据的能力,二是转换元数据的语义,以便能够轻松配置替代方法来验证计算可重复性。我们正在生物分子核磁共振计算分析的背景下解决这个问题,通过开发一系列链接本体,这些本体定义了研究人员用于数据转换和分析的各种软件工具的语义。从表示核磁共振主要观测数据的核心本体出发,链接数据方法允许转换元数据,以便为给定的计算任务配置替代软件方法。在本文中,我们用核心本体的一个小样本以及两个第三方软件工具的特定工具语义来说明其效用。这种语义中介方法将有助于支持一种自动化方法来验证计算的可靠性,即使用不同的软件工具实现相同的处理工作流程。此外,数据和处理功能的详细语义将提供一种软件工具分类方法。