Phillips Joshua, Chilukuri Ram, Fragoso Gilberto, Warzel Denise, Covitz Peter A
Science Applications International Corporation, Annapolis, MD, USA.
BMC Med Inform Decis Mak. 2006 Jan 6;6:2. doi: 10.1186/1472-6947-6-2.
Robust, programmatically accessible biomedical information services that syntactically and semantically interoperate with other resources are challenging to construct. Such systems require the adoption of common information models, data representations and terminology standards as well as documented application programming interfaces (APIs). The National Cancer Institute (NCI) developed the cancer common ontologic representation environment (caCORE) to provide the infrastructure necessary to achieve interoperability across the systems it develops or sponsors. The caCORE Software Development Kit (SDK) was designed to provide developers both within and outside the NCI with the tools needed to construct such interoperable software systems.
The caCORE SDK requires a Unified Modeling Language (UML) tool to begin the development workflow with the construction of a domain information model in the form of a UML Class Diagram. Models are annotated with concepts and definitions from a description logic terminology source using the Semantic Connector component. The annotated model is registered in the Cancer Data Standards Repository (caDSR) using the UML Loader component. System software is automatically generated using the Codegen component, which produces middleware that runs on an application server. The caCORE SDK was initially tested and validated using a seven-class UML model, and has been used to generate the caCORE production system, which includes models with dozens of classes. The deployed system supports access through object-oriented APIs with consistent syntax for retrieval of any type of data object across all classes in the original UML model. The caCORE SDK is currently being used by several development teams, including by participants in the cancer biomedical informatics grid (caBIG) program, to create compatible data services. caBIG compatibility standards are based upon caCORE resources, and thus the caCORE SDK has emerged as a key enabling technology for caBIG.
The caCORE SDK substantially lowers the barrier to implementing systems that are syntactically and semantically interoperable by providing workflow and automation tools that standardize and expedite modeling, development, and deployment. It has gained acceptance among developers in the caBIG program, and is expected to provide a common mechanism for creating data service nodes on the data grid that is under development.
构建能够在语法和语义上与其他资源进行互操作的强大的、可编程访问的生物医学信息服务具有挑战性。此类系统需要采用通用信息模型、数据表示和术语标准以及文档化的应用程序编程接口(API)。美国国家癌症研究所(NCI)开发了癌症通用本体表示环境(caCORE),以提供实现其开发或资助的系统之间互操作性所需的基础设施。caCORE软件开发工具包(SDK)旨在为NCI内外的开发人员提供构建此类可互操作软件系统所需的工具。
caCORE SDK需要一个统一建模语言(UML)工具来开始开发工作流程,即构建一个UML类图形式的领域信息模型。使用语义连接器组件,根据描述逻辑术语源中的概念和定义对模型进行注释。使用UML加载器组件将带注释的模型注册到癌症数据标准存储库(caDSR)中。使用代码生成组件自动生成系统软件,该组件生成在应用服务器上运行的中间件。caCORE SDK最初使用一个七级UML模型进行测试和验证,并已用于生成caCORE生产系统,该系统包括具有数十个类的模型。部署的系统支持通过面向对象的API进行访问,这些API具有一致的语法,可用于检索原始UML模型中所有类的任何类型的数据对象。目前,包括癌症生物医学信息学网格(caBIG)计划的参与者在内的几个开发团队正在使用caCORE SDK来创建兼容的数据服务。caBIG兼容性标准基于caCORE资源,因此caCORE SDK已成为caBIG的关键支持技术。
caCORE SDK通过提供标准化和加速建模、开发和部署的工作流程及自动化工具,大幅降低了实现语法和语义可互操作系统的障碍。它已在caBIG计划的开发人员中获得认可,并有望为正在开发的数据网格上创建数据服务节点提供一种通用机制。