Department of Health Administration and Policy, College of Health and Human Services, George Mason University, Fairfax, Virginia, USA Fox Chase Cancer Center, Temple University Health, Philadelphia, Pennsylvania, USA.
Booz Allen Hamilton, Rockville, Maryland, USA.
J Am Med Inform Assoc. 2014 Jul-Aug;21(4):642-9. doi: 10.1136/amiajnl-2013-001763. Epub 2013 Sep 27.
In an effort to standardize behavioral measures and their data representation, the present study develops a methodology for incorporating measures found in the National Cancer Institute's (NCI) grid-enabled measures (GEM) portal, a repository for behavioral and social measures, into the cancer data standards registry and repository (caDSR).
The methodology consists of four parts for curating GEM measures into the caDSR: (1) develop unified modeling language (UML) models for behavioral measures; (2) create common data elements (CDE) for UML components; (3) bind CDE with concepts from the NCI thesaurus; and (4) register CDE in the caDSR.
UML models have been developed for four GEM measures, which have been registered in the caDSR as CDE. New behavioral concepts related to these measures have been created and incorporated into the NCI thesaurus. Best practices for representing measures using UML models have been utilized in the practice (eg, caDSR). One dataset based on a GEM-curated measure is available for use by other systems and users connected to the grid.
Behavioral and population science data can be standardized by using and extending current standards. A new branch of CDE for behavioral science was developed for the caDSR. It expands the caDSR domain coverage beyond the clinical and biological areas. In addition, missing terms and concepts specific to the behavioral measures addressed in this paper were added to the NCI thesaurus. A methodology was developed and refined for curation of behavioral and population science data.
为了标准化行为测量及其数据表示,本研究开发了一种方法,将国家癌症研究所(NCI)网格启用测量(GEM)门户中发现的测量方法纳入癌症数据标准注册和存储库(caDSR)中,该门户是行为和社会测量的存储库。
该方法包括将 GEM 措施纳入 caDSR 的四个部分:(1)为行为措施开发统一建模语言(UML)模型;(2)为 UML 组件创建通用数据元素(CDE);(3)将 CDE 与 NCI 词库中的概念绑定;(4)在 caDSR 中注册 CDE。
已经为四个 GEM 措施开发了 UML 模型,并作为 CDE 在 caDSR 中注册。与这些措施相关的新行为概念已被创建并纳入 NCI 词库。在实践中(例如 caDSR)已利用了使用 UML 模型表示措施的最佳实践。一个基于 GEM 策划措施的数据集可供与网格连接的其他系统和用户使用。
可以通过使用和扩展当前标准来标准化行为和人群科学数据。caDSR 为行为科学开发了一个新的 CDE 分支。它扩展了 caDSR 的域覆盖范围,超出了临床和生物学领域。此外,还添加了本文中提到的行为措施特有的缺失术语和概念到 NCI 词库中。已经开发并完善了一种用于行为和人群科学数据编纂的方法。