Department of Pediatrics, University of Chicago, Chicago, IL.
AMIA Annu Symp Proc. 2024 Jan 11;2023:874-883. eCollection 2023.
The Pediatric Cancer Data Commons (PCDC) comprises an international community whose commitment to data sharing is combating pediatric cancer in an unprecedented way. A byproduct of its data sharing efforts is a gold-standard consensus data model covering many types of pediatric cancer. This article describes an effort to use SSSOM, an emerging specification for semantically rich data mappings, to provide a "hub and spoke" model of mappings from several common data models (CDMs) to the PCDC data model. The work makes two contributions to the research community: 1) a clear view of the current coverage of these CDMs in the domain of pediatric oncology, and 2) a demonstration of how to create standardized mappings. These mappings enable downstream crosswalks for data transformation and enhance data sharing. The approach can guide those who currently create and maintain brittle ad hoc data mappings in order to use the growing volume of viable research data.
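The "hub and spoke" idea can be illustrated with a minimal sketch. It is not the authors' implementation: the CURIEs (`cdm_a:…`, `cdm_b:…`, `hub:…`), the hub term list, and the helper functions are all hypothetical, and the table omits the metadata header that a full SSSOM file would normally carry. Only the column names and the `skos:`/`semapv:` values follow the SSSOM conventions. Assuming each CDM element is mapped once to the hub model, the sketch estimates per-CDM coverage of the hub and derives a CDM-to-CDM crosswalk by joining on the shared hub term.

```python
import csv
import io
from collections import defaultdict

# Hypothetical SSSOM-style mapping rows (tab-separated); every identifier is invented.
SSSOM_TSV = (
    "subject_id\tpredicate_id\tobject_id\tmapping_justification\n"
    "cdm_a:sex\tskos:exactMatch\thub:sex\tsemapv:ManualMappingCuration\n"
    "cdm_b:gender\tskos:exactMatch\thub:sex\tsemapv:ManualMappingCuration\n"
    "cdm_a:dx_date\tskos:exactMatch\thub:diagnosis_date\tsemapv:ManualMappingCuration\n"
    "cdm_b:condition\tskos:broadMatch\thub:histology\tsemapv:ManualMappingCuration\n"
)

# Hypothetical set of terms in the hub (consensus) data model.
HUB_TERMS = {"hub:sex", "hub:diagnosis_date", "hub:histology", "hub:stage"}


def load_mappings(text):
    """Parse tab-separated mapping rows into a list of dicts."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def source_of(curie):
    """Return the prefix of a CURIE, used here to identify the spoke (CDM)."""
    return curie.split(":", 1)[0]


def coverage(mappings, hub_terms):
    """Fraction of hub terms that each spoke maps to, under any predicate."""
    covered = defaultdict(set)
    for m in mappings:
        if m["object_id"] in hub_terms:
            covered[source_of(m["subject_id"])].add(m["object_id"])
    return {src: len(terms) / len(hub_terms) for src, terms in covered.items()}


def crosswalk(mappings, source, target):
    """Pair source and target elements that exactly match the same hub term.

    Restricting to skos:exactMatch is a simplifying assumption of this sketch.
    """
    by_hub = defaultdict(lambda: defaultdict(list))
    for m in mappings:
        if m["predicate_id"] == "skos:exactMatch":
            by_hub[m["object_id"]][source_of(m["subject_id"])].append(m["subject_id"])
    pairs = []
    for hub_term, by_src in sorted(by_hub.items()):
        for s in by_src.get(source, []):
            for t in by_src.get(target, []):
                pairs.append((s, hub_term, t))
    return pairs


mappings = load_mappings(SSSOM_TSV)
print(coverage(mappings, HUB_TERMS))
print(crosswalk(mappings, "cdm_a", "cdm_b"))
```

With n spokes, this arrangement needs only n mapping sets to the hub rather than a separate set for every pair of CDMs, which is one reason a hub model can be less brittle than ad hoc pairwise mappings.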