Department of Biostatistics & Bioinformatics, Moffitt Cancer Center, Tampa, FL.
Health Informatics, Moffitt Cancer Center, Tampa, FL.
JCO Clin Cancer Inform. 2021 May;5:561-569. doi: 10.1200/CCI.20.00175.
The use of genomics within cancer research and clinical oncology practice has become commonplace. Efforts such as The Cancer Genome Atlas have characterized the cancer genome and suggested a wealth of targets for implementing precision medicine strategies for patients with cancer. The data produced from research studies and clinical care have many potential secondary uses beyond their originally intended purpose. Effective storage, query, retrieval, and visualization of these data are essential to create an infrastructure to enable new discoveries in cancer research.
Moffitt Cancer Center implemented a molecular data warehouse to complement the extensive enterprise clinical data warehouse (Health and Research Informatics). Seven different sequencing experiment types were included in the warehouse, with data from institutional research studies and clinical sequencing.
The implementation of the molecular warehouse involved the close collaboration of many teams with different expertise and a use case-focused approach. Cornerstones of project success included project planning, open communication, institutional buy-in, piloting the implementation, implementing custom solutions to address specific problems, data quality improvement, and data governance, unique aspects of which are featured here. We describe our experience in selecting, configuring, and loading molecular data into the molecular data warehouse. Specifically, we developed solutions for heterogeneous genomic sequencing cohorts (many different platforms) and integration with our existing clinical data warehouse.
The implementation was ultimately successful despite challenges encountered, many of which can be generalized to other research cancer centers.
基因组学在癌症研究和临床肿瘤学实践中的应用已经变得很普遍。例如癌症基因组图谱(The Cancer Genome Atlas)等努力已经对癌症基因组进行了特征描述,并为癌症患者实施精准医学策略提供了大量目标。从研究和临床护理中产生的数据除了最初的预期用途之外,还有许多潜在的次要用途。有效存储、查询、检索和可视化这些数据对于建立一个基础设施以促进癌症研究中的新发现至关重要。
莫菲特癌症中心(Moffitt Cancer Center)实施了一个分子数据仓库,以补充广泛的企业临床数据仓库(健康与研究信息学)。该仓库包括七种不同的测序实验类型,数据来自机构研究和临床测序。
分子仓库的实施涉及到许多具有不同专业知识的团队的密切合作,并采用了以用例为中心的方法。项目成功的基石包括项目规划、开放沟通、机构认可、试点实施、实施定制解决方案以解决特定问题、数据质量改进和数据治理,其中的独特方面在此处介绍。我们描述了在将分子数据选择、配置和加载到分子数据仓库中的经验。具体来说,我们针对异构基因组测序队列(许多不同的平台)开发了解决方案,并与我们现有的临床数据仓库进行了集成。
尽管遇到了挑战,但实施最终取得了成功,其中许多挑战可以推广到其他研究癌症中心。