Data Science Institute, Imperial College London, London, UK.
Clinical Research Centre, University of Surrey, Guildford, UK.
Sci Data. 2019 Aug 13;6(1):149. doi: 10.1038/s41597-019-0156-9.
Biomedical informatics has traditionally adopted a linear view of the informatics process (collect, store and analyse) in translational medicine (TM) studies; focusing primarily on the challenges in data integration and analysis. However, a data management challenge presents itself with the new lifecycle view of data emphasized by the recent calls for data re-use, long term data preservation, and data sharing. There is currently a lack of dedicated infrastructure focused on the 'manageability' of the data lifecycle in TM research between data collection and analysis. Current community efforts towards establishing a culture for open science prompt the creation of a data custodianship environment for management of TM data assets to support data reuse and reproducibility of research results. Here we present the development of a lifecycle-based methodology to create a metadata management framework based on community driven standards for standardisation, consolidation and integration of TM research data. Based on this framework, we also present the development of a new platform (PlatformTM) focused on managing the lifecycle for translational research data assets.
生物医学信息学在转化医学(TM)研究中传统上采用信息学过程(收集、存储和分析)的线性视图;主要侧重于数据集成和分析方面的挑战。然而,随着数据再利用、长期数据保存和数据共享的新生命周期视图的提出,数据管理方面提出了新的挑战。目前,在 TM 研究中,从数据收集到分析,针对数据生命周期的“可管理性”,缺乏专门的基础设施。当前社区为建立开放科学文化所做的努力促使创建一个数据保管环境,以管理 TM 数据资产,支持数据再利用和研究结果的可重复性。在这里,我们提出了一种基于生命周期的方法的开发,以创建一个基于社区驱动标准的元数据管理框架,用于 TM 研究数据的标准化、整合和集成。基于这个框架,我们还提出了一个新平台(PlatformTM)的开发,该平台专注于管理转化研究数据资产的生命周期。