NASA Jet Propulsion Laboratory, Pasadena, CA, USA.
Cancer Biomark. 2010;9(1-6):511-30. doi: 10.3233/CBM-2011-0180.
Capturing, sharing, and publishing cancer biomarker research data are all fundamental challenges of enabling new opportunities to research and understand scientific data. Informatics experts from the National Cancer Institute's (NCI) Early Detection Research Network (EDRN) have pioneered a principled informatics infrastructure to capture and disseminate data from biomarker validation studies, in effect, providing a national-scale, real-world successful example of how to address these challenges. EDRN is a distributed, collaborative network and it requires its infrastructure to support research across cancer research institutions and across their individual laboratories. The EDRN informatics infrastructure is also referred to as the EDRN Knowledge Environment, or EKE. EKE connects information about biomarkers, studies, specimens and resulting scientific data, allowing users to search, download and compare each of these disparate sources of cancer research information. EKE's data is enriched by providing annotations that describe the research results (biomarkers, protocols, studies) and that link the research results to the captured information within EDRN (raw instrument datasets, specimens, etc.). In addition EKE provides external links to public resources related to the research results and captured data. EKE has leveraged and reused data management software technologies originally developed for planetary and earth science research results and has infused those capabilities into biomarker research. This paper will describe the EDRN Knowledge Environment, its deployment to the EDRN enterprise, and how a number of these challenges have been addressed through the capture and curation of biomarker data results.
捕获、共享和发布癌症生物标志物研究数据是实现新的研究和理解科学数据机会的基本挑战。美国国家癌症研究所 (NCI) 的早期检测研究网络 (EDRN) 的信息学专家率先开发了一种有原则的信息学基础设施,用于捕获和传播生物标志物验证研究的数据,实际上提供了一个全国范围内的真实成功案例,说明了如何应对这些挑战。EDRN 是一个分布式协作网络,它需要其基础设施来支持跨癌症研究机构和其各自实验室的研究。EDRN 信息学基础设施也被称为 EDRN 知识环境或 EKE。EKE 连接有关生物标志物、研究、标本和相关科学数据的信息,允许用户搜索、下载和比较癌症研究信息的这些不同来源。通过提供描述研究结果(生物标志物、方案、研究)的注释,并将研究结果链接到 EDRN 中捕获的信息(原始仪器数据集、标本等),EKE 的数据得到了丰富。此外,EKE 还提供了与研究结果和捕获数据相关的公共资源的外部链接。EKE 利用并重新使用了最初为行星和地球科学研究结果开发的数据管理软件技术,并将这些功能融入到生物标志物研究中。本文将描述 EDRN 知识环境、它在 EDRN 企业中的部署,以及通过捕获和策生物标志物数据结果来解决这些挑战的方法。