Cejovic Jovan, Radenkovic Jelena, Mladenovic Vladimir, Stanojevic Adam, Miletic Milica, Radanovic Stevan, Bajcic Dragan, Djordjevic Dragan, Jelic Filip, Nesic Milos, Lau Jessica, Grady Patrick, Groves-Kirkby Nick, Kural Deniz, Davis-Dusenbery Brandi
Seven Bridges Genomics Inc., Cambridge, MA, USA.
Cancer Inform. 2018 Sep 28;17:1176935118774787. doi: 10.1177/1176935118774787. eCollection 2018.
Increased efforts in cancer genomics research and bioinformatics are producing tremendous amounts of data. These data are diverse in origin, format, and content. As the amount of available sequencing data increases, technologies that make these data discoverable and usable are critically needed. In response, we have developed a Semantic Web-based Data Browser, a tool that allows users to visually build and execute ontology-driven queries. This approach simplifies access to available data and improves the process of using them in analyses on the Seven Bridges Cancer Genomics Cloud (CGC; www.cancergenomicscloud.org). The Data Browser makes large data sets easily explorable and simplifies the retrieval of specific data of interest. Although initially implemented on top of The Cancer Genome Atlas (TCGA) data set, the Data Browser's architecture allows for seamless integration of other data sets. By deploying it on the CGC, we have enabled remote researchers to access data and perform collaborative investigations.