Guardia Gabriela D A, Pires Luís Ferreira, Vêncio Ricardo Z N, Malmegrim Kelen C R, de Farias Cléver R G
Department of Computer Science and Mathematics-Faculty of Philosophy, Sciences and Letters of Ribeirão Preto (FFCLRP)-University of São Paulo (USP), Ribeirão Preto, Brazil.
Faculty of Electrical Engineering, Mathematics and Computer Science-University of Twente, Enschede, the Netherlands.
PLoS One. 2015 Jul 24;10(7):e0134011. doi: 10.1371/journal.pone.0134011. eCollection 2015.
Gene expression studies are generally performed through multi-step analysis processes, which require the integrated use of a number of analysis tools. In order to facilitate tool/data integration, an increasing number of analysis tools have been developed as or adapted to semantic web services. In recent years, some approaches have been defined for the development and semantic annotation of web services created from legacy software tools, but these approaches still present many limitations. In addition, to the best of our knowledge, no suitable approach has been defined for the functional genomics domain. Therefore, this paper aims at defining an integrated methodology for the implementation of RESTful semantic web services created from gene expression analysis tools and the semantic annotation of such services. We have applied our methodology to the development of a number of services to support the analysis of different types of gene expression data, including microarray and RNASeq. All developed services are publicly available in the Gene Expression Analysis Services (GEAS) Repository at http://dcm.ffclrp.usp.br/lssb/geas. Additionally, we have used a number of the developed services to create different integrated analysis scenarios to reproduce parts of two gene expression studies documented in the literature. The first study involves the analysis of one-color microarray data obtained from multiple sclerosis patients and healthy donors. The second study comprises the analysis of RNA-Seq data obtained from melanoma cells to investigate the role of the remodeller BRG1 in the proliferation and morphology of these cells. Our methodology provides concrete guidelines and technical details in order to facilitate the systematic development of semantic web services. Moreover, it encourages the development and reuse of these services for the creation of semantically integrated solutions for gene expression analysis.
基因表达研究通常通过多步骤分析过程来进行,这需要综合使用多种分析工具。为了促进工具/数据整合,越来越多的分析工具已被开发成语义网服务或适配为语义网服务。近年来,已经定义了一些方法来开发和语义注释从遗留软件工具创建的网络服务,但这些方法仍然存在许多局限性。此外,据我们所知,尚未为功能基因组学领域定义合适的方法。因此,本文旨在定义一种集成方法,用于实现从基因表达分析工具创建的RESTful语义网服务以及此类服务的语义注释。我们已将我们的方法应用于开发许多服务,以支持对不同类型基因表达数据的分析,包括微阵列和RNA测序。所有开发的服务均可在http://dcm.ffclrp.usp.br/lssb/geas的基因表达分析服务(GEAS)存储库中公开获取。此外,我们使用了许多开发的服务来创建不同的综合分析场景,以重现文献中记录的两项基因表达研究的部分内容。第一项研究涉及对从多发性硬化症患者和健康供体获得的单色微阵列数据的分析。第二项研究包括对从黑色素瘤细胞获得的RNA测序数据的分析,以研究重塑因子BRG1在这些细胞增殖和形态中的作用。我们的方法提供了具体的指导方针和技术细节,以促进语义网服务的系统开发。此外,它鼓励开发和重用这些服务,以创建用于基因表达分析的语义集成解决方案。