Department of Computer Science & Applied Statistics, University of New Brunswick, Saint John, New Brunswick, E2L 4L5, Canada.
BMC Bioinformatics. 2011;12 Suppl 4(Suppl 4):S6. doi: 10.1186/1471-2105-12-S4-S6. Epub 2011 Jul 5.
Mutation impact extraction is an important task designed to harvest relevant annotations from scientific documents for reuse in multiple contexts. Our previous work on text mining for mutation impacts resulted in (i) the development of a GATE-based pipeline that mines texts for information about impacts of mutations on proteins, (ii) the population of this information into our OWL DL mutation impact ontology, and (iii) establishing an experimental semantic database for storing the results of text mining.
This article explores the possibility of using the SADI framework as a medium for publishing our mutation impact software and data. SADI is a set of conventions for creating web services with semantic descriptions that facilitate automatic discovery and orchestration. We describe a case study exploring and demonstrating the utility of the SADI approach in our context. We describe several SADI services we created based on our text mining API and data, and demonstrate how they can be used in a number of biologically meaningful scenarios through a SPARQL interface (SHARE) to SADI services. In all cases we pay special attention to the integration of mutation impact services with external SADI services providing information about related biological entities, such as proteins, pathways, and drugs.
We have identified that SADI provides an effective way of exposing our mutation impact data such that it can be leveraged by a variety of stakeholders in multiple use cases. The solutions we provide for our use cases can serve as examples to potential SADI adopters trying to solve similar integration problems.
突变影响提取是一项重要的任务,旨在从科学文献中提取相关注释,以便在多个上下文中重复使用。我们之前在突变影响的文本挖掘方面的工作导致了 (i) 开发了一个基于 GATE 的管道,用于从文本中挖掘有关突变对蛋白质影响的信息,(ii) 将这些信息填充到我们的 OWL DL 突变影响本体中,以及 (iii) 建立了一个用于存储文本挖掘结果的实验语义数据库。
本文探讨了使用 SADI 框架作为发布我们的突变影响软件和数据的媒介的可能性。SADI 是一组用于创建具有语义描述的 Web 服务的约定,这些约定促进了自动发现和编排。我们描述了一个案例研究,探讨并展示了 SADI 方法在我们的上下文中的实用性。我们描述了我们根据文本挖掘 API 和数据创建的几个 SADI 服务,并通过到 SADI 服务的 SPARQL 接口 (SHARE) 展示了它们如何在许多具有生物学意义的场景中使用。在所有情况下,我们特别注意将突变影响服务与提供相关生物实体信息的外部 SADI 服务(如蛋白质、途径和药物)集成。
我们已经确定,SADI 提供了一种有效的方法来公开我们的突变影响数据,以便在多个用例中可以被各种利益相关者利用。我们为用例提供的解决方案可以作为试图解决类似集成问题的潜在 SADI 采用者的示例。