BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-13-S1-S4.
Because of the increasing number of electronic resources, designing efficient tools to retrieve and exploit them is a major challenge. Semantic Web technologies and applications based on domain ontologies have offered some improvements. In the life sciences, for instance, the Gene Ontology is widely exploited in genomic applications, and the Medical Subject Headings (MeSH) thesaurus is the basis of the indexing of biomedical publications and of the information retrieval process offered by PubMed. However, current search engines suffer from two main drawbacks: user interaction with the list of retrieved resources is limited, and no explanation is given of why those resources fit the query. Users may thus be confused by the selection and have no idea how to adapt their queries so that the results match their expectations.
This paper describes an information retrieval system that relies on a domain ontology to widen the set of relevant documents retrieved and that uses a graphical rendering of query results to encourage user interaction. Semantic proximities between ontology concepts, combined through aggregation models, are used to assess the adequacy of each document with respect to a query. The selected documents are displayed in a semantic map whose graphical indications make explicit to what extent each document matches the user's query. This human-machine interface supports a more interactive and iterative exploration of the data corpus by making it easier to weight query concepts and by providing a visual explanation of the results. We illustrate the benefit of this information retrieval system with two case studies, one of which aims at collecting human genes related to transcription factors involved in the hemopoiesis pathway.
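The scoring idea in the paragraph above (concept-to-concept proximity, then aggregation over the query concepts) can be sketched as follows. This is only an illustration under stated assumptions, not the method used by the system: the toy ontology, the depth-based proximity (a Wu-Palmer-style stand-in), the "best annotation, then weighted mean" aggregation, and all function names are hypothetical choices made for this example.

```python
from typing import Dict, List, Optional

# Hypothetical toy ontology: each concept maps to its parent (root -> None).
PARENT: Dict[str, Optional[str]] = {
    "process": None,
    "transcription": "process",
    "regulation_of_transcription": "transcription",
    "development": "process",
    "hemopoiesis": "development",
}


def ancestors(concept: str) -> List[str]:
    """Path from the concept up to the root, starting with the concept itself."""
    path = []
    node: Optional[str] = concept
    while node is not None:
        path.append(node)
        node = PARENT[node]
    return path


def depth(concept: str) -> int:
    """Depth in the tree, counting the root as depth 1."""
    return len(ancestors(concept))


def proximity(a: str, b: str) -> float:
    """Assumed stand-in measure: 2 * depth(LCA) / (depth(a) + depth(b))."""
    ancestors_of_b = set(ancestors(b))
    # The first ancestor of `a` that is also an ancestor of `b` is the
    # lowest common ancestor, because the tree has a single root.
    lca = next(c for c in ancestors(a) if c in ancestors_of_b)
    return 2 * depth(lca) / (depth(a) + depth(b))


def score_document(query_weights: Dict[str, float], annotations: List[str]) -> float:
    """Assumed aggregation: for each query concept keep its best-matching
    annotation, then take the weighted mean over the query concepts."""
    total_weight = sum(query_weights.values())
    if not annotations or total_weight == 0:
        return 0.0
    weighted_sum = sum(
        weight * max(proximity(concept, ann) for ann in annotations)
        for concept, weight in query_weights.items()
    )
    return weighted_sum / total_weight


def rank(query_weights: Dict[str, float], corpus: Dict[str, List[str]]) -> List[str]:
    """Document identifiers sorted from best to worst score."""
    return sorted(
        corpus,
        key=lambda doc_id: score_document(query_weights, corpus[doc_id]),
        reverse=True,
    )
```

In this sketch, raising the weight of one query concept shifts the ranking toward documents annotated near that concept, which is the kind of adjustment the interface is described as making easier. A document annotated only with close relatives of the query concepts still receives a non-zero score, which is how an ontology can widen the retrieved set beyond exact matches.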
The ontology-based information retrieval system described in this paper (OBIRS) is freely available at: http://www.ontotoolkit.mines-ales.fr/ObirsClient/. This environment is a first step towards a user-centred application in which the system highlights relevant information to support decision making.