Mora Oscar, Engelbrecht Gerhard, Bisbal Jesus
IEEE Trans Inf Technol Biomed. 2012 Nov;16(6):1296-303. doi: 10.1109/TITB.2012.2215045. Epub 2012 Aug 23.
Biomedical research continuously generates large amounts of heterogeneous and multimodal data spread over multiple data sources. These data, if appropriately shared and exploited, could dramatically improve the research practice itself, and ultimately the quality of health care delivered. This paper presents DISMED (DIstributed Semantic MEDiator), an open source semantic mediator that provides a unified view of a federated environment of multiscale biomedical data sources. DISMED is a Web-based software application to query and retrieve information distributed over a set of registered data sources, using semantic technologies. It also offers a userfriendly interface specifically designed to simplify the usage of these technologies by non-expert users. Although the architecture of the software mediator is generic and domain independent, in the context of this paper, DISMED has been evaluated for managing biomedical environments and facilitating research with respect to the handling of scientific data distributed in multiple heterogeneous data sources. As part of this contribution, a quantitative evaluation framework has been developed. It consist of a benchmarking scenario and the definition of five realistic use-cases. This framework, created entirely with public datasets, has been used to compare the performance of DISMED against other available mediators. It is also available to the scientific community in order to evaluate progress in the domain of semantic mediation, in a systematic and comparable manner. The results show an average improvement in the execution time by DISMED of 55% compared to the second best alternative in four out of the five use-cases of the experimental evaluation.
生物医学研究不断产生大量分布在多个数据源的异构和多模态数据。如果这些数据得到适当共享和利用,可能会极大地改善研究实践本身,并最终提高医疗保健的质量。本文介绍了DISMED(分布式语义中介器),这是一个开源语义中介器,它提供了多尺度生物医学数据源联邦环境的统一视图。DISMED是一个基于Web的软件应用程序,用于使用语义技术查询和检索分布在一组注册数据源中的信息。它还提供了一个用户友好的界面,专门设计用于简化非专业用户对这些技术的使用。尽管软件中介器的架构是通用的且与领域无关,但在本文的背景下,已对DISMED进行了评估,以管理生物医学环境并促进在处理分布于多个异构数据源中的科学数据方面的研究。作为这项贡献的一部分,开发了一个定量评估框架。它由一个基准测试场景和五个实际用例的定义组成。这个完全由公共数据集创建的框架已用于比较DISMED与其他可用中介器的性能。它也可供科学界使用,以便以系统和可比的方式评估语义中介领域的进展。结果表明,在实验评估的五个用例中的四个中,与第二好的替代方案相比,DISMED的执行时间平均提高了55%。