Department of Medical Education and Biomedical Informatics, University of Washington, 1959 NE Pacific St., HSB I-264, Seattle, WA 98195-7240, USA.
J Biomed Inform. 2010 Dec;43(6):873-82. doi: 10.1016/j.jbi.2010.07.005. Epub 2010 Jul 17.
Though there have been many advances in providing access to linked and integrated biomedical data across repositories, developing methods which allow users to specify ambiguous and exploratory queries over disparate sources remains a challenge to extracting well-curated or diversely-supported biological information. In the following work, we discuss the concepts of data coverage and evidence in the context of integrated sources. We address diverse information retrieval via a simple framework for representing coverage and evidence that operates in parallel with an arbitrary schema, and a language upon which queries on the schema and framework may be executed. We show that this approach is capable of answering questions that require ranged levels of evidence or triangulation, and demonstrate that appropriately-formed queries can significantly improve the level of precision when retrieving well-supported biomedical data.
尽管在提供跨存储库访问链接和集成生物医学数据方面已经取得了许多进展,但开发允许用户在不同来源上指定模糊和探索性查询的方法仍然是提取精心策划或多样化支持的生物信息的挑战。在接下来的工作中,我们将讨论集成来源背景下的数据覆盖范围和证据的概念。我们通过一个简单的覆盖范围和证据表示框架来解决多样化的信息检索问题,该框架与任意模式并行运行,并提供一种可以在模式和框架上执行查询的语言。我们表明,这种方法能够回答需要范围证据或三角测量的问题,并证明适当形成的查询可以在检索支持良好的生物医学数据时显著提高精度水平。