Urbain Jay, Goharian Nazli, Frieder Ophir
Electrical Engineering and Computer Science Department, Milwaukee School of Engineering, 9050 N. Tennyson Dr. Bayside, WI 53217, USA.
Comput Biol Med. 2009 Jan;39(1):61-8. doi: 10.1016/j.compbiomed.2008.11.002. Epub 2009 Jan 15.
We present a dimensional information retrieval model for combining concept-based semantics and term statistics within multiple levels of document context to identify concise, variable length passages of text that answer a user query. Our results demonstrate improved search results in the presence of varying levels of semantic evidence, and higher performance using retrieval functions that combine document, as well as sentence and passage level information. Experimental results are promising. When ranking documents based on the most relevant extracted passages, the results exceed the state-of-the-art by 15.28% as assessed by the TREC 2005 Genomics track collection of 4.5 million MEDLINE citations.
我们提出了一种维度信息检索模型,该模型可在文档上下文的多个级别内结合基于概念的语义和术语统计信息,以识别能够回答用户查询的简洁、可变长度的文本段落。我们的结果表明,在存在不同程度语义证据的情况下,搜索结果得到了改善,并且使用结合文档以及句子和段落级别信息的检索函数时性能更高。实验结果很有前景。在根据最相关的提取段落对文档进行排名时,根据2005年TREC基因组学跟踪收集的450万篇MEDLINE引用文献评估,结果比现有技术高出15.28%。