Yin Xiaoshi, Huang Jimmy Xiangji, Li Zhoujun
State Key Laboratory of Software Development Environment, Beihang University, Beijing 100191, China.
Int J Data Min Bioinform. 2012;6(2):115-29. doi: 10.1504/ijdmb.2012.048172.
In this paper, we present a context-sensitive approach to re-ranking retrieved documents for further improving the effectiveness of high-performance biomedical literature retrieval systems. For each topic, a two-dimensional positive context is learnt from the top N retrieved documents and a group of negative contexts are learnt from the last N' documents in initial retrieval ranked list. The contextual space contains lexical context and conceptual context. The probabilities that retrieved documents are generated within the contextual space are then computed for document re-ranking. Empirical evaluation on the TREC Genomics full-text collection and three high-performance biomedical literature retrieval runs demonstrates that the context-sensitive re-ranking approach yields better retrieval performance.
在本文中,我们提出了一种上下文敏感方法,用于对检索到的文档进行重新排序,以进一步提高高性能生物医学文献检索系统的有效性。对于每个主题,从排名靠前的N篇检索文档中学习二维正向上下文,并从初始检索排序列表中的最后N'篇文档中学习一组负向上下文。上下文空间包含词汇上下文和概念上下文。然后计算检索到的文档在上下文空间内生成的概率,用于文档重新排序。对TREC基因组学全文集和三次高性能生物医学文献检索运行的实证评估表明,上下文敏感重新排序方法产生了更好的检索性能。