CSE Department, POSTECH, Pohang, South Korea.
BMC Bioinformatics. 2010 Apr 16;11 Suppl 2(Suppl 2):S6. doi: 10.1186/1471-2105-11-S2-S6.
Finding relevant articles from PubMed is challenging because it is hard to express the user's specific intention in the given query interface, and a keyword query typically retrieves a large number of results. Researchers have applied machine learning techniques to find relevant articles by ranking the articles according to the learned relevance function. However, the process of learning and ranking is usually done offline without integrated with the keyword queries, and the users have to provide a large amount of training documents to get a reasonable learning accuracy. This paper proposes a novel multi-level relevance feedback system for PubMed, called RefMed, which supports both ad-hoc keyword queries and a multi-level relevance feedback in real time on PubMed.
RefMed supports a multi-level relevance feedback by using the RankSVM as the learning method, and thus it achieves higher accuracy with less feedback. RefMed "tightly" integrates the RankSVM into RDBMS to support both keyword queries and the multi-level relevance feedback in real time; the tight coupling of the RankSVM and DBMS substantially improves the processing time. An efficient parameter selection method for the RankSVM is also proposed, which tunes the RankSVM parameter without performing validation. Thereby, RefMed achieves a high learning accuracy in real time without performing a validation process. RefMed is accessible at http://dm.postech.ac.kr/refmed.
RefMed is the first multi-level relevance feedback system for PubMed, which achieves a high accuracy with less feedback. It effectively learns an accurate relevance function from the user's feedback and efficiently processes the function to return relevant articles in real time.
在 PubMed 中查找相关文章具有挑战性,因为很难在给定的查询界面中表达用户的特定意图,并且关键字查询通常会检索到大量结果。研究人员已经应用机器学习技术通过根据学习的相关性函数对文章进行排名来查找相关文章。然而,学习和排名的过程通常是离线完成的,而没有与关键字查询集成,用户必须提供大量的训练文档才能获得合理的学习准确性。本文提出了一种新颖的 PubMed 多级相关性反馈系统,称为 RefMed,它支持即席关键字查询和实时的多级相关性反馈。
RefMed 通过使用 RankSVM 作为学习方法来支持多级相关性反馈,从而实现了更高的准确性和更少的反馈。RefMed 将 RankSVM 紧密集成到 RDBMS 中,以支持实时的关键字查询和多级相关性反馈;RankSVM 和 DBMS 的紧密耦合极大地提高了处理时间。还提出了一种有效的 RankSVM 参数选择方法,该方法无需进行验证即可调整 RankSVM 参数。因此,RefMed 无需进行验证过程即可实时实现高学习准确性。RefMed 可在 http://dm.postech.ac.kr/refmed 上访问。
RefMed 是第一个用于 PubMed 的多级相关性反馈系统,它通过较少的反馈实现了较高的准确性。它有效地从用户的反馈中学习准确的相关性函数,并高效地处理该函数以实时返回相关文章。